
Starred repositories
Apache Nutch is an extensible and scalable web crawler
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.
Transform company lists into actionable data. Upload a CSV to quickly get CEO, funding, products, and more for lead generation and market research.
Prometheus-based Kubernetes Resource Recommendations
MindSQL: A Python Text-to-SQL RAG Library simplifying database interactions. Seamlessly integrates with PostgreSQL, MySQL, SQLite, Snowflake, and BigQuery. Powered by GPT-4 and Llama 2, it enables β¦
SoTA LLM for converting natural language questions to SQL queries
π€ Chat with your SQL database π. Accurate Text-to-SQL Generation via LLMs using RAG π.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on dβ¦
TLS-Spoofing HTTP library, based on requests. Automatically updates JA3 fingerprints.
Extremely fast Query Engine for DataFrames, written in Rust
A python based HTML to text conversion library, command line client and Web service.
Open-source collection of AI agents covering web scraping, discovery, and data extraction. Each agent includes detailed documentation and proven integration patterns.
This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for reβ¦
π₯ Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web without worrying about infrastructure.
Search, create and update Airtable bases, tables, fields, and records using Claude Desktop and MCP (Model Context Protocol) clients
ποΈπ€ Airtable Model Context Protocol Server, for allowing AI systems to interact with your Airtable bases
MCP server to provide Figma layout information to AI coding agents like Cursor
AWS MCP Servers β helping you get the most out of AWS, wherever you use MCP.
A Model Context Protocol server to connect to MongoDB databases and MongoDB Atlas Clusters.
The most powerful MCP Slack Server with no permission requirements, Apps support, multiple transports Stdio and SSE, DMs, Group DMs and smart history fetch logic.
Production ready MCP server with real-time search, extract, map & crawl.