The AI-Native Web Data Infrastructure.
Connect LLMs, Agents, and RAG pipelines to the real-world web.
Ecosystem Overview: All core SDKs and flagship agents are production‑ready and fully typed. We continuously ship new integrations, example projects, and AI‑native workflows.
Thordata is not just a proxy provider; we are the data layer for the AI era. We provide the infrastructure that allows developers, data scientists, and AI agents to access public web data reliably, anonymously, and at scale.
With a network of 100M+ Ethical Residential IPs and advanced Web Unlocking technology, we handle the complexity of fingerprints, captchas, and retries so you can focus on the data.
We organize our open-source projects into layers, from core infrastructure to high-level AI agents.
The fundamental building blocks for integrating Thordata into your stack.
| Repository | Language | Description | Status |
|---|---|---|---|
| thordata-python-sdk | Python | 🐍 Flagship SDK. Async support, fully typed, Pandas integration. The standard for data pipelines. | 🟢 Stable |
| thordata-js-sdk | Node.js | 📦 TypeScript. Built for serverless environments and Puppeteer/Playwright control. | 🟢 Stable |
| thordata-go-sdk | Go | 🐹 High Performance. Designed for massive concurrency and enterprise-grade scrapers. | 🟢 Stable |
| thordata-java-sdk | Java | ☕ Enterprise. Thread-safe, rigid implementation for legacy banking/enterprise systems. | 🟢 Stable |
Native protocols to connect Thordata with the modern AI stack.
| Repository | Protocol | Description | Status |
|---|---|---|---|
| thordata-mcp-server | MCP | 🤖 Model Context Protocol implementation. Connect Claude Desktop / OpenAI directly to Thordata tools. | 🔥 NEW |
| thordata-langchain-tools | LangChain | 🦜🔗 Official LangChain Tool definitions. Give your Agents "Browsing" capabilities. | 🟠 Evolving |
| thordata-rag-pipeline | Vector DB | 🧠 End-to-end pipeline: Scrape -> Clean -> Chunk -> Embed. Optimized for RAG. | 🟠 Evolving |
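
The Clean and Chunk stages named in the pipeline row above can be illustrated with a minimal standard-library sketch. This is not the code in `thordata-rag-pipeline`; the function names `clean_html` and `chunk_words`, the word-based chunking, and the default sizes are all assumptions made for illustration. The Scrape and Embed stages are left out because they depend on the fetcher and embedding model you choose.

```python
import re
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects visible text, skipping <script> and <style> contents."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.parts.append(data)


def clean_html(html: str) -> str:
    """Hypothetical Clean step: strip tags and collapse whitespace."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return re.sub(r"\s+", " ", " ".join(parser.parts)).strip()


def chunk_words(text: str, size: int = 200, overlap: int = 20) -> list[str]:
    """Hypothetical Chunk step: fixed-size word windows with overlap."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")
    words = text.split()
    step = size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # the last window already reached the end of the text
    return chunks
```

The overlap keeps a sentence that straddles a chunk boundary partially visible in both chunks, which tends to help retrieval; token-based or sentence-aware splitting may suit a real pipeline better.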
Ready-to-use scraper templates and hands-on guides for high-value targets. Batteries included.
| Repository | Target | Features |
|---|---|---|
| apify-amazon-search-product-scraper | Amazon Search & Product | Multi‑marketplace search & product data, with rating / reviews filters and optional enrichment. |
| how-to-bypass-amazon-captcha-when-scraping | Anti-bot / CAPTCHA | 2026 hands‑on guide to reliably bypass Amazon CAPTCHA in scraping workflows, with free examples and a production‑ready integration using Thordata Web Scraper Tools. |
| how-to-scrape-amazon-product-data-for-free | Amazon Product | Practical tutorial on scraping Amazon product data with requests + BeautifulSoup in 2026, plus an optional upgrade path to Thordata's Amazon Scraper API. |
Full-blown applications and demos showcasing the power of Thordata.
| Repository | Type | Description |
|---|---|---|
| thordata-web-qa-agent | Demo Agent | An AI Agent that searches the web to answer complex questions (Perplexity-style clone). |
| google-play-reviews-rag | Analytics | Sentiment analysis pipeline for Google Play reviews using local LLMs. |
| Repository | Description |
|---|---|
| thordata-proxy-examples | 🍳 "Copy-Paste" Recipes. End-to-end examples of proxy configuration, rotation, and Web Unlocker usage. |
Access the world's most stable proxy network.
- Residential Proxies: 100M+ IPs, real devices, ethical compliance.
- Mobile Proxies: 3G/4G/5G IPs for high-trust mobile app verification.
- ISP Proxies: Static residential IPs for keeping sessions alive.
- Datacenter Proxies: High speed, cost-effective bandwidth.
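
Whichever proxy type you pick, the client-side setup usually comes down to an authenticated proxy URL. The following is a minimal sketch using only the Python standard library; the host `proxy.example.com`, the port, and the credential format are placeholders, not Thordata's actual gateway details, which you should take from your dashboard or the official SDK documentation.

```python
import urllib.request
from urllib.parse import quote


def build_proxy_url(username: str, password: str, host: str, port: int) -> str:
    """Build an authenticated HTTP proxy URL.

    Credentials are percent-encoded so characters such as '@' or ':'
    do not break URL parsing. Host and port here are placeholders.
    """
    user = quote(username, safe="")
    pwd = quote(password, safe="")
    return f"http://{user}:{pwd}@{host}:{port}"


def make_opener(proxy_url: str) -> urllib.request.OpenerDirector:
    """Return an opener that routes HTTP and HTTPS traffic through the proxy."""
    handler = urllib.request.ProxyHandler({"http": proxy_url, "https": proxy_url})
    return urllib.request.build_opener(handler)


# Example (placeholder values; no request is sent until opener.open() is called):
# opener = make_opener(build_proxy_url("user", "secret", "proxy.example.com", 8080))
# html = opener.open("https://example.com", timeout=30).read()
```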
Stop worrying about being blocked.
- Web Unlocker API: A simple API endpoint that automatically handles:
  - Captcha Solving (ReCaptcha, hCaptcha, Cloudflare, etc.)
  - TLS Fingerprint Spoofing
  - JavaScript Rendering
  - Automatic Retries & Rotation
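
To make "automatic retries and rotation" concrete, here is a client-side sketch of the general pattern: cycle through a list of proxies and back off exponentially between failed attempts. It does not describe how the Web Unlocker API works internally. The function `fetch_with_retries`, its parameters, and the caller-supplied `fetch(url, proxy)` callable are hypothetical names introduced for illustration.

```python
import itertools
import time


def fetch_with_retries(fetch, url, proxies, max_attempts=4, base_delay=0.5,
                       sleep=time.sleep):
    """Call fetch(url, proxy), rotating proxies and backing off on failure.

    `fetch` is any caller-supplied callable that raises on failure.
    `sleep` is injectable so the logic can be exercised without real waiting.
    """
    if max_attempts < 1 or not proxies:
        raise ValueError("need at least one attempt and one proxy")
    rotation = itertools.cycle(proxies)
    last_error = None
    for attempt in range(max_attempts):
        proxy = next(rotation)  # use a different proxy on each attempt
        try:
            return fetch(url, proxy)
        except Exception as exc:
            last_error = exc
            if attempt < max_attempts - 1:
                # exponential backoff: base_delay, 2x, 4x, ...
                sleep(base_delay * 2 ** attempt)
    raise RuntimeError(f"all {max_attempts} attempts failed") from last_error
```

A managed endpoint removes the need for this loop in application code; the sketch only shows what is being taken off your hands.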
Run your Puppeteer/Playwright/Selenium scripts on our cloud browsers.
- CDP (Chrome DevTools Protocol) support.
- Scale to thousands of concurrent browsers without managing infrastructure.
This ecosystem is open for contributions!
- All SDKs are licensed under MIT.
- We welcome Pull Requests for bug fixes and new features.
- Please check the `CONTRIBUTING.md` in each repository.