🐼 web-scraping

👇 4 个项目

crawlee

18.5k TypeScript Apache-2.0

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

1 8 年前 1 个月前

anything-llm

46.9k JavaScript MIT

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

1 2 年前 1 个月前

maxun

13.5k TypeScript AGPL-3.0

🔥 Open-source no code web data extraction platform. Instantly turn any website into API or spreadsheet 🔥

1 1 年前 9 天前

changedetection.io

26.1k Python Apache-2.0

Best and simplest tool for website change detection, web page monitoring, and website change alerts. Perfect for tracking content changes, price drops, restock alerts, and website defacement monitoring—all for free or enjoy our SaaS plan!

1 4 年前 9 天前