🐼 web-scraping

👇 4 projects

crawlee

18.5k TypeScript Apache-2.0

Crawlee: a web scraping and browser automation library for Node.js for building reliable crawlers in JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP, in both headful and headless mode, with proxy rotation.

9 years ago · 11 months ago

anything-llm

46.9k JavaScript MIT

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

3 years ago · 11 months ago

maxun

13.5k TypeScript AGPL-3.0

🔥 Open-source no code web data extraction platform. Instantly turn any website into API or spreadsheet 🔥

2 years ago · 10 months ago

changedetection.io

26.1k Python Apache-2.0

Best and simplest tool for website change detection, web page monitoring, and website change alerts. Perfect for tracking content changes, price drops, restock alerts, and website defacement monitoring—all for free or enjoy our SaaS plan!

5 years ago · 10 months ago
