🐼 parser

👇 2 Items

cheerio

29.1k TypeScript MIT

The fast, flexible, and elegant library for parsing and manipulating HTML and XML.

1 13 years ago 15 days ago

MinerU

27.5k Python AGPL-3.0

A high-quality, one-stop open-source data extraction tool for converting PDF to Markdown and JSON.

1 1 year ago 7 days ago