Menu ☰

🌍 Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

πŸ”Ž Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.

Reset Search ↻
πŸ”Ž
🌐

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Choose where to search live data.

Live Results

GitHub Open Source Repositories

Search: Documents

Page 5

Showing 10 results from 11,137

L

run-llama/liteparse

GitHub Rust Apache License 2.0

A fast, helpful, and open-source document parser

β˜… 9,294 Forks 559 run-llama Updated 06 Jun 2026
W

Kozea/WeasyPrint

GitHub Python BSD 3-Clause "New" or "Revised" License

The awesome document factory

β˜… 9,250 Forks 825 Kozea Updated 06 Jun 2026
D

bytedance/Dolphin

GitHub Python Other

The official repo for β€œDolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

β˜… 9,007 Forks 764 bytedance Updated 05 Jun 2026
I

codenotary/immudb

GitHub Go Other

immudb - immutable database based on zero trust, SQL/Key-Value/Document model, tamperproof, data change history

β˜… 8,980 Forks 368 codenotary Updated 06 Jun 2026
D

rednote-hilab/dots.ocr

GitHub Python MIT License

Multilingual Document Layout Parsing in a Single Vision-Language Model

β˜… 8,907 Forks 801 rednote-hilab Updated 06 Jun 2026
P

Future-House/paper-qa

GitHub Python Apache License 2.0

High accuracy RAG for answering questions from scientific documents with citations

β˜… 8,642 Forks 876 Future-House Updated 06 Jun 2026
B

funstory-ai/BabelDOC

GitHub Python GNU Affero General Public License v3.0

Yet Another Document Translator

β˜… 8,631 Forks 700 funstory-ai Updated 06 Jun 2026
P

Hopding/pdf-lib

GitHub TypeScript MIT License

Create and modify PDF documents in any JavaScript environment

β˜… 8,492 Forks 887 Hopding Updated 05 Jun 2026
K

kreuzberg-dev/kreuzberg

GitHub Rust Other

A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno)- or use via CLI, REST AP... Read more

β˜… 8,448 Forks 497 kreuzberg-dev Updated 06 Jun 2026
L

LearningCircuit/local-deep-research

GitHub Python MIT License

~95% on SimpleQA (e.g. Qwen3.6-27B on a 3090). Supports all local and cloud LLMs (llama.cpp, Ollama, Google, ...). 10+ search engines - arXiv, PubMed, your private documents. Everything Local & Encrypted.

β˜… 8,381 Forks 728 LearningCircuit Updated 06 Jun 2026
Pagination Page 5 of 100

10 results on this page Β· 11,137 total found

Showing first 1,000 accessible GitHub results.