
๐ŸŒ Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

🔎 Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Choose where to search live data.

Live Results

GitHub Open Source Repositories

Search: Documents

Page 1

Showing 10 results from 11,136

microsoft/markitdown

GitHub Python MIT License

Python tool for converting files and office documents to Markdown.

★ 146,006 Forks 9,997 microsoft Updated 06 Jun 2026

storybookjs/storybook

GitHub TypeScript MIT License

Storybook is the industry standard workshop for building, documenting, and testing UI components in isolation

★ 90,212 Forks 10,124 storybookjs Updated 06 Jun 2026

PaddlePaddle/PaddleOCR

GitHub Python Apache License 2.0

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

★ 80,791 Forks 10,644 PaddlePaddle Updated 06 Jun 2026

redis/redis

GitHub C Other

For developers, who are building real-time data-driven applications, Redis is the preferred, fastest, and most feature-rich cache, data structure server, and document and vector query engine.

★ 74,718 Forks 24,654 redis Updated 06 Jun 2026

opendatalab/MinerU

GitHub Python Other

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

★ 66,645 Forks 5,616 opendatalab Updated 06 Jun 2026

docling-project/docling

GitHub Python MIT License

Get your documents ready for gen AI

★ 61,064 Forks 4,263 docling-project Updated 06 Jun 2026

zylon-ai/private-gpt

GitHub Python Apache License 2.0

Interact with your documents using the power of GPT, 100% privately, no data leaks

★ 57,206 Forks 7,605 zylon-ai Updated 06 Jun 2026

run-llama/llama_index

GitHub Python MIT License

LlamaIndex is the leading document agent and OCR platform

★ 49,948 Forks 7,519 run-llama Updated 06 Jun 2026

paperless-ngx/paperless-ngx

GitHub Python GNU General Public License v3.0

A community-supported supercharged document management system: scan, index and archive all your documents

★ 41,934 Forks 2,794 paperless-ngx Updated 06 Jun 2026

carbon-language/carbon-lang

GitHub C++ Other

Carbon Language's main repository: documents, design, implementation, and related tools. (NOTE: Carbon Language is experimental; see README)

★ 33,768 Forks 1,538 carbon-language Updated 06 Jun 2026
Page 1 of 100

10 results on this page · 11,136 total found

Showing first 1,000 accessible GitHub results.
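
The page count above follows from the figures shown here: 11,136 matches were found, but only the first 1,000 are accessible, and each page holds 10 results, giving 100 pages. A minimal sketch of that arithmetic is below. The function names `page_count` and `page_bounds` are hypothetical and are not taken from this site's code; the 1,000-result cap and the page size of 10 are the values stated on this page.

```python
import math

ACCESSIBLE_CAP = 1000  # "first 1,000 accessible GitHub results", as stated on this page
PER_PAGE = 10          # "Every page shows 10 results", as stated on this page


def page_count(total_found: int, per_page: int = PER_PAGE, cap: int = ACCESSIBLE_CAP) -> int:
    """Number of pages that can be shown (hypothetical helper)."""
    accessible = min(total_found, cap)  # results beyond the cap cannot be paged to
    return math.ceil(accessible / per_page)


def page_bounds(page: int, total_found: int, per_page: int = PER_PAGE,
                cap: int = ACCESSIBLE_CAP) -> tuple[int, int]:
    """1-based positions of the first and last result on a page (hypothetical helper)."""
    accessible = min(total_found, cap)
    if page < 1 or page > math.ceil(accessible / per_page):
        raise ValueError("page out of range")
    first = (page - 1) * per_page + 1
    last = min(page * per_page, accessible)  # the final page may be shorter
    return first, last


print(page_count(11136))        # pages available for this search
print(page_bounds(100, 11136))  # result positions on the last page
```

Clamping the total before dividing is what keeps the page total at 100 rather than 1,114 for this search.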