🌱 Open Source β–Ύ

🌍 Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

πŸ”Ž Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.

Reset Search ↻
πŸ”Ž
🌐

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Choose where to search live data.

Live Results

GitHub Open Source Repositories

Search: DATA-AI

Page 2

Showing 10 results from 2,312

C

apify/crawlee

GitHub TypeScript Apache License 2.0

Crawleeβ€”A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both he... Read more

β˜… 23,759 Forks 1,429 apify Updated 13 Jun 2026
D

huggingface/datasets

GitHub Python Apache License 2.0

πŸ€— The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

β˜… 21,618 Forks 3,248 huggingface Updated 13 Jun 2026
A

airbytehq/airbyte

GitHub Python Other

Open-source data movement for ELT pipelines and AI agents β€” from APIs, databases & files to warehouses, lakes, and AI applications. Both self-hosted and Cloud.

β˜… 21,446 Forks 5,218 airbytehq Updated 13 Jun 2026
T

Avaiga/taipy

GitHub Python Apache License 2.0

Turns Data and AI algorithms into production-ready web applications in no time.

β˜… 19,241 Forks 1,988 Avaiga Updated 12 Jun 2026
D

eosphoros-ai/DB-GPT

GitHub Python MIT License

open-source agentic AI data assistant for the next generation of AI + Data products.

β˜… 18,977 Forks 2,738 eosphoros-ai Updated 12 Jun 2026
M

getmaxun/maxun

GitHub TypeScript GNU Affero General Public License v3.0

πŸ”₯ The open-source no-code platform for web scraping, crawling, search and AI data extraction β€’ Turn websites into structured APIs in minutes πŸ”₯

β˜… 15,868 Forks 1,322 getmaxun Updated 13 Jun 2026
W

Canner/WrenAI

GitHub Python Other

Give AI agents the context to query business data correctly through the open context layer that gives AI agents grounded, governed memory, context, SQL across 20+ data sources, that helps you build agentic GenBI, text-to-sql, dashboards, and agentic analytics.

β˜… 15,524 Forks 1,765 Canner Updated 13 Jun 2026
O

open-metadata/OpenMetadata

GitHub TypeScript Apache License 2.0

The Open Context Layer for Data and AI , OpenMetadata is the open platform for building trusted data context and business semantics for humans, AI assistants, and agents.

β˜… 14,179 Forks 2,160 open-metadata Updated 13 Jun 2026
G

geldata/gel

GitHub Python Apache License 2.0

Gel supercharges Postgres with a modern data model, graph queries, Auth & AI solutions, and much more.

β˜… 14,099 Forks 449 geldata Updated 13 Jun 2026
O

OpenNHP/opennhp

GitHub Go Apache License 2.0

A lightweight, cryptography-powered, open-source toolkit built to enforce Zero Trust security for infrastructure, applications, and data in the AI-driven world.

β˜… 13,797 Forks 2,489 OpenNHP Updated 12 Jun 2026
Pagination Page 2 of 100

10 results on this page Β· 2,312 total found

Showing first 1,000 accessible GitHub results.