๐ŸŒฑ Open Source โ–พ

๐ŸŒ Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

๐Ÿ”Ž Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.

Reset Search โ†ป
๐Ÿ”Ž
๐ŸŒ

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Choose where to search live data.

Live Results

GitHub Open Source Repositories

Search: spark

Page 1

Showing 10 results from 5,211

S

apache/spark

GitHub Scala Apache License 2.0

Apache Spark - A unified analytics engine for large-scale data processing

โ˜… 43,439 Forks 29,219 apache Updated 11 Jun 2026
D

DataTalksClub/data-engineering-zoomcamp

GitHub Jupyter Notebook

Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here ๐Ÿ‘‡๐Ÿผ

โ˜… 42,315 Forks 8,374 DataTalksClub Updated 11 Jun 2026
D

donnemartin/data-science-ipython-notebooks

GitHub Python Other

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

โ˜… 29,150 Forks 8,030 donnemartin Updated 10 Jun 2026
R

getredash/redash

GitHub Python BSD 2-Clause "Simplified" License

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

โ˜… 28,632 Forks 4,601 getredash Updated 10 Jun 2026
X

dmlc/xgboost

GitHub C++ Apache License 2.0

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

โ˜… 28,461 Forks 8,878 dmlc Updated 11 Jun 2026
D

yeasy/docker_practice

GitHub Go

ๆœ€ๆ–ฐDockerๅฎนๅ™จๆŠ€ๆœฏ๏ผŒไปŽ็œŸๅฎžๆกˆไพ‹ไธญๅญฆไน ๆœ€ไฝณๅฎž่ทต๏ผ| Learn and understand Docker&Container technologies, with real DevOps practice!

โ˜… 26,096 Forks 5,788 yeasy Updated 10 Jun 2026
S

Dujltqzv/Some-Many-Books

GitHub

ไธชไบบๆ”ถ่—ไนฆ็ฑๅˆ—่กจใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€... Read more

โ˜… 20,201 Forks 2,010 Dujltqzv Updated 11 Jun 2026
B

heibaiying/BigData-Notes

GitHub Java

ๅคงๆ•ฐๆฎๅ…ฅ้—จๆŒ‡ๅ— :star:

โ˜… 16,901 Forks 4,285 heibaiying Updated 10 Jun 2026
D

FavioVazquez/ds-cheatsheets

GitHub MIT License

List of Data Science Cheatsheets to rule the world

โ˜… 16,236 Forks 4,055 FavioVazquez Updated 10 Jun 2026
D

apache/doris

GitHub Java Apache License 2.0

Apache Doris is an easy-to-use, high performance and unified analytics database.

โ˜… 15,460 Forks 3,816 apache Updated 11 Jun 2026
Pagination Page 1 of 100

10 results on this page ยท 5,211 total found

Showing first 1,000 accessible GitHub results.