๐ Live Open Source Explorer
Explore live open-source projects and AI models.
Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.
๐ Live Search
Search live open-source data
Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.
Live Results
GitHub Open Source Repositories
Search: spark
Page 1
Showing 10 results from 5,211
apache/spark
GitHub Scala Apache License 2.0Apache Spark - A unified analytics engine for large-scale data processing
External source
GitHub
DataTalksClub/data-engineering-zoomcamp
GitHub Jupyter NotebookData Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here ๐๐ผ
External source
GitHub
donnemartin/data-science-ipython-notebooks
GitHub Python OtherData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
External source
GitHub
getredash/redash
GitHub Python BSD 2-Clause "Simplified" LicenseMake Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
External source
GitHub
dmlc/xgboost
GitHub C++ Apache License 2.0Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
External source
GitHub
yeasy/docker_practice
GitHub GoๆๆฐDockerๅฎนๅจๆๆฏ๏ผไป็ๅฎๆกไพไธญๅญฆไน ๆไฝณๅฎ่ทต๏ผ| Learn and understand Docker&Container technologies, with real DevOps practice!
External source
GitHub
Dujltqzv/Some-Many-Books
GitHubไธชไบบๆถ่ไนฆ็ฑๅ่กจใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใใ... Read more
External source
GitHub
heibaiying/BigData-Notes
GitHub Javaๅคงๆฐๆฎๅ ฅ้จๆๅ :star:
External source
GitHub
FavioVazquez/ds-cheatsheets
GitHub MIT LicenseList of Data Science Cheatsheets to rule the world
External source
GitHub
apache/doris
GitHub Java Apache License 2.0Apache Doris is an easy-to-use, high performance and unified analytics database.
External source
GitHub
10 results on this page ยท 5,211 total found
Showing first 1,000 accessible GitHub results.