Menu

🌍 Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

🔎 Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.

Reset Search
🔎
🌐

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Choose where to search live data.

Live Results

GitHub Open Source Repositories

Search: data-collection

Page 91

Showing 10 results from 2,130

E

kylemcdonald/EmbeddingScripts

GitHub Python

Collection of scripts for visualizing high dimensional data with scikit-learn and bh_tsne

★ 34 Forks 4 kylemcdonald Updated 08 Aug 2024
H

Tomotsugu-dev/HumanMoveMouse

GitHub Python MIT License

HumanMoveMouse is a realistic mouse‑movement simulator based on a statistical model trained on 300 samples of real human behavior. It generates natural cursor trajectories with realistic acceleration, deceleration, micro‑jitter, and path curvature—ideal for UI testing, automation, and mouse‑mov... Read more

★ 34 Forks 3 Tomotsugu-dev Updated 03 Jun 2026
E

VenkyAdi/EDA---Projects

GitHub Jupyter Notebook

Exploratory Data Analysis (EDA) Projects A collection of EDA projects exploring various datasets to uncover patterns, gain insights, and visualize trends across different industries. Projects include analyses of Amazon Prime content, banking fraud detection, logistics performance, hotel booking t... Read more

★ 34 Forks 16 VenkyAdi Updated 24 May 2026
S

microsoft/SdnDiagnostics

GitHub PowerShell MIT License

SdnDiagnostics is a PowerShell module that is designed to simplify the diagnostic troubleshooting and data collection process when troubleshooting issues related to Microsoft Software Defined Network.

★ 34 Forks 4 microsoft Updated 22 May 2026
L

lanxiang1017/Language-Modeling-on-Tabular-Data-Survey

GitHub

A collection of AWESOME language modeling techniques on tabular data applications.

★ 34 Forks 1 lanxiang1017 Updated 13 May 2026
R

GPT-Laboratory/RAG-LLM-Development-Guidebook-from-PDFs

GitHub Python MIT License

Code for building specialized RAG systems using PDF documents with OpenAI Assistant API for GPT and LLaMA models, covering the full pipeline from data collection to generation.

★ 34 Forks 5 GPT-Laboratory Updated 26 May 2026
P

PhilipYip1988/python-tutorials

GitHub Python

Python tutorials in markdown format. These tutorials look at installation on Python and Python IDEs, object orientated programming, the object orientated design pattern known as the Python data model, the concept of inheritance and how the data model is extended for text, numeric and collection b... Read more

★ 34 Forks 6 PhilipYip1988 Updated 05 Jun 2026
3

katya201165/3D-designe

GitHub

A collection of data visualization examples using D3.js. This repository includes various charts and graphs to illustrate data insights and patterns.

★ 34 Forks 0 katya201165 Updated 05 May 2026
C

manishkumar8312/CS-Fundamentals

GitHub MIT License

CS-Fundamentals is a structured collection of core Computer Science subjects including Operating Systems, Theory of Computation, Data Structures, and related topics, designed for B.Tech CSE students and technical interview revision.

★ 34 Forks 3 manishkumar8312 Updated 03 Jun 2026
Z

SakuraPuare/ZhiHu_Spider

GitHub Python

知乎内容爬虫 | Web scraper for Zhihu content extraction

★ 34 Forks 7 SakuraPuare Updated 08 Mar 2026
Pagination Page 91 of 100

10 results on this page · 2,130 total found

Showing first 1,000 accessible GitHub results.