๐ŸŒ Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

🔎 Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.


Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Choose where to search live data.

Live Results

GitHub Open Source Repositories

Search: DATAGEN

Page 3

Showing 10 of 55 results

confluentinc/kafka-connect-datagen

GitHub Java Apache License 2.0

Connector that generates data for demos

★ 49 · Forks 90 · confluentinc · Updated 13 May 2026

SFDO-Community-Sprints/DataGenerationToolkit

GitHub JavaScript BSD 3-Clause "New" or "Revised" License

Open Source Community Sprint project focused on building a tool to generate test data.

★ 45 · Forks 21 · SFDO-Community-Sprints · Updated 12 Jun 2026

cliffano/datagen

GitHub JavaScript MIT License

Multi-process test data files generator

★ 40 · Forks 4 · cliffano · Updated 25 Aug 2024

Oqura-ai/deepresearch-datagen-cli

GitHub Python MIT License

Using deep research workflow to generate datasets for finetuning LLMs.

★ 40 · Forks 7 · Oqura-ai · Updated 14 May 2026

lpreterite/datagent

GitHub JavaScript MIT License

A tool for modular management of front-end requests

★ 40 · Forks 2 · lpreterite · Updated 23 Jan 2026

EPFL-ENAC/TOPO-DataGen

GitHub Jupyter Notebook MIT License

[CVPR'22] TOPO-DataGen: an open and scalable aerial synthetic data generation workflow

★ 35 · Forks 1 · EPFL-ENAC · Updated 06 Jun 2026

whx156580/datagenerate

GitHub Python

Generate data

★ 34 · Forks 0 · whx156580 · Updated 12 Jun 2026

Alignment-Lab-AI/datagen

GitHub Python

A pipeline that uses API calls to convert unstructured data into structured training data, agnostic to the source

★ 32 · Forks 2 · Alignment-Lab-AI · Updated 19 Nov 2025

apple/ml-lucid-datagen

GitHub Python Other

No description available from source.

★ 32 · Forks 3 · apple · Updated 02 Jun 2026

xushiyan/kafka-connect-datagen

GitHub Java Apache License 2.0

A Kafka Connect source connector that generates data for tests

★ 29 · Forks 16 · xushiyan · Updated 02 Dec 2024
Pagination Page 3 of 6

10 results on this page · 55 total found

Showing first 55 accessible GitHub results.