๐ŸŒ Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories on GitHub and AI models on Hugging Face. Results are paginated at 10 per page.
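The 10-results-per-page rule above can be sketched as simple arithmetic. This is only an illustration, not the Explorer's actual code: the helper names `page_count` and `page_bounds` are hypothetical, and the only value taken from the page is the page size of 10.

```python
import math

PAGE_SIZE = 10  # stated on the page: 10 results per page


def page_count(total: int, page_size: int = PAGE_SIZE) -> int:
    """Number of pages needed to show `total` results (hypothetical helper)."""
    return math.ceil(total / page_size) if total > 0 else 0


def page_bounds(page: int, total: int, page_size: int = PAGE_SIZE) -> tuple[int, int]:
    """Half-open index range [start, end) of results shown on a 1-based page."""
    last_page = max(page_count(total, page_size), 1)
    if page < 1 or page > last_page:
        raise ValueError(f"page must be between 1 and {last_page}")
    start = (page - 1) * page_size
    end = min(start + page_size, total)
    return start, end


if __name__ == "__main__":
    total = 15  # example total; any non-negative integer works
    print(page_count(total))
    print(page_bounds(1, total))
    print(page_bounds(2, total))
```

With 15 matches, this gives two pages: the first holds results 0 to 9 and the second holds the remaining five.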

🔎 Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.
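For readers who want to reproduce a repository search like the one described above outside the Explorer, here is a minimal sketch that builds a request URL for GitHub's public REST search endpoint (`/search/repositories`). The page does not say how it queries GitHub, so the endpoint choice, the sort-by-stars ordering, and the function name `build_repo_search_url` are all assumptions made for illustration. The sketch only constructs the URL and sends no request.

```python
from urllib.parse import urlencode

# GitHub's public REST endpoint for repository search.
GITHUB_SEARCH_URL = "https://api.github.com/search/repositories"


def build_repo_search_url(keyword: str, page: int = 1, per_page: int = 10) -> str:
    """Build a search URL for one page of results (hypothetical helper)."""
    params = {
        "q": keyword,          # search keyword, e.g. "DataGenerator"
        "sort": "stars",       # assumption: rank by star count
        "order": "desc",       # assumption: most-starred first
        "per_page": per_page,  # 10 results per page, as the page states
        "page": page,          # 1-based page number
    }
    return f"{GITHUB_SEARCH_URL}?{urlencode(params)}"


if __name__ == "__main__":
    print(build_repo_search_url("DataGenerator"))
```

Fetching the URL with any HTTP client returns JSON; unauthenticated requests to this endpoint are rate-limited, so a real integration would likely add a token.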

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Live Results

GitHub Open Source Repositories

Search: DataGenerator

Page 1

Showing 10 of 15 results

Data-Centric-AI-Community/fg-data-synthetic

GitHub · Jupyter Notebook · MIT License

Synthetic data generators for tabular and time-series data

★ 1,642 · Forks 258 · Data-Centric-AI-Community · Updated 12 Jun 2026

databrickslabs/dbldatagen

GitHub · Python · Other license

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines

★ 475 · Forks 97 · databrickslabs · Updated 13 Jun 2026

FINRAOS/DataGenerator

GitHub · Java · Apache License 2.0

DataGenerator is a Java library for systematically producing large volumes of data. DataGenerator frames data production as a modeling problem, with a user providing a model of dependencies among variables and the library traversing the model to produce relevant data sets.

★ 164 · Forks 165 · FINRAOS · Updated 25 May 2026

mmmmaomao/DataGenerator

GitHub · Python

No description available from source.

★ 55 · Forks 13 · mmmmaomao · Updated 27 Apr 2026

loresoft/DataGenerator

GitHub · C# · MIT License

Automatically generate data for an object

★ 50 · Forks 22 · loresoft · Updated 01 Apr 2026

gimnathperera/mocktopus

GitHub · TypeScript · MIT License

🐙 Generate mock data effortlessly from TypeScript interfaces in a web application powered by Next.js and NextUI.

★ 42 · Forks 0 · gimnathperera · Updated 24 Nov 2025

ruslanlap/PowerToysRun-RandomGen

GitHub · C# · MIT License

🎲 RandomGen for PowerToys Run: generate random data instantly with a single keystroke

★ 29 · Forks 0 · ruslanlap · Updated 12 Jun 2026

jehumtine/synthetic_data_generator

GitHub · Python

This script is designed to convert bodies of text into a question and answer JSON format using the GPT-4 language model. The process involves extracting text from PDF files, tokenizing the text, generating questions and answers, and then saving the results in a JSON file.

★ 24 · Forks 4 · jehumtine · Updated 12 Nov 2025

RWaltersMA/StockPriceGenerator

GitHub · Python

Python application to write stock security data to a MongoDB Cluster. Supports a variable amount of stocks, variable amount of time and can write to a MongoDB time-series collection

★ 23 · Forks 7 · RWaltersMA · Updated 05 Mar 2024

frischHWC/datagen

GitHub · Java · Apache License 2.0

Datagenerator for Data Services

★ 16 · Forks 6 · frischHWC · Updated 19 Feb 2025

Page 1 of 2

10 results on this page · 15 total found

Showing the first 15 accessible GitHub results.