High-performance open-source synthetic data engine. Uses LLMs for schema design and vectorized NumPy for deterministic, scalable generation.
-
Updated
Jan 15, 2026 - Python
High-performance open-source synthetic data engine. Uses LLMs for schema design and vectorized NumPy for deterministic, scalable generation.
Generate realistic SQL INSERT statements from CSV files with automatic type inference and batch sizing
Add a description, image, and links to the database-seeding topic page so that developers can more easily learn about it.
To associate your repository with the database-seeding topic, visit your repo's landing page and select "manage topics."