Code for Machine Learning for Algorithmic Trading, 2nd edition.
Updated Aug 18, 2024 - Jupyter Notebook
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
Open source data anonymization and synthetic data platform for developers. Anonymize your production data and sync it across your environments so that developers can safely use it.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
The easiest tool for fine-tuning LLMs, generating synthetic data, and collaborating on datasets.
A procedural Blender pipeline for photorealistic training image generation
Synthetic data generation for tabular data
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Synthetic Patient Population Simulator
SDG is a specialized framework designed to generate high-quality structured tabular data.
UnrealCV: Connecting Computer Vision to Unreal Engine
Synthetic data generators for tabular and time-series data
The Declarative Data Generator
Conditional GAN for generating synthetic tabular data.
PostgreSQL database anonymization and synthetic data generation tool
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
Synthetic data curation for post-training and structured data extraction
A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactions
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
Curated list of open source tooling for data-centric AI on unstructured data.