📦 A curated list of JSON / BSON datasets from the web in order to practice / use in MongoDB
-
Updated
Jul 5, 2019 - Shell
📦 A curated list of JSON / BSON datasets from the web in order to practice / use in MongoDB
The National Gallery of Art Open Data Program
[ICLR 2024] DNABERT-2: Efficient Foundation Model and Benchmark for Multi-Species Genome
Visual Odometry with Inertial and Depth (VOID) dataset
🐸TTS recipes for different datasets
mirror of VoxCeleb dataset - a large-scale speaker identification dataset
ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET
A spoken question answering dataset on SQUAD
Collections of many datasets you may need and play with.
International Securities Identification Numbers for various Indian Securities
Tracing Versus Freehand for Evaluating Computer-Generated Drawings (SIGGRAPH 2021)
Classes and Metriсs (CaM): a dataset of Java classes from public open-source GitHub repositories
Scripts for preprocessing the CoNLL-2005 SRL dataset.
A dataset of SCP Items, Articles, and Metadata - Updated Daily
A periodically updated list of websites known to be blocked in India on the Airtel Broadband network.
Add a description, image, and links to the dataset topic page so that developers can more easily learn about it.
To associate your repository with the dataset topic, visit your repo's landing page and select "manage topics."