Chinese and English NER datasets, an English-Chinese machine translation dataset, and a Chinese word segmentation dataset.
Updated Feb 3, 2021 - Python
A malware classifier dataset built from the header field values of Portable Executable (PE) files
A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition
jazznet dataset of piano patterns for music audio machine learning research
2D Geometric shapes generator
We currently maintain 488 data sets as a service to the machine learning community. You may view all data sets through our searchable interface. For a general overview of the Repository, please visit our About page. For information about citing data sets in publications, please read our citation policy. If you wish to donate a data set, please c…
SPREAD is a large-scale synthetic dataset for image- and point-cloud-based tasks in forestry.
A duplicate-free variant of the CIFAR test set.
UCLA Dining Hall Menus Dataset
Corpus of Coq code related to MathComp including several machine-readable representations
Extract Japanese characters database.
Classification dataset for distinguishing cat and dog images
OpenFrameworks program that generates training data from font-faces installed on your Mac.
Corpus of manually classified comments for machine learning (filtering offensive comments)
Marktplaats.nl (Dutch Classifieds) Listing Scraper
CSV datasets for ML/AI models from captured network traffic during ZAP scanning with web applications like Django, Flask, React, Vue and Spring - Anti-Nex training datasets
Simple task for mixed image-graph data
Given a product name, the Python program downloads all the images. It also handles pagination.
Generate captchas for ML tasks in parallel.
This repo is the dataset for the paper "A New Dataset and Methodology for Malicious URL Classification"