Skip to content

Releases: huggingface/datasets

1.14.0

19 Oct 16:46
Compare
Choose a tag to compare

Dataset changes

Dataset features

General improvements and bug fixes

1.13.3

15 Oct 15:50
Compare
Choose a tag to compare

Dataset changes

Bug fixes

1.13.2

14 Oct 16:02
Compare
Choose a tag to compare

Bug fixes

1.13.1

14 Oct 12:50
Compare
Choose a tag to compare

Bug fixes

1.13.0

13 Oct 15:15
Compare
Choose a tag to compare

Dataset changes

Metric changes

Dataset features

  • Use with TensorFlow:
  • Better support for ZIP files:
    • Support loading dataset from multiple zipped CSV data files #3021 (@albertvillanova)
    • Load private data files + use glob on ZIP archives for json/csv/etc. module inference #3041 (@lhoestq)
  • Streaming improvements:
    • Extend support for streaming datasets that use glob.glob #3015 (@albertvillanova)
    • Add remove_columns to IterableDataset #3030 (@cccntu)
    • All the above ZIP features also work in streaming mode
  • New utilities:
    • Add get_dataset_split_names() to get a dataset config's split names #2906 (@severo)
  • Replace script_version with revision #2933 (@albertvillanova)
    • The script_version parameter in load_dataset is now deprecated, in favor of revision
  • Experimental - Create Audio feature type #2324 (@albertvillanova):
    • It allows to automatically decode audio data (mp3, wav, flac, etc.) when examples are accessed

Dataset cards

Documentation

General improvements and bug fixes

Breaking changes:

  • Due to the big refactoring at #2986, the prepare_module function doesn't support the return_resolved_file_path and return_associated_base_path parameters. As an alternative, you may use the dataset_module_factory instead.

1.12.1

15 Sep 17:45
Compare
Choose a tag to compare

Bug fixes

  • Fix fsspec AbstractFileSystem access #2915 (@pierre-godard)
  • Fix unwanted tqdm bar when accessing examples #2920 (@lhoestq)
  • Fix conversion of multidim arrays in list to arrow #2922 (@lhoestq):
    • this fixes the ArrowInvalid: Can only convert 1-dimensional array values errors

1.12.0

13 Sep 18:35
Compare
Choose a tag to compare

New documentation

  • New documentation structure #2718 (@stevhliu):
    • New: Tutorials
    • New: Hot-to guides
    • New: Conceptual guides
    • Update: Reference

See the new documentation here !

Datasets changes

Datasets features

Dataset streaming - better support for compression:

Metrics changes

Dataset cards

General improvements and bug fixes

1.11.0

30 Jul 14:27
Compare
Choose a tag to compare

Datasets Changes

General improvements and bug fixes

1.10.2

22 Jul 10:08
Compare
Choose a tag to compare

The error message to tell which dataset config name to load was not displayed:

Docstrings:

1.10.1

22 Jul 08:47
Compare
Choose a tag to compare