mlr3oml 0.9.0-9000

  • Bugfix: Caching of parquet files failed when the temporary directory was on a different partition than the cache directory.
  • feat: Add option mlr3oml.retries to control the number of retries when downloading data from OpenML. The default is 3.
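
As a minimal sketch, the retry option can be set like any other R option through base R's options(). The value 5 is an arbitrary illustration, not a recommendation from the package authors.

```r
# Raise the number of download retries for the current R session.
# The default stated above is 3; 5 is an arbitrary example value.
options(mlr3oml.retries = 5L)

# Inspect the value that subsequent downloads would read.
getOption("mlr3oml.retries")
```

Placing the options() call in a project-level .Rprofile would apply it to every session of that project.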

mlr3oml 0.9.0

  • Fix: Parquet datasets now work where columns simultaneously have to be renamed and converted.
  • Added upload functions:
    • publish_data to upload a dataset to OpenML
    • publish_task to create a task on OpenML
    • publish_collection to create a collection on OpenML

mlr3oml 0.8.0

  • Listing functions don't return the tables invisibly anymore.
  • Address CRAN NOTE regarding unused bit64 import.
  • Improved the printer for all OpenML objects.
  • Removed benchmark_grid_oml(), which was already deprecated in release 0.7.2.
  • Removed the fields runs, flows, data, tasks from the OMLCollection class. Consequently, the cache option can no longer be set for OMLCollection objects, see the class documentation for more information.
  • Removed the examples, as they caused problems with CRAN checks when OpenML was unavailable.
  • Caching can no longer be specified at the instance level, only globally through the option mlr3oml.cache.
  • Added $download() method for all OML objects to fully download an object for offline usage.
  • Incremented the cache version for parquet data due to a change in OpenML.
  • Added an online tutorial for the package.

mlr3oml 0.7.2

  • Fix: target is added to features when converting an OMLData object to a task with an explicit target variable that is not the default target.
  • Deprecated benchmark_grid_oml() in favour of mlr3::benchmark_grid(..., paired = TRUE).
  • Fix: Incremented cache version for data objects for int64 data types (introduced in the previous release).
  • Fix: Incremented cache version for data description and fixed bug, as make.names() was not applied to ignore attributes.
  • Fix: Bug in the task converter (features were sometimes not set correctly).
  • Collection now shows its name in the printer.
  • Better error message when parquet dataset creation fails.
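
The replacement for the deprecated benchmark_grid_oml() might be used as in the sketch below. It assumes the mlr3 package is installed and uses built-in mlr3 tasks rather than OpenML objects so that it runs offline. The task, learner, and resampling choices are illustrative only.

```r
library(mlr3)

# Two tasks, each paired with its own resampling (illustrative choices).
tasks = list(tsk("iris"), tsk("sonar"))
resamplings = list(rsmp("cv", folds = 3), rsmp("holdout"))

# With paired = TRUE, every resampling must already be instantiated
# on the task it belongs to.
resamplings = Map(function(r, t) r$instantiate(t), resamplings, tasks)

# Builds a design with one row per (task, resampling) pair and learner,
# instead of the full cross product of tasks and resamplings.
design = benchmark_grid(
  tasks = tasks,
  learners = list(lrn("classif.featureless")),
  resamplings = resamplings,
  paired = TRUE
)
```

Resamplings obtained from OpenML tasks are typically already tied to a specific dataset, which is presumably why the paired mode serves as the replacement.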

mlr3oml 0.7.1

  • Fixed argument names of S3 method for as_data_backend to comply with new CRAN checks

mlr3oml 0.7.0

  • feature: Add argument task_type to function list_oml_tasks().
  • fix: strings and nominals are distinguished for parquet files
  • docs: Fixed some OpenML links
  • docs: Renamed the docs for OpenML objects
  • Renamed the sugar functions:
    • oml_data() is now odt()
    • oml_task() is now otsk()
    • oml_flow() is now oflw()
    • oml_run() is now orn()
    • oml_collection() is now ocl()
  • Addresses a CRAN issue: examples fail gracefully if OpenML server is busy.

mlr3oml 0.6.0

Features

  • Add R6 classes for OMLCollection, OMLRun, OMLFlow.
  • Added function benchmark_grid_oml that allows for easier creation of benchmark designs from OpenML task-resampling pairs.
  • Added sugar functions oml_flow, oml_data, oml_task, oml_run, oml_collection for all OpenML objects.
  • Conversion from OpenML to mlr3 objects is now only possible with the usual S3 converters as_<object>. This improves consistency by ensuring that the subcomponents of OpenML objects are always OpenML objects and never mlr3 objects.
  • Added more converter functions: as_learner, as_resample_result, as_data_backend, as_benchmark_result.
  • Added support for parquet files that were recently introduced on OpenML. The global option mlr3oml.parquet can be used to enable or disable this. By default it is FALSE. This is implemented via the duckdb backend from mlr3db.
  • Added support for the OpenML test server. It can be enabled globally with the option mlr3oml.test_server or individually for objects. An API key for the test server can be set globally through the environment variable TESTOPENMLAPIKEY or the option mlr3oml.test_api_key.
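
The options named in this release could be set as in the sketch below. It uses only base R and the option and variable names given above; the key string is a placeholder. Later releases may have changed how some of these settings behave.

```r
# Opt in to parquet files (FALSE by default according to the entry above).
options(mlr3oml.parquet = TRUE)

# Route requests to the OpenML test server for the whole session.
options(mlr3oml.test_server = TRUE)

# Provide an API key for the test server; the value is a placeholder.
options(mlr3oml.test_api_key = "<your-test-server-key>")

# Alternative: supply the key through the environment variable instead.
# Sys.setenv(TESTOPENMLAPIKEY = "<your-test-server-key>")
```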

Fixes

  • Removed support for survival tasks, as mlr3proba is no longer on CRAN.
  • OpenML tasks can now also be filtered according to the task type.

Other

  • Implemented an ARFF writer and removed the arff dependency, which also removes "farff" as a choice for the option mlr3oml.arff_parser.
  • Incremented the cache version number due to changes in the cache structure. This will flush the previous cache folder.
  • Simplified the code structure by adding the OMLObject class, from which all other OpenML objects such as OMLData and OMLTask inherit.

mlr3oml 0.5.0

  • Support for downloading survival tasks (via mlr3proba).
  • More functions to list objects from OpenML:
    • list_oml_evaluations()
    • list_oml_flows()
    • list_oml_measures()
    • list_oml_runs()
    • list_oml_setups()

mlr3oml 0.4.3

  • Fixed a bug regarding unquoting fields in ARFF files.

mlr3oml 0.4.2

  • If not set via option mlr3oml.api_key, the API key is retrieved from the environment variable OPENMLAPIKEY.
  • Implemented a retry mechanism as a workaround for temporary connection errors.

mlr3oml 0.4.1

  • Added a heuristic to detect the quote char.

mlr3oml 0.4.0

  • The parsers for ARFF files can now be explicitly selected via option "mlr3oml.arff.parser". Default is the internal parser based on data.table::fread().
  • Improved stability of the internal ARFF parser in case of malformed ARFF files and non-standardized quotes.

mlr3oml 0.3.0

  • The connectors used in mlr_tasks and mlr_resamplings now signal errors of class missingDefaultError if some defaults are not set.
  • Target columns are now automatically converted to the required storage mode during task creation.
  • Removed dependency on orphaned package bibtex.

mlr3oml 0.2.0

  • Support filtering data sets and tasks via data id or task id (#5).
  • Added fallback to RWeka for sparse ARFF files (#6).
  • Fixed import from backports.

mlr3oml 0.1.0

  • Initial release.