Release v0.1.16rc1 #639

bouthilx · 2021-08-23T18:31:54Z

Experiment Version Control (EVC) will now be disabled by default. When the EVC is disabled, any changes in the experiment will be saved to the DB, overriding the previous version. See configuration doc to enable the EVC: https://orion.readthedocs.io/en/stable/user/config.html#experiment-version-control.

🏗 Enhancements

Feature/filter duplicate in evc @bouthilx (Feature/filter duplicate in evc #630)
Add enable option for EVC, with default = false @bouthilx (Add enable option for EVC, with default = false #626)

🐛 Bug Fixes

Warn only if diffs exists during exp build @bouthilx (Warn only if diffs exists during exp build #638)
Fix lost trials of parents @bouthilx (Fix lost trials of parents #637)
Compute cardinality for loguniform with precision @bouthilx (Compute cardinality for loguniform with precision #635)
Duplicate pending trials from parent/child for exc @bouthilx (Duplicate pending trials from parent/child for exc #631)
Remove space upgrade from DB upgrade @bouthilx (Remove space upgrade from DB upgrade #627)
Allow branching when parent script isn't available @bouthilx (Allow branching when parent script isn't available #625)
fix benchmark ranking and regret visualization @donglinjy (fix benchmark ranking and regret visualization #620)
fix tpe space cardinality @donglinjy (fix tpe space cardinality #619)

📜 Documentation

Change algo documentation @bouthilx (Change algo documentation #623)

Merge back master in develop after release

fix tpe space cardinality

fix benchmark ranking and regret visualization

To simplify documentation of algorithm plugins, they have been moved to separate docs, with only pointers in core documentation. The algorithms class documentation is also reused to avoid rewriting the documentation of the arguments in sphinx.

Why: Suppose we want to branch from a parent running from a different computer, or for which we lost the execution script. The branching should not fail because script is missing, we do not need it because we wont execute trials from the parent anyway. How: Use allow_non_existing_files=True when building the cmdline parser to compare cmdlines of experiments.

The parent may have a script configuration file that is missing at the time of branching. Branching should not fail in such case and rely on the saved content of the configuration file to verify changes.

Change algo documentation

Why: The space upgrade relies on local files if the experiment's search space is defined in a configuration file. Parsing these file during the DB upgrade can break the DB because all experiments may not be executed on the same file system and thus some configuration files may not be present. The space should only be upgraded when the user attempts running an experiment, in which case the configuration file is available. The space does not need to be upgraded during DB upgrade anyway, because experiment built is backward compatible with experiments lacking an explicit space definition in DB (relying on cmdargs and config file to define space at run-time). How: Remove space upgrade from db upgrade script.

…ng_files Allow branching when parent script isn't available

The DB upgrade does not update the space and priors anymore. The are handled anyway at runtime, no need to update them in the DB

The tests for the different EVC options were all inter-dependent. This commit makes them independent, all starting from the same DB state using the same command. This makes it much easier to make modifications in these tests without affecting all following tests.

The EVC is a constant source of confusion for users. It should be disabled by default with warning messages when different versions of experiments are used. Users who wants using advanced features of the EVC would still be able to use it by enabling it. Making it false by default is a breaking change and may cause issues to user currently using the EVC. Based on discussion with users there does not seem to have much usage of the EVC so far so this breaking change should be relatively harmless. Avoiding further confusion by making it disabled by default is worth the breaking change.

Why: If the EVC is not enabled, the consumer should always ignore the code changes. It would not make sense to raise an error between 2 trials because user's script code changed while EVC is disabled. How: If EVC is disabled, force ignore_code_changes to True in Consumer.

Remove space upgrade from DB upgrade

Add enable option for EVC, with default = false

During execution of the experiment the producer verifies that suggested trials do not already exist in parent or children, but race conditions can lead to duplicates. Also, in attempt to solve #576, we will need to duplicate trials that are not completed in parents into executed experiments to allow reserving and executing the trials. This will lead to more potential duplicated trials and raise the important of handling duplicate properly. When fetching from the EVC, we should ignore duplicates from parent or children if the trials are available in current experiment. This will recursively solve the issue during recursive fetch from EVC. This will also simplify the handling of potential duplicates during {naive-}algorithm updates, as there will simply be no duplicates. How: During the call to adaptors, a set of hash is generated from trials of current nodes based on hyperparameter values (ignores experiment id). Any trials from the parents or child that has a hash found in this set of hash will be filtered out. When there is a duplicate, only the trial of the current node is kept. This also applies recursively to call from children experiments to grand-children.

Why: When a dimension is deleted or added, the adaptor should not transform them with a default value of None if there was no default values. This would lead to invalid trials if None is an invalid value of this dimension. How: If the default value is the unique NO_DEFAULT_VALUE object, then the trials should be filtered out.

Feature/filter duplicate in evc

Why: Experiment cannot reserve trial of parent experiment. This is very problematic as non-completed trials of parents cannot be execute anymore unless the environment state is reverted to the one used for parent experiment (ex: resetting code). It should look for executable trials across the EVC tree. Running trials from parent experiments may cause issue if the child experiment has a different script path, different code version or different cmdline call. We should attempt running the trial with the corresponding experiment configuration. It's not clear what to do if it fails. If we simply leave the trial status to interrupted the child experiment will try it again. Another option is to copy the trial to the child experiment and run it with child configuration. If the user checkpointed the trial state with trial.hash_params, the checkpoint will be lost as trial.hash_params will change based on the experiment id. This is safe, protecting users from resuming with a different code version. How: Fetch trials from EVC tree and duplicate any pending trials to current experiment. A hash of the params is used to avoid duplicating trials that are already available in the current experiment.

Max and mean strategies were failing when all trials observed have no valid objectives.

Why: We cannot use python debugger (or pytest.set_trace()) during the execution of the workers with joblib backend. We should have a simple executor backend that is not using multithreading or multi-processing to enable simple debugging. Also, since client's `workon()` helper function does not support parallelism, it should use this simple executor. How: Use functools.partial to wrap submitted functions for future execution.

Why: With loguniform the number of possible values is limited if precision is used. Cardinality computation should account for this otherwise algorithms may get stuck in suggest(). It happened to a user with a prior loguniform(1e-4, 1e-2, precision=2). This gives only 181 possible values. How: If real dimension has precision and prior loguniform, then compute cardinality. There is a problem with transformed space however. A linearized dimension for instance would attempt to compute the cardinality with the linearized bounds. What matters is the smallest cardinality between the transformed space and the original space. The only case where cardinality is smaller in transformed space is when real values are discretized. Therefore, we only compute cardinality of transformed dimensions if transformation lead to integer, otherwise we use the cardinality of the original dimension.

Duplicate pending trials from parent/child for exc

Why: When running an experiment, the parents may have lost trials that are stuck to status reserved. If the user cannot run the parents, it must then use `orion db set` to fix this manually. This should be done automatically instead. How: Loop over the EVC and call `fix_lost_trials` for each experiments. Note that this can increase significantly the cost of the command for large EVC. A ugly hack is used to allow running `fix_lost_trials` on the parents that are in read mode. It would be great to find a better work around...

Compute cardinality for loguniform with precision

A warning about conflicts was always printed when building an experiment. There should be no warning if there are no difference.

Fix lost trials of parents

Warn only if diffs exists during exp build

bouthilx and others added 30 commits May 19, 2021 18:51

Update backward comp test versions

57aee70

Merge pull request #612 from Epistimio/ci/sync_master_back_to_dev

2b09684

Merge back master in develop after release

fix tpe space cardinality

b2cbed4

Merge pull request #619 from donglinjy/tpe-int

5cfa9d6

fix tpe space cardinality

fix benchmark ranking and regret visualization

1e48df2

Merge pull request #620 from donglinjy/benchmark-viz

20e28d4

fix benchmark ranking and regret visualization

Change algo documentation

4cebc86

To simplify documentation of algorithm plugins, they have been moved to separate docs, with only pointers in core documentation. The algorithms class documentation is also reused to avoid rewriting the documentation of the arguments in sphinx.

doc8

7f9d6dc

Add tests for branching with missing config file

b39f7f0

The parent may have a script configuration file that is missing at the time of branching. Branching should not fail in such case and rely on the saved content of the configuration file to verify changes.

Merge pull request #623 from bouthilx/doc/external_algo_plugin

28a6331

Change algo documentation

Fix isort

403ae8e

Remove unused backward

b547fc9

Merge pull request #625 from bouthilx/hotfix/branch_with_parent_missi…

7ba8c89

…ng_files Allow branching when parent script isn't available

Do not test for priors in backward comp tests

f557555

The DB upgrade does not update the space and priors anymore. The are handled anyway at runtime, no need to update them in the DB

Adapt tests to --enable-evc

38fc543

Enable EVC for parallel worker test

b1e4534

Adapt new tests from develop branch

32da6f7

Adapt consumer test

bb4e2b6

Merge pull request #627 from bouthilx/hotfix/db_update_break_paths

e409a89

Remove space upgrade from DB upgrade

Merge pull request #626 from bouthilx/feature/optional_evc

4d89c20

Add enable option for EVC, with default = false

Make test less stringent

7e86e8d

Make test even less stringent...

cd5f357

Merge pull request #630 from bouthilx/feature/filter_duplicate_in_evc

0921b7f

Feature/filter duplicate in evc

bouthilx added 24 commits July 29, 2021 16:50

Handle empty list of trials in strategies

c6a9549

Max and mean strategies were failing when all trials observed have no valid objectives.

Rename test file to avoid stupid pytest name clash....

00bb335

Fix doc refs

e1f3802

Add missing evc testing module

1f8384f

Move generic test utils for evc to testing module

ec2c7bd

Adjust tests for new duplication behavior

32a8a73

Fix isort and black

8e0701f

isort

8f2d7fb

Merge pull request #631 from bouthilx/hotfix/reserve_parent_trials

23a4127

Duplicate pending trials from parent/child for exc

Add missing tests

95ede66

Remove duplicated method

faada45

Yield mocked datetime properly

2aff79d

Access EVC inside OrionState context

823a9b4

Move utils tests to utils folder

7bb7bda

Cover fetch_lost_trials is experiment view tests

77d885d

Blackify...

f3b32e0

Merge pull request #635 from bouthilx/hotfix/precision_cardinality

ce644dc

Compute cardinality for loguniform with precision

Warn only if diffs exists during exp build

feb541a

A warning about conflicts was always printed when building an experiment. There should be no warning if there are no difference.

Merge pull request #637 from bouthilx/hotfix/parent_heartbeat

76656ea

Fix lost trials of parents

Merge pull request #638 from bouthilx/hotfix/diff_warning

40a0659

Warn only if diffs exists during exp build

bouthilx added the release label Aug 23, 2021

bouthilx added this to the v0.1.16 milestone Aug 23, 2021

bouthilx merged commit 6bc3b79 into master Aug 23, 2021

bouthilx deleted the release-v0.1.16rc1 branch August 23, 2021 19:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release v0.1.16rc1 #639

Release v0.1.16rc1 #639

bouthilx commented Aug 23, 2021 •

edited

Loading

Release v0.1.16rc1 #639

Release v0.1.16rc1 #639

Conversation

bouthilx commented Aug 23, 2021 • edited Loading

🏗 Enhancements

🐛 Bug Fixes

📜 Documentation

bouthilx commented Aug 23, 2021 •

edited

Loading