Breaking Changes
-
The imputation steps do not change the data type being imputed now. Previously, if the data were integer, the data would be changed to numeric (for some step types). The change is breaking since the underlying data of imputed values are now saved as a list instead of a vector (for some step types).
-
The data sets were moved to the new
modeldata
package. -
step_num2factor()
was rewritten due to a bug that ignored the user-supplied levels (#425). The results of thetransform
argument are now required to be a function andlevels
must now be supplied.
Other Changes
-
Using a minus in the formula to
recipes()
is no longer allowed (it didn't remove variables anyway).step_rm()
orupdate_role()
can be used instead. -
When using a selector that returns no columns,
juice()
andbake()
will now return a tibble with as many rows as the original template data or thenew_data
respectively. This is more consistent with how selectors work in dplyr (#411). -
Code was added to explicitly register
tunable
methods whenrecipes
is loaded. This is required because of changes occurring in R 4.0. -
check_class()
checks if a variable is of the designated class. Class is either learned from the train set or provided in the check. (contributed by Edwin Thoen) -
step_normalize()
andstep_scale()
gained afactor
argument with values of 1 or 2 that can scale the standard deviations used to transform the data. (#380) -
bake()
now produces a tibble with columns in the same order asjuice()
(#365)