Scikit-DYN2SEL is a framework for implementing and applying dynamic classifier selection (DCS) techniques in data stream mining.
Scikit-dyn2sel can be easily installed with pip. Just run:
pip install dyn2sel
Scikit-DYN2SEL works with ensembles from the scikit-multiflow library, and with any implementation that follows the same interface.
Scikit-DYN2SEL classifiers follow the same interface as scikit-multiflow, which is in turn based on the popular scikit-learn. Scikit-DYN2SEL should therefore feel familiar if you know either of these libraries.
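As a rough illustration of what "the same interface" means, the sketch below defines a toy incremental classifier exposing `partial_fit` and `predict`, the two methods that scikit-multiflow's stream learners are built around. `MajorityClassClassifier` is a hypothetical name invented for this example and is not part of scikit-DYN2SEL or scikit-multiflow; it uses plain Python lists instead of NumPy arrays so that it runs without any dependency.

```python
from collections import Counter


class MajorityClassClassifier:
    """Toy incremental learner (hypothetical, for illustration only)."""

    def __init__(self):
        self.class_counts = Counter()

    def partial_fit(self, X, y, classes=None, sample_weight=None):
        # Update the model with one chunk; counts from earlier chunks are kept.
        # X, classes and sample_weight are accepted only to mirror the
        # interface and are ignored by this toy model.
        self.class_counts.update(y)
        return self

    def predict(self, X):
        # Predict the most frequent label seen so far for every row of X.
        if not self.class_counts:
            raise ValueError("partial_fit must be called before predict")
        majority = self.class_counts.most_common(1)[0][0]
        return [majority for _ in X]


clf = MajorityClassClassifier()
clf.partial_fit([[0.1], [0.4], [0.9]], [0, 1, 1])  # first chunk of the stream
clf.partial_fit([[0.3], [0.2]], [0, 0])            # second chunk
predictions = clf.predict([[0.5], [0.7]])
```

Any object shaped like this can, in principle, be trained chunk by chunk as a stream arrives, which is the property the interface is meant to guarantee.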
In its current state, scikit-multiflow contains data generators, multi-output/multi-target stream learning methods, change detection methods, evaluation methods, and more.
Distributed under the MIT license, scikit-DYN2SEL implements current state-of-the-art methods. If you think a method is missing, feel free to open an issue or submit a pull request with your implementation.
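Using Scikit-DYN2SEL is straightforward. The example below wraps Hoeffding trees in a `DYNSEMethod` ensemble that uses KNORA-E as its DCS technique, then evaluates it prequentially on a SEA data stream: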
from skmultiflow.evaluation import EvaluatePrequential
from skmultiflow.data import SEAGenerator
from skmultiflow.trees import HoeffdingTree
from dyn2sel.apply_dcs import DYNSEMethod
from dyn2sel.dcs_techniques import KNORAE
clf = DYNSEMethod(
    HoeffdingTree(), chunk_size=1000,
    dcs_method=KNORAE(), max_ensemble_size=10)
gen = SEAGenerator()
ev = EvaluatePrequential()
ev.evaluate(gen, clf)
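The `KNORAE()` argument above selects KNORA-Eliminate as the selection rule. The sketch below shows the general idea of KNORA-E in plain Python; it is not scikit-dyn2sel's actual implementation. For a query point it finds the k nearest neighbours in a labelled validation set, keeps only the ensemble members that classify all of those neighbours correctly, shrinks the neighbourhood if no member qualifies, and lets the survivors vote. The function `knora_e_predict`, the lambda classifiers, and the toy data are all hypothetical.

```python
from collections import Counter


def knora_e_predict(ensemble, X_val, y_val, query, k=3):
    """Predict one label with a KNORA-Eliminate style selection (sketch).

    ensemble: list of callables mapping a feature vector to a label.
    X_val, y_val: labelled validation instances (the region of competence
    is drawn from these).
    """
    def sq_dist(a, b):
        # Squared Euclidean distance; enough for ranking neighbours.
        return sum((ai - bi) ** 2 for ai, bi in zip(a, b))

    # Indices of the k validation instances closest to the query.
    order = sorted(range(len(X_val)), key=lambda i: sq_dist(X_val[i], query))
    neighbours = order[:k]

    selected = []
    # Shrink the neighbourhood until at least one classifier is perfect on it.
    while neighbours and not selected:
        selected = [
            member for member in ensemble
            if all(member(X_val[i]) == y_val[i] for i in neighbours)
        ]
        if not selected:
            neighbours = neighbours[:-1]  # drop the farthest neighbour

    # Fall back to the whole ensemble if no classifier was ever selected.
    voters = selected or ensemble
    votes = Counter(member(query) for member in voters)
    return votes.most_common(1)[0][0]


# Hypothetical 1-D data whose true class boundary lies at 0.5.
X_val = [[0.1], [0.3], [0.4], [0.6], [0.8], [0.9]]
y_val = [0, 0, 0, 1, 1, 1]

# Hypothetical ensemble: only the last member matches the true boundary.
ensemble = [
    lambda x: int(x[0] > 0.2),  # threshold placed too low
    lambda x: 1,                # always predicts class 1
    lambda x: int(x[0] > 0.5),  # threshold at the true boundary
]

majority_vote = Counter(m([0.3]) for m in ensemble).most_common(1)[0][0]
pred_low = knora_e_predict(ensemble, X_val, y_val, [0.3])
pred_high = knora_e_predict(ensemble, X_val, y_val, [0.85])
```

In this toy setup a plain majority vote labels the query `[0.3]` as class 1, while the selection step discards the two members that err near the query and returns class 0. That difference is the motivation for choosing classifiers per instance rather than always combining the full ensemble.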
For a complete reference on the library, you can check the documentation.
If you use scikit-dyn2sel in a paper, please cite it as follows:
@ARTICLE{dyn2sel,
author = {{Portes Cavalheiro}, Lucca and {Barddal}, Jean Paul and
{Souza de Britto Jr}, Alceu and {Heutte}, Laurent},
title = "{scikit-dyn2sel -- A Dynamic Selection Framework for Data Streams}",
journal = {arXiv e-prints},
keywords = {Computer Science - Machine Learning},
year = 2020,
month = aug,
eid = {arXiv:2008.08920},
pages = {arXiv:2008.08920},
archivePrefix = {arXiv},
eprint = {2008.08920},
primaryClass = {cs.LG},
adsurl = {https://ui.adsabs.harvard.edu/abs/2020arXiv200808920P},
adsnote = {Provided by the SAO/NASA Astrophysics Data System}
}