-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Added README.md documentation to PyPI
- Loading branch information
Unai Torrecilla
committed
Oct 25, 2022
1 parent
8efa545
commit 71e6510
Showing
7 changed files
with
117 additions
and
4 deletions.
There are no files selected for viewing
Binary file not shown.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,88 @@ | ||
Metadata-Version: 2.1 | ||
Name: mlforall | ||
Version: 0.2 | ||
Summary: Library that easily allows to create machine learning progress for more unexperiencied programmers. | ||
Home-page: /~https://github.com/UnaiTorrecilla/MLForAll | ||
Download-URL: /~https://github.com/UnaiTorrecilla/MLForAll/archive/refs/tags/v_02.tar.gz | ||
Author: Unai Torrecilla | ||
Author-email: unai.torrecilla@alumni.mondragon.edu | ||
License: MIT | ||
Keywords: Machine learning,Easy to use | ||
Classifier: Development Status :: 5 - Production/Stable | ||
Classifier: Intended Audience :: Developers | ||
Classifier: Topic :: Software Development :: Build Tools | ||
Classifier: License :: OSI Approved :: MIT License | ||
Classifier: Programming Language :: Python :: 3.9 | ||
Description-Content-Type: text/markdown | ||
License-File: LICENSE | ||
|
||
# MLForAll: Easy ML projects from scratch | ||
[data:image/s3,"s3://crabby-images/214d1/214d1da3d490c1ae533923bfa76044061598169a" alt="PyPI Latest Release"](https://pypi.org/project/mlforall/) | ||
[data:image/s3,"s3://crabby-images/c7ace/c7ace2260d0da59d74395bc66c5adfca648d37e5" alt="Package Status"](https://pypi.org/project/mlforall/) | ||
[data:image/s3,"s3://crabby-images/6323c/6323c0df38423d7ad8b2899a1e46c9e751a070a9" alt="License"](/~https://github.com/mlforall-dev/mlforall/blob/main/LICENSE) | ||
[data:image/s3,"s3://crabby-images/98647/986475842f2907062b79c4bb27fdd075d638e5b9" alt="Code style: black"](/~https://github.com/psf/black) | ||
|
||
|
||
## Description | ||
**mlforall** is an open-source library aimed to developers that are beginners in the data analysis area but want to build powerful machine learning projects from the very beginning. The package offers a reliable, easy to use and well documented set of functions that drive the user through the most common steps of any machine learning projects, from data reading to model testing. | ||
|
||
## Main features | ||
These are some of the functionalities that mlforall offers: | ||
1. File extension asbtraction when reading data (only supported for `.csv`, `.txt`, `.xlsx`, `.xlsx`, `.parquet` and `.npy`) | ||
2. Automatic handling of non-numeric features and missing values. | ||
3. A pool with almost all the data-scaling methods available and the most common ML models. | ||
4. Automatic model evaluation and reporting. | ||
|
||
## Usage options | ||
The **mlforall** package can be used by command line interface or with Interactive Python Notebooks (`.ipynb`). An example of the first usage method is the following: | ||
|
||
```sh | ||
# MLForAll usage through command line | ||
python -m mlforall --kwargs | ||
``` | ||
With this option, the library will execute the full ML pipeline, from data reading to model testing. The available keyword arguments are: | ||
- --data_path: **Mandatory**. *str*. | ||
- The absolute or relative route to the data for which the ML pipelin wants to be built. | ||
- --target_var: **Mandatory**. *str* or *int*. | ||
- The name or the position of the target column in the original data. | ||
- --test_size: **Optional**. *float* | ||
- Proportion of the data that wants to be kept as test. The defualt is 0.2. | ||
- --cv: **Optional**. *int* | ||
- The number of k-folds that will be performed when evaluating the model. The deault is 5. | ||
- --path_to_save_metrics: **Optional**. *str* | ||
- The absolute or relative path to save the final metrics of the model. It must be a file with the `.csv` extension. If not provided, the metrics will be printed in the console but not saved. | ||
|
||
|
||
As mentioned, usage from `.ipynb` files is also possible. As of version 0.1, the way to use this module from `.ipynb` files is by importing the submodules and using their methods separatedly. To import the different modules the following code snippet can be used: | ||
|
||
```sh | ||
# MLForAll usage through .ipynb files | ||
from mlforall.DataReading import DataReader | ||
from mlforall.DataScaling import DataScaler | ||
from mlforall.DataModeling import DataModeler | ||
|
||
|
||
data_reader = DataReader.ReadData(*args, **kwargs) | ||
data_scaler = DataScaler.ScaleData(*args, **kwargs) | ||
data_modeler = DataModeler.ModelData(*args, **kwargs) | ||
``` | ||
|
||
|
||
## Dependencies | ||
- [Numpy. It offers comprehensive mathematical functions, random number generators, linear algebra routines, Fourier transforms, and more](https://numpy.org) | ||
- [Pandas. a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, | ||
built on top of the Python programming language](https://pandas.pydata.org) | ||
- [Scikit-learn. accesible, reusable, simple and efficient tools for predictive data analysis](https://scikit-learn.org/stable/) | ||
|
||
## Where to get it | ||
The source code is currently hosted on GitHub at: | ||
[/~https://github.com/UnaiTorrecilla/MLForAll](/~https://github.com/UnaiTorrecilla/MLForAll) | ||
|
||
Binary installers for the latest released version are available at the [Python | ||
Package Index (PyPI)](https://pypi.org/project/mlforall) | ||
|
||
|
||
```sh | ||
# PyPI | ||
pip install mlforall | ||
``` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,11 @@ | ||
LICENSE | ||
README.md | ||
setup.cfg | ||
setup.py | ||
mlforall/__init__.py | ||
mlforall/__main__.py | ||
mlforall.egg-info/PKG-INFO | ||
mlforall.egg-info/SOURCES.txt | ||
mlforall.egg-info/dependency_links.txt | ||
mlforall.egg-info/requires.txt | ||
mlforall.egg-info/top_level.txt |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1 @@ | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
pandas | ||
numpy | ||
scikit-learn |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1 @@ | ||
mlforall |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters