NOTE: THIS IS NOT YET RELEASE READY, PLEASE BE PATIENT.
dstoolbox is not one big tool but rather a collection of small, reusable tools. They are intended to work well with scikit-learn and pandas and to make the integration of those libraries easier.
The tools included here are used by us at Otto Group BI in our production services, and by individual team members for other machine learning work, such as participating in Kaggle competitions.
pip install dstoolbox
There is a conda recipe for those who want to build their own conda package.
Pull requests are welcome. Here are some directions:
To run the tests, you first need to install the dev requirements, using either pip or conda:
pip install -r requirements-dev.txt
conda install --file requirements-dev.txt
Next you should check that all unit tests and all static code checks pass:
py.test
pylint dstoolbox