d2m - Data to model

A machine learning pipeline for trustworthy and green models, enabling responsible AI:

Explainable AI, using SHAP, LIME or both.
Uncertainty estimation, using Bayesian dropout for neural networks.
Carbon emissions tracking and reporting, using CodeCarbon.

d2m lets you easily create and evaluate machine learning models for tabular and time series data, with built-in data profiling and feature engineering.

Usage

Tested on:

Linux
macOS
Windows with WSL 2

Clone/download this repository.
Place your data files (CSV) in a folder named after your dataset (DATASET) inside assets/data/raw/, so that the files are located at assets/data/raw/[DATASET]/.
Update params.yaml with the name of your dataset (DATASET), the target variable, and other configuration parameters.
Build the Docker image:

docker build -t d2m -f Dockerfile .

Run the container:

docker run -p 5000:5000 -it -v $(pwd)/assets:/usr/d2m/assets -v $(pwd)/.dvc:/usr/d2m/.dvc d2m

Open the website at localhost:5000 to use the graphical user interface.
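
The data placement step above (CSV files under assets/data/raw/[DATASET]/) can be sketched as a short shell script. This is a minimal illustration, not part of d2m itself: the dataset name my_dataset, the source folder ./my_csv_files and the demo file example.csv are all placeholders invented for this example, and the script is meant to be run from the repository root.

```shell
# Sketch: stage CSV files for d2m under assets/data/raw/<DATASET>/.
# DATASET, SOURCE_DIR and example.csv are placeholders for illustration only.
set -eu

DATASET="${DATASET:-my_dataset}"            # hypothetical dataset name
SOURCE_DIR="${SOURCE_DIR:-./my_csv_files}"  # hypothetical folder holding your CSV files
TARGET_DIR="assets/data/raw/${DATASET}"

# Demo only: if the source folder has no CSV files, create a tiny one
# so that the sketch runs end to end.
mkdir -p "$SOURCE_DIR"
if ! ls "$SOURCE_DIR"/*.csv >/dev/null 2>&1; then
    printf 'timestamp,feature,target\n0,1.0,2.0\n1,1.5,3.0\n' > "$SOURCE_DIR/example.csv"
fi

# Create the dataset folder and copy the CSV files into it.
mkdir -p "$TARGET_DIR"
cp "$SOURCE_DIR"/*.csv "$TARGET_DIR"/

echo "Staged CSV files in $TARGET_DIR:"
ls "$TARGET_DIR"
```

The same name used for DATASET here is the one to enter in params.yaml.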

Creating models on the command line

Copy params.yaml from the host to the container (find CONTAINER_NAME by running docker ps):

docker cp params.yaml [CONTAINER_NAME]:/usr/d2m/params.yaml

Then, from the host, run the pipeline inside the container:

docker exec [CONTAINER_NAME] dvc repro
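
The two commands above can be combined into a small helper that looks up the container name automatically. This is a sketch under stated assumptions: it assumes the image was built with the tag d2m as shown earlier and that only one such container is running. The function names d2m_container_name and d2m_repro are made up for this example and are not part of d2m.

```shell
# Sketch: copy params.yaml into the running d2m container and run the pipeline.
# Assumes the image tag "d2m"; the function names are illustrative only.

# Print the name of the first running container created from the d2m image.
d2m_container_name() {
    docker ps --filter "ancestor=d2m" --format '{{.Names}}' | head -n 1
}

# Copy the host's params.yaml into the container, then run "dvc repro" there.
d2m_repro() {
    container="$(d2m_container_name)"
    if [ -z "$container" ]; then
        echo "No running container found for image d2m" >&2
        return 1
    fi
    docker cp params.yaml "${container}:/usr/d2m/params.yaml"
    docker exec "$container" dvc repro
}
```

After sourcing these definitions on the host, calling d2m_repro from the repository root performs both steps in order. If several containers are running from the same image, pass the name explicitly with the original commands instead.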
