Trane is a software package designed to automatically generate problems for temporal datasets and produce labels for supervised learning. It aims to enhance the automation of the machine learning problem-solving process.
To install Trane, run the following command:
python -m pip install trane
Below is an example of using Trane:
import trane
data, metadata = trane.load_airbnb()
entity_columns = ["location"]
window_size = "2d"
problem_generator = trane.ProblemGenerator(
metadata=metadata,
window_size=window_size,
entity_columns=entity_columns
)
problems = problem_generator.generate()
print(f'Generated {len(problems)} problems')
print(problems[108])
print(problems[108].create_target_values(data).head(5))
Generated 168 problems
For each <location> predict the majority <rating> in all related records in next 2d days
location time target
0 London 2021-01-01 5
1 London 2021-01-03 4
2 London 2021-01-05 5
3 London 2021-01-07 4
4 London 2021-01-09 5
- Need help? Use a GitHub issue
- Prefer chatting? Join Slack
If you use Trane, please consider citing the following paper:
Ben Schreck, Kalyan Veeramachaneni. What Would a Data Scientist Ask? Automatically Formulating and Solving Predictive Problems. IEEE DSAA 2016, 440-451
BibTeX entry:
@inproceedings{schreck2016would,
title={What Would a Data Scientist Ask? Automatically Formulating and Solving Predictive Problems},
author={Schreck, Benjamin and Veeramachaneni, Kalyan},
booktitle={Data Science and Advanced Analytics (DSAA), 2016 IEEE International Conference on},
pages={440--451},
year={2016},
organization={IEEE}
}