Skip to content

An open source python library for automated prediction engineering

License

Notifications You must be signed in to change notification settings

trane-dev/Trane

Repository files navigation

“Trane_Logo”

Tests PyPI Version PyPI Downloads


Trane is a software package designed to automatically generate problems for temporal datasets and produce labels for supervised learning. It aims to enhance the automation of the machine learning problem-solving process.

Install

To install Trane, run the following command:

python -m pip install trane

Example

Below is an example of using Trane:

import trane
data, metadata = trane.load_airbnb()
entity_columns = ["location"]
window_size = "2d"
problem_generator = trane.ProblemGenerator(
    metadata=metadata,
    window_size=window_size,
    entity_columns=entity_columns
)
problems = problem_generator.generate()
print(f'Generated {len(problems)} problems')
print(problems[108])
print(problems[108].create_target_values(data).head(5))
Generated 168 problems
For each <location> predict the majority <rating> in all related records in next 2d days
  location       time  target
0   London 2021-01-01       5
1   London 2021-01-03       4
2   London 2021-01-05       5
3   London 2021-01-07       4
4   London 2021-01-09       5

Community

Citing Trane

If you use Trane, please consider citing the following paper:

Ben Schreck, Kalyan Veeramachaneni. What Would a Data Scientist Ask? Automatically Formulating and Solving Predictive Problems. IEEE DSAA 2016, 440-451

BibTeX entry:

@inproceedings{schreck2016would,
  title={What Would a Data Scientist Ask? Automatically Formulating and Solving Predictive Problems},
  author={Schreck, Benjamin and Veeramachaneni, Kalyan},
  booktitle={Data Science and Advanced Analytics (DSAA), 2016 IEEE International Conference on},
  pages={440--451},
  year={2016},
  organization={IEEE}
}