A Python tool that automatically cleans data sets and readies them for analysis.
datacleaner works with data in pandas DataFrames.
datacleaner is not magic, and it won't take an unorganized blob of text and automagically parse it out for you.
What datacleaner will do is save you a ton of time encoding and cleaning your data once it's already in a format that pandas DataFrames can handle.
Please see the repository license for the licensing and usage information for datacleaner.
Generally, we have licensed datacleaner to make it as widely usable as possible.
[empty for now]
datacleaner can be used on the command line. Use --help
to see its usage instructions.
[empty for now]
We welcome you to check the existing issues for bugs or enhancements to work on. If you have an idea for an extension to datacleaner, please file a new issue so we can discuss it.