Stars
Zipline, a Pythonic Algorithmic Trading Library
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
An open access book on scientific visualization using Python and Matplotlib
An open-source Python library for automated feature engineering
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Missing data visualization module for Python.
An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks
Pandas integration with sklearn
pyforest - feel the bliss of automated imports
This is some starter / reference code for those interested in starting data science
The Complete Python Course, published by Packt
Explore tips and tricks to deploy machine learning models with Docker.
Research at the intersection of natural language processing and social network analysis.
A program designed to analyse a dataset and provide summary data regarding both categorical and continuous data in the form of a CSV file. The summary data includes means, medians, modes, cardinali…
Create a command-line user interface that allows the user to query data from the stockcards data file. Understanding customer buying patterns, geographical distribution of transactions, stocks item analys…
Big Data Analytics for Healthcare - Homework 1
This is my capstone project. The full presentation can be found in the uploaded PDF file. I created a model that calculates the ROI and predicts individual donations to election campaigns for the U…
Personal Fitbit data analysis using the python-fitbit API