Video Guide to Azure ML and real example on titanic dataset (survival): https://channel9.msdn.com/Series/Choose-to-Code/Using-Azure-Machine-Learning
UC Irvine Machine Learning Repository: http://archive.ics.uci.edu/ml/
- numpy - mainly useful for its N-dimensional array objects
- pandas - Python data analysis library, including structures such as dataframes
- matplotlib - 2D plotting library producing publication quality figures
- scikit-learn - the machine learning algorithms used for data analysis and data mining tasks
CRISP-DM remains the most popular methodology for analytics, data mining, and data science projects.
https://en.wikipedia.org/wiki/Cross_Industry_Standard_Process_for_Data_Mining