General

16 Important Data Science Papers:

Machine Learning Formulas:

140 Machine Learning Formulas

Best Reference Books - Database Concepts and System:

Top 9 Data Science Algorithms:

Top 11 Data Structure Books:

Books and articles about Flowcharts:

Lecture Notes:

Introduction, linear classification, perceptron update rule (PDF)
Perceptron convergence, generalization (PDF)
Maximum margin classification (PDF)
Classification errors, regularization, logistic regression (PDF)
Linear regression, estimator bias and variance, active learning (PDF)
Active learning (cont.), non-linear predictions, kernels (PDF)
Kernel regression, kernels (PDF)
Support vector machine (SVM) and kernels, kernel optimization (PDF)
Model selection (PDF)
Model selection criteria (PDF)
Description length, feature selection (PDF)
Combining classifiers, boosting (PDF)
Boosting, margin, and complexity (PDF)
Margin and generalization, mixture models (PDF)
Mixtures and the expectation maximization (EM) algorithm (PDF)
EM, regularization, clustering (PDF)
Clustering (PDF)
Spectral clustering, Markov models (PDF)
Hidden Markov models (HMMs) (PDF)
HMMs (cont.) (PDF)
Bayesian networks (PDF)
Learning Bayesian networks (PDF)
Probabilistic inference - Guest lecture on collaborative filtering (PDF)

Books:

22 Algorithms Books Every Programmer Should Read:

Assignments:

Bayes' Theorem – The Forecasting Pillar of Data Science:

Bayes' Theorem
Bayes' Theorem in the 21st Century
Bayesian Modelling
Bayesian Reasoning and Machine Learning

Essential Math for Data Science:

Data Science Case Studies:

Data Science Tutorials for Beginners:

A Tutorial on Machine Learning and Data Science Tools with Python
Data Science Tutorial
Big Data Analytics Tutorial
Machine Learning Tutorial
Deep Learning Tutorial
Data Mining Tutorial
The Field Guide to Data Science
Agile Data Science Tutorial
Beginner's Guide to Data Science
Tutorial on Natural Language Processing
Natural Language Processing: A Tutorial
Artificial Neural Networks Tutorial
Machine Learning in Computer Vision
The Fundamentals of Machine Vision

Lecture Notes by Andrew Ng:

01 and 02: Introduction, Regression Analysis and Gradient Descent
03: Linear Algebra - review
04: Linear Regression with Multiple Variables
05: Octave
06: Logistic Regression
07: Regularization
08: Neural Networks - Representation
09: Neural Networks - Learning
10: Advice for applying machine learning techniques
11: Machine Learning System Design
12: Support Vector Machines
13: Clustering
14: Dimensionality Reduction
15: Anomaly Detection
16: Recommender Systems
17: Large Scale Machine Learning
18: Application Example - Photo OCR
Understanding Andrew Ng's Machine Learning Course – Notes and codes

50 selected papers in Data Mining and Machine Learning:

General

Data Mining and Statistics: What’s the Connection?

Data Mining: Statistics and More?, D. Hand, American Statistician, 52(2):112-118.

Data Mining, G. Weiss and B. Davison, in Handbook of Technology Management, John Wiley and Sons, expected 2010.

From Data Mining to Knowledge Discovery in Databases, U. Fayyad, G. Piatetsky-Shapiro & P. Smyth, AI Magazine, 17(3):37-54, Fall 1996.

Mining Business Databases, Communications of the ACM, 39(11): 42-48.

10 Challenging Problems in Data Mining Research, Q. Yang and X. Wu, International Journal of Information Technology & Decision Making, Vol. 5, No. 4, 2006, 597-604.

General Data Mining Methods and Algorithms

Top 10 Algorithms in Data Mining, X. Wu, V. Kumar, J.R. Quinlan, J. Ghosh, Q. Yang, H. Motoda, G.J. McLachlan, A. Ng, B. Liu, P.S. Yu, Z. Zhou, M. Steinbach, D. J. Hand, D. Steinberg, Knowl Inf Syst (2008) 14:1-37.

Induction of Decision Trees, R. Quinlan, Machine Learning, 1(1):81-106, 1986.

Web and Link Mining

The Pagerank Citation Ranking: Bringing Order to the Web, L. Page, S. Brin, R. Motwani, T. Winograd, Technical Report, Stanford University, 1999.

The Structure and Function of Complex Networks, M. E. J. Newman, SIAM Review, 2003, 45, 167-256.

Link Mining: A New Data Mining Challenge, L. Getoor, SIGKDD Explorations, 2003, 5(1), 84-89.

Link Mining: A Survey, L. Getoor, SIGKDD Explorations, 2005, 7(2), 3-12.

Semi-supervised Learning

Semi-Supervised Learning Literature Survey, X. Zhu, Computer Sciences TR 1530, University of Wisconsin — Madison.

Learning with Labeled and Unlabeled Data, M. Seeger, University of Edinburgh (unpublished), 2002.

Person Identification in Webcam Images: An Application of Semi-Supervised Learning, M. Balcan, A. Blum, P. Choi, J. Lafferty, B. Pantano, M. Rwebangira, X. Zhu, Proceedings of the 22nd ICML Workshop on Learning with Partially Classified Training Data, 2005.

Learning from Labeled and Unlabeled Data: An Empirical Study across Techniques and Domains, N. Chawla, G. Karakoulas, Journal of Artificial Intelligence Research, 23:331-366, 2005.

Text Classification from Labeled and Unlabeled Documents using EM, K. Nigam, A. McCallum, S. Thrun, T. Mitchell, Machine Learning, 39, 103-134, 2000.

Self-taught Learning: Transfer Learning from Unlabeled Data, R. Raina, A. Battle, H. Lee, B. Packer, A. Ng, in Proceedings of the 24th International Conference on Machine Learning, 2007.

An Iterative Algorithm for Extending Learners to a Semisupervised Setting, M. Culp, G. Michailidis, 2007 Joint Statistical Meetings (JSM), 2007.

Get Another Label? Improving Data Quality and Data Mining Using Multiple, Noisy Labelers, V. Sheng, F. Provost, P. Ipeirotis, in Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008.

Logistic Regression for Partial Labels, in 9th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems, Volume III, pp. 1935-1941, 2002.

Classification with Partial labels, N. Nguyen, R. Caruana, in Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008.

Induction of Decision Trees from Partially Classified Data Using Belief Functions, M. Bjanger, Norwegian University of Science and Technology, 2000.

Knowledge Discovery in Large Image Databases: Dealing with Uncertainties in Ground Truth, P. Smyth, M. Burl, U. Fayyad, P. Perona, KDD Workshop 1994, AAAI Technical Report WS-94-03, pp. 109-120, 1994.

Active Learning

Improving Generalization with Active Learning, D. Cohn, L. Atlas, and R. Ladner, Machine Learning 15(2), 201-221, May 1994.

On Active Learning for Data Acquisition, Z. Zheng and B. Padmanabhan, In Proc. of IEEE Intl. Conf. on Data Mining, 2002.

Active Sampling for Class Probability Estimation and Ranking, M. Saar-Tsechansky and F. Provost, Machine Learning 54:2 2004, 153-178.

The Learning-Curve Sampling Method Applied to Model-Based Clustering, C. Meek, B. Thiesson, and D. Heckerman, Journal of Machine Learning Research 2:397-418, 2002.

Active Sampling for Feature Selection, S. Veeramachaneni and P. Avesani, Third IEEE Conference on Data Mining, 2003.

Heterogeneous Uncertainty Sampling for Supervised Learning, D. Lewis and J. Catlett, In Proceedings of the 11th International Conference on Machine Learning, 148-156, 1994.

Learning When Training Data are Costly: The Effect of Class Distribution on Tree Induction, G. Weiss and F. Provost, Journal of Artificial Intelligence Research, 19:315-354, 2003.

Active Learning using Adaptive Resampling, KDD 2000, 91-98.

Cost-Sensitive Learning

Types of Cost in Inductive Concept Learning, P. Turney, In Proceedings Workshop on Cost-Sensitive Learning at the Seventeenth International Conference on Machine Learning.

Toward Scalable Learning with Non-Uniform Class and Cost Distributions: A Case Study in Credit Card Fraud Detection, P. Chan and S. Stolfo, KDD 1998.

Papers

Learning when Data Sets are Imbalanced and When Costs are Unequal and Unknown, M. Maloof, in ICML Workshop on Learning from Imbalanced Datasets II, 2003.

Uncertainty Sampling Methods for One-class Classifiers, P. Juszczak and R. Duin, in ICML Workshop on Learning from Imbalanced Datasets II, 2003.

C4.5, Class Imbalance, and Cost Sensitivity: Why Under-Sampling beats Over-Sampling, C. Drummond and R. Holte, in ICML Workshop on Learning from Imbalanced Datasets II, 2003.

C4.5 and Imbalanced Data sets: Investigating the effect of sampling method, probabilistic estimate, and decision tree structure, N. Chawla, in ICML Workshop on Learning from Imbalanced Datasets II, 2003.

Wrapper-based Computation and Evaluation of Sampling Methods for Imbalanced Datasets, N. Chawla, L. Hall, and A. Joshi, in Proceedings of the 1st International Workshop on Utility-based Data Mining, 24-33, 2005.

Learning from Little: Comparison of Classifiers Given Little Training, G. Forman and I. Cohen, in 8th European Conference on Principles and Practice of Knowledge Discovery in Databases, 161-172, 2004.

A Multiple Resampling Method for Learning from Imbalanced Data Sets, A. Estabrooks, T. Jo, and N. Japkowicz, in Computational Intelligence, 20(1), 2004.

A Study of the Behavior of Several Methods for Balancing Machine Learning Training Data, G. Batista, R. Prati, and M. Monard, SIGKDD Explorations, 6(1):20-29, 2004.

Class Imbalance versus Small Disjuncts, T. Jo and N. Japkowicz, SIGKDD Explorations, 6(1): 40-49, 2004.

Extreme Re-balancing for SVMs: a Case Study, B. Raskutti and A. Kowalczyk, SIGKDD Explorations, 6(1):60-69, 2004.

Generative Oversampling for Mining Imbalanced Datasets, A. Liu, J. Ghosh, and C. Martin, Third International Conference on Data Mining (DMIN-07), 66-72.

Computing Machinery and Intelligence

Class Imbalances: Are we Focusing on the Right Issue?, N. Japkowicz, in ICML Workshop on Learning from Imbalanced Datasets II, 2003.

Recommender Systems

Trust No One: Evaluating Trust-based Filtering for Recommenders, J. O’Donovan and B. Smyth, In Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI-05), 2005, 1663-1665.

Trust in Recommender Systems, J. O’Donovan and B. Smyth, In Proceedings of the 10th International Conference on Intelligent User Interfaces (IUI-05), 2005, 167-174.

10 Cutting Edge Research-Papers In Computer Vision and Image Generation:

21 hottest research papers on Computer Vision and Machine Learning:

10 Important AI Research Papers:

6 Top NLP Papers:

The 18 Best Books About AI:

Top 30 most influential papers in the world of big data:

Top Papers on Clustering Algorithms:

5 Latest Research Papers On ML You Must Read:

Readings in Databases:

The 5 Best Data Science Books for Non-Techies:

Five Must-Read Statistics Books to Become a Successful Data Analyst:

Deep Learning Papers:

Research Papers on Programming Languages:

Numenta Research Papers:

Awesome Machine Learning Papers:

3D Detection Papers:

The papers in this list cover 3D detection and semantic segmentation for autonomous vehicles, especially methods that use point clouds and deep learning.

Ten Trending Academic Papers on the Future of Computer Vision:

Must-Read Papers on GANs:

Generative Adversarial Networks (GANs) are one of the most interesting and popular applications of deep learning. Here is a list of 10 papers on GANs that give a solid introduction to the topic as well as a foundation for understanding the state of the art.

5 Must-read Papers on Product Categorization for Data Scientists:

Must read research papers on Data Structures:

Key Papers in Deep RL:

Model-Free RL:

Deep Q-Learning

Policy Gradients

Deterministic Policy Gradients

Distributional RL

Policy Gradients with Action-Dependent Baselines

Path-Consistency Learning

Other Directions for Combining Policy-Learning and Q-Learning

Evolutionary Algorithms

Evolution Strategies as a Scalable Alternative to Reinforcement Learning

Exploration:

Intrinsic Motivation

Unsupervised RL

Transfer and Multitask RL:

Hierarchy:

Memory:

Model-Based RL:

Model is Learned

Model is Given

Meta-RL:

Scaling RL:

RL in the Real World:

Safety:

Imitation Learning and Inverse Reinforcement Learning:

Reproducibility, Analysis, and Critique:

Bonus: Classic Papers in RL Theory or Review:

14 NLP Research Breakthroughs You Can Apply To Your Business:

Most Downloaded Artificial Intelligence Articles:

AI Papers and Notes:

Most Influential Data Science Research Papers:

Awesome Fraud Detection Research Papers:

Machine Learning Lectures:

Black Box Machine Learning
Introduction to Statistical Learning Theory
Gradient and Stochastic Gradient Descent
Excess Risk Decomposition
l₁ and l₂ Regularization
Lasso, Ridge, and Elastic Net
Loss Functions for Regression and Classification
Lagrangian Duality and Convex Optimization
Support Vector Machines
Subgradient Descent
Features
Kernel Methods
Performance Evaluation
"CitySense": Probabilistic Modeling and Anomaly Detection
Maximum Likelihood Estimation
Conditional Probability Models
Bayesian Methods
Bayesian Regression
Classification and Regression Trees
Basic Statistics and a Bit of Bootstrap
Bagging and Random Forests
Gradient Boosting
Multiclass and Introduction to Structured Prediction
k-Means Clustering
Gaussian Mixture Models
EM Algorithm for Latent Variable Models
Neural Networks
Backpropagation and the Chain Rule
Case Study: Churn Prediction
Next Steps

Assignments:

The 5 Algorithms for Efficient Deep Learning Inference on Small Devices:

Pruning Neural Networks:

Deep Compression:

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

Data Quantization:

Going Deeper with Embedded FPGA Platform for Convolutional Neural Network

Low-Rank Approximation:

Efficient and Accurate Approximations of Nonlinear Convolutional Networks

Trained Ternary Quantization:

Trained Ternary Quantization

iqbaljaved/Data-Science-Study-Materials

Neuro AI Papers:

Quantum ML Papers:

Healthcare ML Papers:

Human AI Interaction Papers:

Economics ML Papers:

Text Detection Papers:

Proteins ML Papers:
