This repository is intended to provide a free Self-Learning Roadmap to learn the field of Data Science. I provide some of the best free resources.
ββOur Previous Roadmap
ββ
If you Dont know What`s Data Science or Projects Life Cycle (starting from Business Understanding to Deployment) or Which Programming Language you should go for or Job Descriptions or the required Soft & Hard Skills needed for this field or Data Science Applications or the Most Common Mistakes, then
πThis Video is for you (Highly Recommended βοΈ)
Anaconda: Itβs a tool kit that fulfills all your necessities in writing and running code. From Powershell prompt to Jupyter Notebook and PyCharm, even R Studio (if interested to try R)
Atom: A more advanced Python interface, highly recommended by experts.
Google Colab: Itβs like a Jupyter Notebook but in the cloud. You donβt need to install anything locally. All the important libraries are already installed. For example NumPy, Pandas, Matplotlib, and Sci-kit Learn
PyCharm: PyCharm is another excellent IDE that enables you to integrate with libraries such as NumPy and Matplotlib, allowing you to work with array viewers and interactive plots.
Thonny: Thonny is an IDE for teaching and learning programming. Thonny is equipped with a debugger, and supports code completion, and highlights syntax errors.
π For Data Camp courses, github student pack gives 3 free months. Google how to get it.
if you already used it, do not hesitate to contact us to have an account with free access.:hibiscus:
- πΉ Video Content
- π Online Article Content / Book
π‘ Roadmap Explanation βΆοΈ Youtube Video π₯
Algorithms Book Every piece of code could be called an algorithm, but this book covers the
more interesting bits.
Specializations (data structures-algorithms)
1. Descriptive Stats.
βββ:video_camera: Intro to descriptive statistics
βββ:closed_book: Online statistics education
βββ:closed_book: Intro to descriptive statistics Article1 & Article2
βββ:video_camera: Arabic Course
βββ:video_camera: Intro to Inferential Statistics++
2. Probability
βββ:video_camera: Khan Academy
βββ:video_camera: Arabic Course
βββ:closed_book: Introduction to Probability
3. Python
βββ:video_camera: Introduction to Python Programming
βββ:video_camera: OOP
βββ:video_camera: Arabic - Hassouna | Elzero
βββ:video_camera: Python Full Course - FreeCodeCamp on YouTube
βββ:closed_book: Intro to Python for CS and Data Science
βββmore in OOP
4. Pandas
βββ:video_camera: Corey Schafer-Youtube
βββ:closed_book: Kaggle
βββ:closed_book: Docs
βββ:video_camera: Data School-Youtube
βββ:video_camera: Arabic Course
5. Numpy
βββ:closed_book: Kaggle
βββ:video_camera: Arabic Course
βββ:closed_book: Tutorial
βββ:closed_book: Docs
6. Scipy
βββ:closed_book: Tutorial
βββ:closed_book: Docs
7. Data Cleaning: One of the MOST important skills that you need to master to become a good data scientist, you need to practice on many datasets to master it.
βββRead this
βββ:video_camera: Course 1
βββ:closed_book: Notebook1
βββ:closed_book: Notebook2
βββ:closed_book: Notebook3
βββ:closed_book: Kaggle Data cleaning
8. Data Visualization π
βββ:video_camera: Introduction to Data Visualization with Matplotlib or
βββ:video_camera: Corey Schafer - Playlist on Youtube or
βββ:video_camera: sentdex - Playlist on YouTube
βββ:closed_book: Kaggle to Data Visualization with Seaborn
βββ:video_camera: Playlist-Youtube
βββ:video_camera: Course1: Intro to Data Visualization with Seaborn
βββ:video_camera: Course2: Intermediate Data Visualization with Seaborn
βββ:video_camera: Course3: Understanding and Visualizing with Python
9. EDA
Note: it's already mentioned in the above probability course
βββ:video_camera: DataCamp-EDA in Python
βββ:video_camera: IBM-EDA for Machine Learning
10. Dashboards
βTableau
βββ:closed_book: Tutorial
βββ:video_camera: docs
βββ:video_camera: course
βPower BI
βββ:video_camera: Power BI Desktop - Coursera
βββ:video_camera: Power BI training
βββ:video_camera: Arabic - Youtube
11. SQL and DB
βββ:video_camera: Intro to SQL or IBM
βββ:video_camera: Intro to Relational Databases in SQL
βββ:video_camera: Arabric Course
βββ:video_camera: Joining Data in SQL
12. Python Regular Expression
βββ:closed_book: Tutorial
13. Time Series Analysis
βββ:video_camera: Track
βββ:closed_book: Book
βββ:closed_book: fbprohet
βββ:video_camera: Arabic Source Video1 & Video2
1. Math for ML: consists of Linear Algebra, Calculus and PCA.
πΉ Specialization
πΉ Mathematics for Machine Learning - Most of the needed basics
πΉLinear Algebra
βββ:video_camera: Khan Academy - Linear Algebra
βββ:video_camera: Mathematics for Machine Learning: Linear Algebra
βββ:video_camera: 3Blue1Brown - Essence of Linear Algebra
πΉCalculus
βββ:video_camera: Multivariate Calculus - Coursera
βββ:video_camera: Essence of calculus - Youtube
πΉPCA
βββ:video_camera: PCA - Coursera
2. Machine Learning
βββ:video_camera: Coursera Free Course by Andrew Ng (Octave/Matlab)
βββ:video_camera: Coursera Andrew`s new ML Specialization (Python)
βββ:video_camera: Machine Learning Stanford Full Course on YouTube by Andrew
βββ:video_camera: CS480/680 Intro to Machine Learning - Spring 2019 - University of Waterloo
βββ:video_camera: SYDE 522 β Machine Intelligence (Winter 2018, University of Waterloo)
βββ:video_camera: Introduction to Machine Learning Course - Udacity
βββ:video_camera: Hesham Asem - Arabic content
βββ:video_camera: IBM ML with Python
βββ:video_camera: Machine Learning From Scratch - YouTube (Python Engineer)
βββ:closed_book: Hands on ML book
βββ:video_camera: ML Algorithms in Practice
βββ:video_camera: ML scientist
βββ:video_camera: Project
3. Web Scraping/APIs
βββ:video_camera: course
βββ:closed_book: intro2
βββ:closed_book: Tutorial
βββ:closed_book: Book for both topics
APIs
βββ:closed_book: Tutorial
βββ:closed_book: Article
βββ:closed_book: Tutorial
4. Stats.
βββ:closed_book: This stats - Book
βββ:closed_book: Think Bayes - Book
5. Advanced SQL
βββ:video_camera: More advanced SQL
βββ:video_camera: Joining Data in SQL
7. Feature Engineering
βββ:closed_book: Tutorial
βββ:closed_book: Article
βββ:closed_book: Book
8. interpet Shapley-based explanations of ML models.
βββ:closed_book: SHAP
βββ:closed_book: Kaggle ML explainability
Read this book, please π Introduction to Statistical Learning with Applications in R Ψ¨ΩΩΩΩ Ψ§ΩΨ±Ψ£Ω
1. Deep Learning
βββ:video_camera: Deep Learning Fundamentals
βββ:video_camera: Introduction to
Deep Learning - MIT
βββ:video_camera: Specialization
βββ:closed_book: Dive into Deep Learning - Book
βββ:video_camera: Deep Learning UC Berkely
βββ:closed_book: github of Dive into DL
βββ:video_camera: Stanford Lecture - Convolutional Neural Networks for Visual Recognition
βββ:video_camera: University of Waterloo - ML / DL
2. Tensorflow
βββ:video_camera: Specialization
βββ:video_camera: Youtube
βββ fast.ai's Deep Learning Courses
3. Advanced Data Science
βββ:video_camera: Advanced Data Science with IBM Specialization
4. NLP
βββ:video_camera: Specialization
βββ:video_camera: Introduction to Natural Language Processing in Python
5. Inferential Statistics
βββ:video_camera: Specialization, 2nd & 3rd courses
βββ:video_camera: course
6. Bayesian Statistics
βββ:video_camera: 1 - From Concept to Data Analysis
βββ:video_camera: 2 - Techniques and Models
βββ:video_camera: 3 - Mixture Models
7. Model Deployment
βββ:closed_book: Flask tutorial
βββ:video_camera: TensorFlow: Data and Deployment Specialization
βββ:video_camera: Deploy Models with TensorFlow Serving and Flask
βββ:video_camera: How to Deploy a Machine Learning Model to Google Cloud - Daniel Bourke
βββif you`re intersted in more deployment methods, search for (FastAPI - Heroku - chitra)
8. Probabilistic Graphical Models
βββ:video_camera: Specialization
Tasks and Projects will be added soon. β³
βββAnaconda
βββGit
βββcourse
π More Books ~ π Check This!
ββ:atom::atom::atom::atom::atom:
βββ:closed_book: π₯ 65 Free Important Books π₯
βββ:closed_book: Mathematics for Machine Learning
βββ:closed_book: An Introduction to Statistical Learning
βββ:closed_book: Understanding Machine Learning: From Theory to Algorithms
βββ:closed_book: Probabilistic Machine Learning: An Introduction
Competitions will make you even more proficient in Data Science.
When we talk about top data science competitions, Kaggle is one of the most popular platforms for data science. Kaggle has a lot of competitions where you can participate according to your knowledge level.
You can also check these platforms for data science competitions-
- Driven Data
- Codalab
- Iron Viz
- Topcoder
- CrowdANALYTIX Community
- Bitgrit
π Data Science Interview Questions:
π Data Analysis Recommendations.
βββ Books (:closed_book: The Data Analysis Workshop &
β:closed_book: Head First Data Analysis)
βββ FWD - (The 3 Levels)
βββ Google Data Analytics Professional Certificate
βββ IBM Data Analyst Professional Certificate
π Data Engineering Recommendations.
βββ Roadmap 1
βββ Roadmap 2
π Data & AI Companies in Egypt