This is a collection of links/documents/tutorials which are good.
##Recommendations and Search links
Session Aware recommendations
Personalization and search
Kamelia Aryafar- Learning to Rank in e-commerce AIWTB 2017
https://www.youtube.com/watch?v=QjTi1qcLTQw
Learning to Rank from Clicks - Zach Alexander & Tracy Backes, Salesforce.com
https://www.youtube.com/watch?v=pDY2lReiCqk
Amazon Search: The Joy of Ranking Products
https://www.youtube.com/watch?v=NLrhmn-EZ88
Alex Smola - Personalization and Scalable Deep Learning with MXNET
https://www.youtube.com/watch?v=ofaPq5aRKZ0
Deep Learning for Personalized Search and Recommender Systems part 1
https://www.youtube.com/watch?v=0DYQzZp68ok
Deep Learning for Personalized Search and Recommender Systems part 2
https://www.youtube.com/watch?v=kAXVRAoyfCU
Airbnb search and personalization
http://videos.re-work.co/videos/357-search-ranking-and-personalization-at-airbnb
Papers to read
https://faculty.washington.edu/hemay/search_personalization.pdf
http://ftp.cs.wisc.edu/machine-learning/shavlik-group/li.nips14.pdf
https://arxiv.org/pdf/1712.07525.pdf
Recent trends in NLP
Recommendation engine using deep learning survey https://arxiv.org/pdf/1707.07435.pdf
Learning to Rank/ Recommender system papers and video
To read
https://github.com/ayushidalmia/awesome-fashion-ai#datasets
You tube recommendations ppt
http://dparra.sitios.ing.uc.cl/classes/recsys-2016-2/students/DNNyoutube_FdelRio.pdf
Data science
Print out
https://www.interviewcake.com/python-interview-questions
https://code.tutsplus.com/articles/advanced-python-data-structures—net-32748
https://github.com/jmportilla/Complete-Python-Bootcamp
https://code.tutsplus.com/articles/advanced-python-data-structures—net-32748 https://code.tutsplus.com/tutorials/how-to-read-and-write-csv-files-in-python—cms-29907 (https://code.tutsplus.com/tutorials/how-to-read-and-write-csv-files-in-python%E2%80%94cms-29907) https://code.tutsplus.com/articles/understanding-args-and-kwargs-in-python—cms-29494 https://code.tutsplus.com/tutorials/counting-word-frequency-in-a-file-using-python—cms-25965 https://code.tutsplus.com/tutorials/quick-tip-how-to-read-extremely-large-text-files-using-python—cms-25992
Files https://courses.cs.washington.edu/courses/cse142/11au/python/06-files.pdf https://courses.cs.washington.edu/courses/cse142/11au/python/07-lists.pdf
Data types
https://www.datacamp.com/community/tutorials/data-structures-python
Strings https://www.datacamp.com/community/tutorials/python-string-tutorial
Lists
https://www.datacamp.com/community/tutorials/18-most-common-python-list-questions-learn-python
https://www.datacamp.com/community/tutorials/python-list-comprehension
Dictionaries
https://www.datacamp.com/community/tutorials/python-dictionary-comprehension#nested
Tuples
https://www.datacamp.com/community/tutorials/python-tuples-tutorial
Iterator https://www.datacamp.com/community/tutorials/python-iterator-tutorial
Regular expressions https://www.datacamp.com/community/tutorials/python-regular-expression-tutorial
Pandas https://www.datacamp.com/community/tutorials/pandas-multi-index https://github.com/PacktPublishing/Pandas-Cookbook
Numpy
Machine learning
http://martin.zinkevich.org/rules_of_ml/rules_of_ml.pdf
Python Idiomatic code https://www.datacamp.com/community/tutorials/pandas-idiomatic
https://github.com/SigmaQuan/Better-Python-59-Ways
https://gist.github.com/csparpa/9409804
Sorting algorithms https://betterexplained.com/articles/sorting-algorithms/
https://latesthackingnews.com/2016/12/10/comparison-various-sorting-algorithms/
Deploying models to production
High level discussion https://www.infoworld.com/article/3250852/data-science/handing-off-models-from-data-science-to-it.html https://www.opendatagroup.com/blog/how-to-deploy-analytic-models-into-production-successfully https://medium.com/ocadotechnology/building-ml-models-is-hard-deploying-them-in-real-business-environments-is-harder-c2a0433f527 https://medium.com/ios-os-x-development/future-of-core-ml-model-training-2c6fc97aa58a https://machinelearningmastery.com/deploy-machine-learning-model-to-production/
Implementation examples https://github.com/rasbt/python-machine-learning-book-2nd-edition/tree/master/code/ch09 https://medium.com/redbus-in/how-to-deploy-scikit-learn-ml-models-d390b4b8ce7a https://github.com/orgesleka/webscikit https://blog.keras.io/building-a-simple-keras-deep-learning-rest-api.html https://medium.com/@dvelsner/deploying-a-simple-machine-learning-model-in-a-modern-web-application-flask-angular-docker-a657db075280 https://www.pyimagesearch.com/2018/01/29/scalable-keras-deep-learning-rest-api/ https://www.pyimagesearch.com/2018/02/05/deep-learning-production-keras-redis-flask-apache/ https://docs.microsoft.com/en-us/machine-learning-server/operationalize/python/quickstart-deploy-python-web-service https://github.com/Microsoft/ML-Server-Python-Samples https://blog.solutotlv.com/deployed-scikit-learn-model-flask-docker/?utm_source=Github&utm_medium=python-flask-sklearn-docker-template http://www.jamessturtevant.com/posts/Deploying-Python-Website-To-Azure-Web-with-Docker/ https://guillaumegenthial.github.io/serving.html https://towardsdatascience.com/a-flask-api-for-serving-scikit-learn-models-c8bcdaa41daa https://github.com/keiraqz/StreamingLogisticRegression https://github.com/shivasj/Integrating-a-Machine-Learning-Model-into-a-Web-app
Kubernotes
https://deis.com/blog/2016/kubernetes-illustrated-guide/ https://www.slideshare.net/PetteriTeikariPhD/deploying-deep-learning-models-with-docker-and-kubernetes
Restfull API https://github.com/RestCheatSheet/api-cheat-sheet
https://www.vinaysahni.com (https://www.vinaysahni.com/)
PARSING information from Resumes for learning coding style https://github.com/divapriya/Language_Processing/blob/master/resumeParser.py
How to write production level code for data science projects https://towardsdatascience.com/how-to-write-a-production-level-code-in-data-science-5d87bd75ced
Data convertion
int() str() float()
Using the right data structure http://careerdrill.com/blog/coding-interview/choosing-the-right-data-structure-to-solve-problems/
Problem statement: Automatically categorize the customer complaints into different categories so that a chatbot skills can answer the question
steps taken:
leetcode and hackerank - strings, sorting and searching , hash tables, dynamic programming, recursion,
coin flipping questions and probabilty questions
http://mathforum.org/library/drmath/sets/select/dm_coin_tossing.html https://brilliant.org/wiki/bayes-theorem/ https://www-old.math.gatech.edu/academic/courses/core/math1711/html/bayes.html https://onlinecourses.science.psu.edu/stat414/
A/b Testing https://conversionxl.com/blog/ab-testing-guide/
https://medium.com/the-graph/insights-from-an-evening-with-recommender-systems-experts-ab44d677dc5e https://www.youtube.com/watch?v=SxU0CJJ2nVE
Recommender systems
ebay-tech
Apache spark ML pipelines
https://www.confluent.io/blog/build-deploy-scalable-machine-learning-production-apache-kafka/
Unix and shell scripting
https://www.youtube.com/watch?v=YHFzr-akOas&list=PLS1QulWo1RIb9WVQGJ_vh-RQusbZgO_As
https://www.kdnuggets.com/2018/06/command-line-tricks-data-scientists.html
Udacity : Shell scripting and Linux commands
Spark Sql
https://datascienceplus.com/leveraging-hive-with-spark-using-python/
https://datascienceplus.com/multi-class-text-classification-with-pyspark/
https://developerzen.com/best-practices-writing-production-grade-pyspark-jobs-cb688ac4d20f
Machine learning pipelines course
REST API deployment
Algorithms and data structures:
Spark:
Shell scripting best tutorials https://www.youtube.com/watch?v=hwrnmQumtPw http://www.newthinktank.com/2016/06/shell-scripting-tutorial/
Unix cheatsheet - google
Kafka
Pyspark
Computer vision
Tensorflow :
Spark best resources http://www.sparktutorials.net/tutorials https://github.com/vinta/albedo https://github.com/awantik/pyspark-tutorial/wiki/PySpark-SQL-Module https://alvinalexander.com/scala/how-to-open-read-text-files-in-scala-cookbook-examples https://wrobstory.gitbooks.io/python-to-scala/index.html https://github.com/rohgar/scala-spark-4/wiki/Optimizing-with-Partitioners https://github.com/jleetutorial/scala-spark-tutorial http://crscardellino.me/archive/
Spark custom transformations https://medium.com/@mrpowers/the-different-type-of-spark-functions-custom-transformations-column-functions-udfs-bf556c9d0ce7
https://medium.com/@mrpowers/chaining-custom-dataframe-transformations-in-spark-a39e315f903c
https://medium.com/@mrpowers/chaining-custom-dataframe-transformations-in-spark-a39e315f903c
https://medium.com/@mrpowers/chaining-custom-pyspark-transformations-4f38a8c7ae55
https://medium.com/@mrpowers/chaining-custom-dataframe-transformations-in-spark-a39e315f903c http://nadbordrozd.github.io/blog/2016/05/22/one-weird-trick-that-will-fix-your-pyspark-schemas/
https://github.com/elenacuoco/bosch-kaggle-competition-spark/tree/master/python-utils
https://github.com/MingChen0919/learning-apache-spark
Pyspark http://changhsinlee.com/pyspark-udf/ http://changhsinlee.com/pyspark-dataframe-basics/ https://github.com/changhsinlee/changhsinlee.github.io/tree/master/notebook https://legacy.gitbook.com/book/databricks/databricks-spark-knowledge-base/details https://github.com/jubins/Spark-And-MLlib-Projects/blob/master/Spark_DataFrame_API_Project/Spark_DataFrames_API.ipynb
learn scala https://github.com/PacktPublishing/Scala-and-Spark-for-Big-Data-Analytics https://github.com/PacktPublishing/Learning-Spark-SQL https://github.com/PacktPublishing/Scala-Machine-Learning-Projects
Implement this using scala spark https://www.codementor.io/jadianes/building-a-web-service-with-apache-spark-flask-example-app-part2-du1083854
projects https://github.com/Yifeng-He/Large-Scale-Machine-Learning-with-Spark-for-Marketing https://github.com/sohilsshah91/Airline-Stock-Prediction-Using-Google-Trends-Oil-Prices
Spark ML notes https://github.com/vaquarkhan/vk-wiki-notes/wiki http://www.waitingforcode.com (http://www.waitingforcode.com/) https://legacy.gitbook.com/book/umbertogriffo/apache-spark-best-practices-and-tuning/details https://www.slideshare.net/cloudera/top-5-mistakes-to-avoid-when-writing-apache-spark-applications
Optimizing spark jobs https://michalsenkyr.github.io/2018/01/spark-performance#comments https://zerogravitylabs.ca/spark-performance-tuning-checklist/ https://umbertogriffo.gitbooks.io/apache-spark-best-practices-and-tuning/content/references.html https://stackoverflow.com/questions/43831387/how-to-avoid-shuffles-while-joining-dataframes-on-unique-keys
Complete
NLP CMU course Tensorflow NLP stanford
kaggle - Image problem
Spark -
git, version control, shell,unix commands - practise