Skip to content

Latest commit

 

History

History
410 lines (203 loc) · 11.3 KB

README.md

File metadata and controls

410 lines (203 loc) · 11.3 KB

Important-stuff

This is a collection of links/documents/tutorials which are good.

##Recommendations and Search links

Session Aware recommendations

  1. https://sites.google.com/view/seq-recsys-tutorial/home

Personalization and search

Kamelia Aryafar- Learning to Rank in e-commerce AIWTB 2017

https://www.youtube.com/watch?v=QjTi1qcLTQw

Learning to Rank from Clicks - Zach Alexander & Tracy Backes, Salesforce.com

https://www.youtube.com/watch?v=pDY2lReiCqk

Amazon Search: The Joy of Ranking Products

https://www.youtube.com/watch?v=NLrhmn-EZ88

Alex Smola - Personalization and Scalable Deep Learning with MXNET

https://www.youtube.com/watch?v=ofaPq5aRKZ0

Deep Learning for Personalized Search and Recommender Systems part 1

https://www.youtube.com/watch?v=0DYQzZp68ok

Deep Learning for Personalized Search and Recommender Systems part 2

https://www.youtube.com/watch?v=kAXVRAoyfCU

Airbnb search and personalization

http://videos.re-work.co/videos/357-search-ranking-and-personalization-at-airbnb

Papers to read

https://faculty.washington.edu/hemay/search_personalization.pdf

http://ftp.cs.wisc.edu/machine-learning/shavlik-group/li.nips14.pdf

https://arxiv.org/pdf/1712.07525.pdf

Recent trends in NLP

Recommendation engine using deep learning survey https://arxiv.org/pdf/1707.07435.pdf

Learning to Rank/ Recommender system papers and video

https://qconsf.com/sf2016/system/files/presentation-slides/scaling_quality_using_machine_learning_-_qcon_sf_2016.pdf

To read

https://github.com/ayushidalmia/awesome-fashion-ai#datasets

You tube recommendations ppt

http://dparra.sitios.ing.uc.cl/classes/recsys-2016-2/students/DNNyoutube_FdelRio.pdf

Data science

Print out

https://www.interviewcake.com/python-interview-questions

https://code.tutsplus.com/articles/advanced-python-data-structures—net-32748

https://github.com/jmportilla/Complete-Python-Bootcamp

https://code.tutsplus.com/articles/advanced-python-data-structures—net-32748 https://code.tutsplus.com/tutorials/how-to-read-and-write-csv-files-in-python—cms-29907 (https://code.tutsplus.com/tutorials/how-to-read-and-write-csv-files-in-python%E2%80%94cms-29907) https://code.tutsplus.com/articles/understanding-args-and-kwargs-in-python—cms-29494 https://code.tutsplus.com/tutorials/counting-word-frequency-in-a-file-using-python—cms-25965 https://code.tutsplus.com/tutorials/quick-tip-how-to-read-extremely-large-text-files-using-python—cms-25992

Files https://courses.cs.washington.edu/courses/cse142/11au/python/06-files.pdf https://courses.cs.washington.edu/courses/cse142/11au/python/07-lists.pdf

Data types

https://www.datacamp.com/community/tutorials/data-structures-python

Strings https://www.datacamp.com/community/tutorials/python-string-tutorial

Lists

https://www.datacamp.com/community/tutorials/18-most-common-python-list-questions-learn-python

https://www.datacamp.com/community/tutorials/python-list-comprehension

Dictionaries

https://www.datacamp.com/community/tutorials/python-dictionary-comprehension#nested

Tuples

https://www.datacamp.com/community/tutorials/python-tuples-tutorial

Iterator https://www.datacamp.com/community/tutorials/python-iterator-tutorial

Regular expressions https://www.datacamp.com/community/tutorials/python-regular-expression-tutorial

Pandas https://www.datacamp.com/community/tutorials/pandas-multi-index https://github.com/PacktPublishing/Pandas-Cookbook

Numpy

Machine learning

http://martin.zinkevich.org/rules_of_ml/rules_of_ml.pdf

Python Idiomatic code https://www.datacamp.com/community/tutorials/pandas-idiomatic

https://github.com/SigmaQuan/Better-Python-59-Ways

https://gist.github.com/csparpa/9409804

Big O https://www.hackerearth.com/practice/notes/big-o-cheatsheet-series-data-structures-and-algorithms-with-thier-complexities-1/

Sorting algorithms https://betterexplained.com/articles/sorting-algorithms/

https://startupsventurecapital.com/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5

https://latesthackingnews.com/2016/12/10/comparison-various-sorting-algorithms/

Deploying models to production

High level discussion https://www.infoworld.com/article/3250852/data-science/handing-off-models-from-data-science-to-it.html https://www.opendatagroup.com/blog/how-to-deploy-analytic-models-into-production-successfully https://medium.com/ocadotechnology/building-ml-models-is-hard-deploying-them-in-real-business-environments-is-harder-c2a0433f527 https://medium.com/ios-os-x-development/future-of-core-ml-model-training-2c6fc97aa58a https://machinelearningmastery.com/deploy-machine-learning-model-to-production/

Implementation examples https://github.com/rasbt/python-machine-learning-book-2nd-edition/tree/master/code/ch09 https://medium.com/redbus-in/how-to-deploy-scikit-learn-ml-models-d390b4b8ce7a https://github.com/orgesleka/webscikit https://blog.keras.io/building-a-simple-keras-deep-learning-rest-api.html https://medium.com/@dvelsner/deploying-a-simple-machine-learning-model-in-a-modern-web-application-flask-angular-docker-a657db075280 https://www.pyimagesearch.com/2018/01/29/scalable-keras-deep-learning-rest-api/ https://www.pyimagesearch.com/2018/02/05/deep-learning-production-keras-redis-flask-apache/ https://docs.microsoft.com/en-us/machine-learning-server/operationalize/python/quickstart-deploy-python-web-service https://github.com/Microsoft/ML-Server-Python-Samples https://blog.solutotlv.com/deployed-scikit-learn-model-flask-docker/?utm_source=Github&utm_medium=python-flask-sklearn-docker-template http://www.jamessturtevant.com/posts/Deploying-Python-Website-To-Azure-Web-with-Docker/ https://guillaumegenthial.github.io/serving.html https://towardsdatascience.com/a-flask-api-for-serving-scikit-learn-models-c8bcdaa41daa https://github.com/keiraqz/StreamingLogisticRegression https://github.com/shivasj/Integrating-a-Machine-Learning-Model-into-a-Web-app

Kubernotes

https://deis.com/blog/2016/kubernetes-illustrated-guide/ https://www.slideshare.net/PetteriTeikariPhD/deploying-deep-learning-models-with-docker-and-kubernetes

Restfull API https://github.com/RestCheatSheet/api-cheat-sheet

https://www.vinaysahni.com (https://www.vinaysahni.com/)

PARSING information from Resumes for learning coding style https://github.com/divapriya/Language_Processing/blob/master/resumeParser.py

How to write production level code for data science projects https://towardsdatascience.com/how-to-write-a-production-level-code-in-data-science-5d87bd75ced

Data convertion

int() str() float()

Using the right data structure http://careerdrill.com/blog/coding-interview/choosing-the-right-data-structure-to-solve-problems/

Problem statement: Automatically categorize the customer complaints into different categories so that a chatbot skills can answer the question

steps taken:

leetcode and hackerank - strings, sorting and searching , hash tables, dynamic programming, recursion,

coin flipping questions and probabilty questions

http://mathforum.org/library/drmath/sets/select/dm_coin_tossing.html https://brilliant.org/wiki/bayes-theorem/ https://www-old.math.gatech.edu/academic/courses/core/math1711/html/bayes.html https://onlinecourses.science.psu.edu/stat414/

A/b Testing https://conversionxl.com/blog/ab-testing-guide/

https://medium.com/the-graph/insights-from-an-evening-with-recommender-systems-experts-ab44d677dc5e https://www.youtube.com/watch?v=SxU0CJJ2nVE

Recommender systems

https://medium.com/recombee-blog/evaluating-recommender-systems-choosing-the-best-one-for-your-business-c688ab781a35

ebay-tech

Apache spark ML pipelines

https://www.confluent.io/blog/build-deploy-scalable-machine-learning-production-apache-kafka/

Unix and shell scripting

https://www.youtube.com/watch?v=YHFzr-akOas&list=PLS1QulWo1RIb9WVQGJ_vh-RQusbZgO_As

https://www.kdnuggets.com/2018/06/command-line-tricks-data-scientists.html

Udacity : Shell scripting and Linux commands

Spark Sql

https://datascienceplus.com/leveraging-hive-with-spark-using-python/

https://datascienceplus.com/multi-class-text-classification-with-pyspark/

https://developerzen.com/best-practices-writing-production-grade-pyspark-jobs-cb688ac4d20f

Machine learning pipelines course

REST API deployment

Algorithms and data structures:

Spark:

Shell scripting best tutorials https://www.youtube.com/watch?v=hwrnmQumtPw http://www.newthinktank.com/2016/06/shell-scripting-tutorial/

Unix cheatsheet - google

Kafka

Pyspark

Computer vision

Tensorflow :

Spark best resources http://www.sparktutorials.net/tutorials https://github.com/vinta/albedo https://github.com/awantik/pyspark-tutorial/wiki/PySpark-SQL-Module https://alvinalexander.com/scala/how-to-open-read-text-files-in-scala-cookbook-examples https://wrobstory.gitbooks.io/python-to-scala/index.html https://github.com/rohgar/scala-spark-4/wiki/Optimizing-with-Partitioners https://github.com/jleetutorial/scala-spark-tutorial http://crscardellino.me/archive/

Spark custom transformations https://medium.com/@mrpowers/the-different-type-of-spark-functions-custom-transformations-column-functions-udfs-bf556c9d0ce7

https://medium.com/@mrpowers/chaining-custom-dataframe-transformations-in-spark-a39e315f903c

https://medium.com/@mrpowers/chaining-custom-dataframe-transformations-in-spark-a39e315f903c

https://medium.com/@mrpowers/chaining-custom-pyspark-transformations-4f38a8c7ae55

https://medium.com/@mrpowers/chaining-custom-dataframe-transformations-in-spark-a39e315f903c http://nadbordrozd.github.io/blog/2016/05/22/one-weird-trick-that-will-fix-your-pyspark-schemas/

https://github.com/elenacuoco/bosch-kaggle-competition-spark/tree/master/python-utils

https://github.com/MingChen0919/learning-apache-spark

Pyspark http://changhsinlee.com/pyspark-udf/ http://changhsinlee.com/pyspark-dataframe-basics/ https://github.com/changhsinlee/changhsinlee.github.io/tree/master/notebook https://legacy.gitbook.com/book/databricks/databricks-spark-knowledge-base/details https://github.com/jubins/Spark-And-MLlib-Projects/blob/master/Spark_DataFrame_API_Project/Spark_DataFrames_API.ipynb

learn scala https://github.com/PacktPublishing/Scala-and-Spark-for-Big-Data-Analytics https://github.com/PacktPublishing/Learning-Spark-SQL https://github.com/PacktPublishing/Scala-Machine-Learning-Projects

Implement this using scala spark https://www.codementor.io/jadianes/building-a-web-service-with-apache-spark-flask-example-app-part2-du1083854

projects https://github.com/Yifeng-He/Large-Scale-Machine-Learning-with-Spark-for-Marketing https://github.com/sohilsshah91/Airline-Stock-Prediction-Using-Google-Trends-Oil-Prices

Spark ML notes https://github.com/vaquarkhan/vk-wiki-notes/wiki http://www.waitingforcode.com (http://www.waitingforcode.com/) https://legacy.gitbook.com/book/umbertogriffo/apache-spark-best-practices-and-tuning/details https://www.slideshare.net/cloudera/top-5-mistakes-to-avoid-when-writing-apache-spark-applications

Optimizing spark jobs https://michalsenkyr.github.io/2018/01/spark-performance#comments https://zerogravitylabs.ca/spark-performance-tuning-checklist/ https://umbertogriffo.gitbooks.io/apache-spark-best-practices-and-tuning/content/references.html https://stackoverflow.com/questions/43831387/how-to-avoid-shuffles-while-joining-dataframes-on-unique-keys

Complete

NLP CMU course Tensorflow NLP stanford

kaggle - Image problem

Spark -

git, version control, shell,unix commands - practise