This is my work for when I participated in Home Hackathon 2018: recommendation system competition. Given the data of user access logs from January to February 23rd, the goal is to predict the next new pages that each user will visit during the last week of February. Only pages that each user has not visited during the training period are treated as a new page. I got the "Technical Excellence" award from this hackathon.
Competition Detail (in Thai): [Link1] [Link2]
- Result Presentation: [Slide]
- Data Exploration: [notebook]
- Data Preparation: [notebook]
- Bot data removal: [notebook]
- Item2Vec Embeddings from user views: [notebook]
- Doc2Vec Embeddings from descriptions: [notebook]
- Baseline (Most frequent properties): [notebook]
- Matrix factorization (SVD): [notebook]
- Item2Vec embeddings: [notebook]
- LSTM: [notebook]
- GRU: [notebook]