https://nbviewer.org/github/WellyWong/NLP-Disaster-Tweets/blob/main/NLP_disaster_tweet.ipynb
This is a Kaggle mini-project for DTSA 551 - Introduction to Deep Learning, focusing on text classification. I used models from the Recurrent Neural Network (RNN) family, a choice informed by the inherently sequential nature of tweets: each word depends on the preceding ones to convey meaning, so understanding the context requires considering the order of the words in the sequence. The models compared were:
- Stacked LSTM (see the Keras sketch after this list)
- Bi-directional LSTM with attention
- Universal Sentence Encoder, a pre-trained model developed by Google Research (loading sketch below)
- RoBERTa, a pre-trained model developed by Facebook AI that builds on BERT by addressing some of its limitations and incorporating additional training techniques
- RoBERTa fine-tuned on tweets augmented with the 'keyword' and 'location' fields (see the fine-tuning sketch below)
- RandomForestClassifier fitted to our meta-features data (polarity, subjectivity, word count, character count, and sentence count); a sketch follows this list
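
For reference, below is a minimal sketch of the stacked-LSTM classifier in Keras. The vocabulary size and layer widths are illustrative assumptions, not the notebook's exact values; wrapping each `LSTM` in `layers.Bidirectional(...)` gives the bi-directional variant, with attention applied on top of the returned sequence.

```python
import tensorflow as tf
from tensorflow.keras import layers

VOCAB_SIZE = 20_000  # assumed tokenizer vocabulary size

model = tf.keras.Sequential([
    layers.Embedding(VOCAB_SIZE, 128),
    # The first LSTM returns the full hidden-state sequence so the
    # second LSTM can consume it step by step ("stacking").
    layers.LSTM(64, return_sequences=True),
    layers.LSTM(32),
    layers.Dropout(0.3),
    layers.Dense(1, activation="sigmoid"),  # disaster vs. non-disaster
])
model.compile(optimizer="adam", loss="binary_crossentropy",
              metrics=["accuracy"])
```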
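The Universal Sentence Encoder can be loaded directly from TF Hub. The snippet below is a sketch using the standard public module URL; it maps each tweet to a fixed 512-dimensional embedding, which a small dense classification head can then consume.

```python
import tensorflow_hub as hub

# Universal Sentence Encoder: maps variable-length text to a
# fixed 512-dimensional embedding.
embed = hub.load("https://tfhub.dev/google/universal-sentence-encoder/4")
embeddings = embed(["Forest fire near La Ronge Sask. Canada"])
print(embeddings.shape)  # (1, 512)
```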
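A hedged sketch of the RoBERTa fine-tuning setup with Hugging Face Transformers follows. The checkpoint name, maximum length, and the way 'keyword' and 'location' are folded into the input string are assumptions, not necessarily the notebook's exact choices.

```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("roberta-base")
model = AutoModelForSequenceClassification.from_pretrained(
    "roberta-base", num_labels=2)  # disaster vs. non-disaster

def encode(batch):
    # Keyword/location variant: one plausible formatting is to prepend
    # both fields to the tweet text before tokenizing (assumption).
    text = [f"{kw} {loc} {tweet}" for kw, loc, tweet in
            zip(batch["keyword"], batch["location"], batch["text"])]
    return tokenizer(text, truncation=True, padding="max_length",
                     max_length=64)

# The encoded dataset can then be fine-tuned with transformers.Trainer
# or a plain PyTorch training loop.
```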
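Finally, a sketch of the meta-feature baseline, assuming TextBlob supplies the polarity and subjectivity scores; the exact feature-extraction code in the notebook may differ.

```python
import pandas as pd
from textblob import TextBlob
from sklearn.ensemble import RandomForestClassifier

def meta_features(texts: pd.Series) -> pd.DataFrame:
    """Compute the five meta-features used by the baseline."""
    blobs = texts.apply(TextBlob)
    return pd.DataFrame({
        "polarity":     blobs.apply(lambda b: b.sentiment.polarity),
        "subjectivity": blobs.apply(lambda b: b.sentiment.subjectivity),
        "n_words":      texts.str.split().str.len(),
        "n_chars":      texts.str.len(),
        "n_sentences":  blobs.apply(lambda b: len(b.sentences)),
    })

# Usage (hypothetical DataFrame/column names):
# X_train = meta_features(train_df["text"])
# clf = RandomForestClassifier(n_estimators=300, random_state=42)
# clf.fit(X_train, train_df["target"])
```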
Figure 1: RoBERTa training history
Figure 2: Kaggle leaderboard entry for the best model in this project