1. Sentiment Analysis( SST.ipynb )
2. Natural Language Inference(MULTI_NLI.ipynb)
The data used is sst from huggingface datasets. The dataset is already imported in the code.
The model used is ElMO which is head and a classifier is attached to it.
The ElMO model is made using pytorch's BiLSTM. Two layers BiLSTM are stacked. The output of the first layer is fed to the second layer. The output of the second layer is fed to the classifier. The ElMO is first pretrained on language modelling task and then fine tuned on the sentiment analysis task.
WHen the classifier is attached the ElMO model, the ElMO model is freezed and only the classifier is trained except the lambda parameters. The classifier is a simple MLP.
The Training is done using optimisers which I thoought are fine one can change them as per their wish.
The data used is multi-nli from huggingface datasets. The dataset is already imported in the code.
The model used is ElMO which is head and a classifier is attached to it.
The ElMO model is made using pytorch's BiLSTM. Two layers BiLSTM are stacked. The output of the first layer is fed to the second layer. The output of the second layer is fed to the classifier. The ElMO is first pretrained on language modelling task and then fine tuned on the sentiment analysis task.
WHen the classifier is attached the ElMO model, the ElMO model is freezed and only the classifier is trained except the lambda parameters. The classifier is a simple MLP.
The Training is done using optimisers which I thoought are fine one can change them as per their wish.