This model applies a 1D Convolutional Net on the headline and body text, represented at the word level using the Google News pretrained vectors. The output of this CNNs is then sent to a MLP with 4-class output (agree
,disagree
,discuss
,unrelated
) and trained end-to-end. The model was regularized using dropout (p=.5
) in all Convolutional layers. All hyperparameters of this model were set to sensible defaults, however they were not further evaulated to find better choices.
The final model was trained on the FNC-1 baseline training set and evaluated against the baseline validation set. The highest scoring parameters during training were saved, then applied to the final test set. This approach scores roughly 3850 on the validation set.
For more information on model selection and further research, please view our blog post (coming soon!).
This model requires a Theano
installation using the GpuArray
backend. Additionally, it requires Cuda
with CuDNN
to be correctly set up on the system. Replacing CuDNN
Conv Ops with vanilla Theano
Conv Ops may allow this code to be run on CPU, but was not tested.