Stars
A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, accompanying The 'Problem' of Human Label Variation: On Grou…
Code accompanying the EMNLP 2022 paper "Stop Measuring Calibration When Humans Disagree" in which we show problems with popular calibration metrics like ECE in settings where more than one answer i…
Machine translation models released by the Gourmet project (https://gourmet-project.eu)
PyTorch library for stable computation of the inverse and log-determinant of exp(A) in the log domain
Compare the length of groups of sequences in a hierarchical Bayesian model
Implementations of distributions and estimators for binary latent variables in PyTorch.
Constrained Optimization in PyTorch
DGMs for NLP. A roadmap.
Combine multiple MNIST digits to create datasets with 100/1000 classes for few-shot learning/meta-learning
Documentation on how to access and use the Quick, Draw! Dataset.
Code accompanying the paper "Effective Estimation of Deep Generative Language Models".
📜 Generates a table of contents for Markdown files inside a local Git repository. Links are compatible with anchors generated by GitHub or other sites.
Converts profiling output to a dot graph.
Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Dependencies
Python wrapper for Stanford CoreNLP tools v3.4.1
Pure Python implementation of the Trie data structure