An R package for computing feature embeddings a la carte
and estimating text embedding regression models as described in Rodriguez, Spirling and Stewart (2021).
devtools::install_github("prodriguezsosa/conText")
To use conText you will need three datasets:
- A (quanteda) corpus with the documents and corresponding document variables you want to evaluate.
- A set of (GloVe) pre-trained embeddings.
- A transformation matrix specific to the pre-trained embeddings.
In this Dropbox folder (see the /data folder) we have included the three datasets we use in the Quick Start Guide along with their documentation (see the /man folder) and source files (see the /data-raw folder).
Check out this Quick Start Guide to get going with conText
.