prodriguezsosa / conText Public

Notifications You must be signed in to change notification settings
Fork 19
Star 102

An R package for estimating and doing statistical inference on context-specific word embeddings.

102 stars 19 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 62 Commits
R		R
data-raw		data-raw
data		data
man		man
vignettes		vignettes
.DS_Store		.DS_Store
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
DESCRIPTION		DESCRIPTION
NAMESPACE		NAMESPACE
README.md		README.md
conText.Rproj		conText.Rproj

Repository files navigation

About

An R package for computing feature embeddings a la carte and estimating text embedding regression models as described in Rodriguez, Spirling and Stewart (2021).

How to Install

devtools::install_github("prodriguezsosa/conText")

Datasets

To use conText you will need three datasets:

A (quanteda) corpus with the documents and corresponding document variables you want to evaluate.
A set of (GloVe) pre-trained embeddings.
A transformation matrix specific to the pre-trained embeddings.

In this Dropbox folder (see the /data folder) we have included the three datasets we use in the Quick Start Guide along with their documentation (see the /man folder) and source files (see the /data-raw folder).