- College Park, Maryland
- styx97.github.io
- @rupak_53
Stars
Fast computation of Krippendorff's alpha agreement measure in Python.
Access a database of word frequencies, in various natural languages.
A repo providing starter code and a demo for the Slurm usage guide at UMD CLIP
Learn LeetCode and prepare for coding interviews with free resources.
A barebones set of utilities that help run few-shot prompting using hf
A repo to serve Hugging Face models using FastAPI
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
A toolkit to create, launch and monitor SLURM jobs over existing python scripts.
potato: portable text annotation tool
Datasets of the daily Twitter output of Congress.
Download and process tweets from the Spritzer sample uploaded to Archive.org
📝 A text file containing 479k English words for all your dictionary/word-based projects, e.g. auto-completion / autosuggestion
Python programs, usually short, of considerable difficulty, to perfect particular skills.
This repository contains code and models for the paper: Exploring Question-Specific Rewards for Generating Deep Questions (COLING 2020).
Indic language identification supporting Romanized variants of Indian languages
Dynamically typed Bangla programming language written in Rust
A Python module to bypass Cloudflare's anti-bot page.
Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
TensorFlow code and pre-trained models for BERT
Workaround for Intel throttling issues in Linux.
Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, …
A set of notebooks and other resources regarding my deep learning experiments