Stars
Automate browser-based workflows with LLMs and Computer Vision
data cleaning and curation for unstructured text
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
JupyterLab computational environment.
An open science effort to benchmark legal reasoning in foundation models
A fully featured React components library
🛤 Detection of elements in viewport & smooth scrolling with parallax.
Punctuation Restoration using Transformer Models for High-and Low-Resource Languages
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project.
Lexical is an extensible text editor framework that provides excellent reliability, accessibility and performance.
Find legal citations in any block of text
Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset of 53,000+ Legal Holdings"
Create graphics with a hand-drawn, sketchy, appearance
A python module that wraps the pdftoppm utility to convert PDF to PIL Image object
SECDatabase.com produced this dataset with the text and detailed numeric information of all financial statements. The Dataset is extracted from corporate annual and quarterly reports filed with the…
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
A manual for UM-SJTU JI students to survive better.
Sample code for the Twitter API v2 endpoints
A reading list of up-to-date papers on NLP for Social Good.
Coding knowledge base for TBP MI-G members.
Paper List for Style Transfer in Text
Ultra-simplified 3D Finite Element Method simulation in Python3.
Simple calculator built with React