Skip to content
View mattdahl's full-sized avatar

Highlights

  • Pro

Organizations

@aspc @alumNY @reglab

Block or report mattdahl

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

High-performance retrieval engine for unstructured data

Python 1,241 88 Updated Mar 13, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 16,710 1,088 Updated Mar 12, 2025

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 970 71 Updated Jan 31, 2025

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 18,763 2,327 Updated Mar 13, 2025

High accuracy RAG for answering questions from scientific documents with citations

Python 7,059 693 Updated Mar 12, 2025

This is the repo for the LegalBench-RAG Paper: https://arxiv.org/abs/2408.10343.

Python 72 8 Updated Jan 15, 2025

Testing Language Models for Memorization of Tabular Datasets.

Jupyter Notebook 33 5 Updated Feb 10, 2025
Python 1 Updated Sep 29, 2023

🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.

JavaScript 151 10 Updated Mar 7, 2025

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 5,940 350 Updated Jul 21, 2024

A guidance language for controlling large language models.

Jupyter Notebook 19,864 1,090 Updated Mar 13, 2025

Python implementation of iterative-random-forests

Cython 64 21 Updated Dec 20, 2023

ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its go…

Jupyter Notebook 4,005 739 Updated Mar 10, 2025

Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).

Jupyter Notebook 1,437 124 Updated Mar 5, 2025

CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms

Python 286 62 Updated Oct 2, 2023

Find legal citations in any block of text

Python 136 37 Updated Mar 11, 2025

OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

Python 749 109 Updated Jul 25, 2024

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

Python 1,223 149 Updated Feb 4, 2025

⚫ A spaCy pipeline and model for NLP on unstructured legal text.

Python 649 101 Updated Jul 16, 2024

LexNLP by LexPredict

Jupyter Notebook 713 180 Updated May 27, 2024

Legal citation extractor, via command line, JavaScript, or HTTP. See a live example at:

JavaScript 230 45 Updated Jun 7, 2020

Manage AWS Glacier vaults in Django and backup local files to Glacier.

Python 2 1 Updated Dec 11, 2015

Back your Django database and media directory up to Amazon Glacier or a local file

Python 25 11 Updated Jul 25, 2018

Meteor, the JavaScript App Platform

JavaScript 44,583 5,203 Updated Mar 12, 2025

Node.js CMS and web app framework

JavaScript 14,611 2,203 Updated Dec 13, 2023