Stars
A cloud-native vector database, storage for next generation AI applications
Interact with the Deep Search platform for new knowledge explorations and discoveries
A high-throughput and memory-efficient inference and serving engine for LLMs
SGLang is a fast serving framework for large language models and vision language models.
A standardized, fair, and reproducible benchmark for evaluating event extraction approaches
A generative speech model for daily dialogue.
Python module to generate Atom feeds, RSS feeds and podcasts.
Reliability Estimation of News Media Sources: Birds of a Feather Flock Together
Visualization and debugging tool for LangChain workflows
Retrieval and Retrieval-augmented LLMs
LLM powered development for VSCode
Plyvel, a fast and feature-rich Python interface to LevelDB
📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
GPT-4-based personalized arXiv paper assistant bot
Library for creating highly customizable CLI-like progress bars in JavaScript
Parse a Wikipedia dump into tiny files
Roll a Wikipedia dump into MongoDB
A language for constraint-guided and efficient LLM programming.
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
Benchmarks of approximate nearest neighbor libraries in Python
A blazing fast inference solution for text embedding models
A Survey of Attributions for Large Language Models
Resource, Evaluation and Detection Papers for ChatGPT
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.