Stars
ShellSage saves sysadmins’ sanity by solving shell script snafus super swiftly
A system for agentic LLM-powered data processing and ETL
A minimalist yet highly performant, lightweight, lightning fast, multisource, multimodal and local Ingestion, Inference and Indexing solution, built in Rust.
code for turning data sets into trading strategies
Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, backup, re-embed (using any model) or access your vector data …
Portfolio analytics for quants, written in Python
CompanionLLM - A framework to finetune LLMs to be your own sentient conversational companion
A tool for running on-premises large language models with non-public data
Supercharge Your LLM Application Evaluations 🚀
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
Customizable implementation of the self-instruct paper.
An Open-source Toolkit for LLM Development
Fast & Simple repository for pre-training and fine-tuning T5-style models
DataComp: In search of the next generation of multimodal datasets
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
🦜🔗 Build context-aware reasoning applications
Allows to scale the ChatGPT API to multiple simultaneous sessions with infinite contextual and adaptive memory powered by GPT and Redis datastore.
High-speed download of LLaMA, Facebook's 65B parameter GPT model
A multi-voice TTS system trained with an emphasis on quality
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…
Robust Speech Recognition via Large-Scale Weak Supervision