-
Unstructured
- State College, PA
- https://thinkregressively.netlify.app/
Stars
A Python client for the Unstructured hosted API
This repository offers a comprehensive collection of tutorials and implementations for Prompt Engineering techniques, ranging from fundamental concepts to advanced strategies. It serves as an essen…
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation. It allows users to compare different chunking methods and i…
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
Train transformer language models with reinforcement learning.
Codebase for Aria - an Open Multimodal Native MoE
A modular graph-based Retrieval-Augmented Generation (RAG) system
Neo4j graph construction from unstructured data using LLMs
An Open-Source Package for Information Retrieval
Author: Wenhao Yu ([email protected]). WWW'20. Tabular data extraction. Data science experimental evidence (dataset).
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Python package for tackling multi-class imbalance problems. http://www.cs.put.poznan.pl/mlango/publications/multiimbalance/
Port of Google's language-detection library to Python.
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
LLM training code for Databricks foundation models
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Python Library to evaluate VLM models' robustness across diverse benchmarks
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small langua…
Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.