Skip to content
View moise-g's full-sized avatar

Block or report moise-g

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Secure open source cloud runtime for AI apps & AI agents

HTML 7,402 492 Updated Feb 2, 2025

Build multimodal language agents for fast prototype and production

Python 1,490 144 Updated Jan 26, 2025

OO for LLMs

Python 611 45 Updated Feb 1, 2025

Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"

Python 301 25 Updated Dec 21, 2024

🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.

Python 6,207 582 Updated Feb 1, 2025

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 16,845 2,016 Updated Feb 2, 2025

Structured Text Generation

Python 10,537 555 Updated Jan 31, 2025

RAG that intelligently adapts to your use case, data, and queries

Python 2,823 143 Updated Jan 22, 2025

Cross-Platform, GPU Accelerated Whisper 🏎️

TypeScript 1,771 81 Updated Feb 27, 2024

Optimizing inference proxy for LLMs

Python 1,985 156 Updated Feb 2, 2025

Late Interaction Models Training & Retrieval

Python 230 16 Updated Jan 31, 2025

A timeline of notable generative AI events

HTML 76 1 Updated Jan 30, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 8,300 811 Updated Feb 2, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 11,646 1,179 Updated Jan 29, 2025

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & C…

Python 736 54 Updated Nov 29, 2024

A playbook for systematically maximizing the performance of deep learning models.

27,923 2,301 Updated Jun 18, 2024

Superfast AI decision making and intelligent processing of multi-modal data.

Python 2,339 236 Updated Jan 31, 2025

Provides a common interface to many IR ranking datasets.

Python 339 44 Updated Jan 13, 2025

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 14,312 1,456 Updated Jan 30, 2025

Machine Learning Engineering Open Book

Python 12,604 771 Updated Feb 1, 2025

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Python 835 47 Updated Jan 25, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,071 418 Updated Jan 28, 2025

Mamba SSM architecture

Python 13,866 1,193 Updated Jan 18, 2025

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Jupyter Notebook 2,642 356 Updated Jan 28, 2025

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,218 401 Updated Nov 18, 2024

Tensor library for machine learning

C++ 11,696 1,102 Updated Jan 29, 2025

LLM inference in C/C++

C++ 72,660 10,466 Updated Feb 2, 2025

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 1,949 176 Updated May 25, 2024

Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]

Python 561 56 Updated Mar 10, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,253 2,340 Updated Aug 12, 2024
Next