Stars
This repo hosts the different ways to run vllm on ANL HPC system
Globus-compute agentic tool interface
This is a repository with examples to run inference endpoints on various ALCF clusters
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
Scalable RL solution for advanced reasoning of language models
Experiments preparing data for fine-tuning and RAG using DepMap data
misc LLM code that did not go in the other repos
DSPy: The framework for programming—not prompting—language models
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
Code for the paper "LASER: LLM Agent with State-Space Exploration for Web Navigation"
Google Deepmind's PromptBreeder for automated prompt engineering implemented in langchain expression language.
Official inference library for Mistral models
High accuracy RAG for answering questions from scientific documents with citations
A web scraping tool to systematically extract the text of scientific papers and corresponding metadata from university accessible journals.
Example models using DeepSpeed
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Scrape papers from OpenReview using OpenReview API