Stars
Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
R1-onevision, a visual language model capable of deep CoT reasoning.
JitouchApp / Jitouch
Forked from sukolsak/jitouch
A multi-touch extension for MacBook, Magic Mouse, and Magic Trackpad
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
GitHub repository for the paper "Have Seen Me Before? Automating Dataset Updates Towards Reliable and Timely Evaluation"
A set of Python scripts for preprocessing the Wikidata JSON dump and running simple queries in an efficient manner.
[EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models
Official Anytype client for MacOS, Linux, and Windows
A tool for extracting plain text from Wikipedia dumps
Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.
Code and Checkpoints for "Generate rather than Retrieve: Large Language Models are Strong Context Generators" in ICLR 2023.
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
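A concise and elegant dictionary and translation macOS app. Works out of the box, with offline OCR recognition and support for Youdao Dictionary, 🍎 Apple System Dictionary, 🍎 Apple System Translation, OpenAI, Gemini, DeepL, Google, Bing, Tencent, Baidu, Alibaba, Niutrans, Caiyun, and Volcano Translation. A concise and elegant Dictionary and Translator macOS App for looking up words an…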
Logiqa2.0 dataset - logical reasoning in MRC and NLI tasks
The project page for "LOGIC-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning"
Using GPT to organize and access information, and generate questions. The long-term goal is to make an agent-like research assistant.
QLoRA: Efficient Finetuning of Quantized LLMs