-
Shanghai Jiao Tong University
- Shanghai
Highlights
- Pro
Stars
A generative world for general-purpose robotics & embodied AI learning.
Visualizing the attention of vision-language models
An open-source implementation for training LLaVA-NeXT.
This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strategy.
[ICML 2024] Language Models Represent Beliefs of Self and Others
Code for the "Long Context Needs Some R&R" paper.
The nnsight package enables interpreting and manipulating the internals of deep learned models.
cooelf / dive-into-llms
Forked from Lordog/dive-into-llmsDive-into-LLMs Tutorial for Beginners
This repository contains the code for the paper: Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models
This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Simple implementation of Speculative Sampling in NumPy for GPT-2.
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…