-
CUHK
- Hong Kong
Lists (1)
Sort Name ascending (A-Z)
Stars
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition
Official repo for "VisionZip: Longer is Better but Not Necessary in Vision Language Models"
Autonomous Agents (LLMs) research papers. Updated Daily.
An open source Bitcoin wallet password and seed recovery tool designed for the case where you already know most of your password/seed, but need assistance in trying different possible combinations.
TruthfulQA: Measuring How Models Imitate Human Falsehoods
Sparse AutoEncoders for Clamping LLM Behavior. Inspired by Anthropic.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
An Open Source Implementation of Anthropic's Paper: "Towards Monosemanticity: Decomposing Language Models with Dictionary Learning"
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Official Codebase of "DiffComplete: Diffusion-based Generative 3D Shape Completion"
🚀 A very efficient Texas Holdem GTO solver
Agentless🐱: an agentless approach to automatically solve software development problems
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Lumina-T2X is a unified framework for Text to Any Modality Generation
The official source code for "X-Ray: A Sequential 3D Representation for Generation".
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
The official implementation of Self-Play Fine-Tuning (SPIN)
Doing simple retrieval from LLM models at various context lengths to measure accuracy
(ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life
Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
[ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision