-
The Chinese university of Hong Kong
- Hong Kong
- https://julietljy.github.io/
Stars
Official repo for "VisionZip: Longer is Better but Not Necessary in Vision Language Models"
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"
This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"
[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)
A guidance language for controlling large language models.
BAL: Balancing Diversity and Novelty for Active Learning - Official Pytorch Implementation
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…
Download the source latex code of multiple arXiv paper with one click
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
[CVPR 2023] The official implementation of CVPR 2023 paper "Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes"
Official PyTorch implementation of MOOD series: (1) MOODv1: Rethinking Out-of-distributionDetection: Masked Image Modeling Is All You Need. (2) MOODv2: Masked Image Modeling for Out-of-Distribution…
Video-P2P: Video Editing with Cross-attention Control
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
[NeurIPS'22] An official PyTorch implementation of PTv2.
State-of-the-art, simple, fast unbounded / large-scale NeRFs.
The official code for our ECCV22 oral paper: tracking objects as pixel-wise distributions.
Variational Adversarial Active Learning (ICCV 2019)