Ready-to-use code and tutorial notebooks to boost your way into few-shot learning for image classification.

Python 1,101 149 Updated Nov 13, 2024

Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".

Python 58 8 Updated Mar 7, 2024

Watch, read and lookup: learning to spot signs from multiple supervisors, ACCV 2020 (Best Application Paper)

Python 29 4 Updated Apr 10, 2023

The official implementation of the paper "SCOPE: Sign Language Contextual Processing with Embedding from LLMs".

4 Updated Sep 26, 2024
Python 5 1 Updated Apr 19, 2022

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Jupyter Notebook 7,942 598 Updated Nov 30, 2024

A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.

347 15 Updated Dec 20, 2024

This is the official code repository for the paper 'Improving Gloss-free Sign Language Translation by Reducing Representation Density'.

Python 18 Updated Nov 27, 2024

OpenAI CLIP text encoders for multiple languages!

Jupyter Notebook 769 72 Updated May 15, 2023

Large-Vocabulary Continuous Sign Language Recognition, 2024

Python 9 1 Updated May 30, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,706 85 Updated Dec 12, 2024

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 966 62 Updated Nov 20, 2024

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

Python 207 24 Updated Dec 16, 2024

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,085 219 Updated Dec 3, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,780 2,291 Updated Aug 12, 2024
Python 3,125 272 Updated Oct 16, 2024

A collection of awesome video generation studies.

TeX 393 15 Updated Dec 20, 2024

This is the official code repository for the paper 'Cross-modality Data Augmentation for End-to-End Sign Language Translation'. Accepted at Findings EMNLP 2023

Python 13 Updated Oct 18, 2023

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Python 3,039 628 Updated Jan 22, 2024

Code for the ICLR'22 paper "Improving Non-Autoregressive Translation Models Without Distillation"

Python 17 2 Updated Mar 11, 2022

PyTorch CTC Decoder bindings

C++ 831 246 Updated Apr 4, 2024

Visual Alignment Constraint for Continuous Sign Language Recognition. (ICCV 2021)

Python 123 20 Updated Mar 23, 2023

A tool for holistic analysis of language generation systems

Python 467 58 Updated Mar 22, 2022

SLTUNET: A Simple Unified Model for Sign Language Translation (ICLR 2023)

Python 28 7 Updated Jul 10, 2023

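This project aims to provide a set of tools that help data scientists and machine learning engineers process and optimize their datasets and models more effectively. The toolkit can handle challenges including, but not limited to, data imbalance, use of unlabeled data, sample-difficulty filtering, and dynamic augmentation of the training set.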

Python 10 1 Updated Mar 13, 2024
Python 2 Updated Feb 2, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,482 4,495 Updated Dec 19, 2024

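A repository for pretraining from scratch and then applying SFT to a small-parameter Chinese LLaMa2; it runs on a single 24 GB GPU and produces a chat-llama2 with basic Chinese question-answering ability.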

Python 2,578 315 Updated May 21, 2024