Stars
Ready-to-use code and tutorial notebooks to boost your way into few-shot learning for image classification.
Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".
Watch, read and lookup: learning to spot signs from multiple supervisors, ACCV 2020 (Best Application Paper)
The official implementation of the paper "SCOPE: Sign Language Contextual Processing with Embedding from LLMs".
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation.
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
This is the official code repository for the paper 'Improving Gloss-free Sign Language Translation by Reducing Representation Density'.
OpenAI CLIP text encoders for multiple languages!
Large-Vocabulary Continuous Sign Language Recognition, 2024
🔥🔥🔥 Latest papers, code, and datasets on Vid-LLMs.
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A collection of awesome video generation studies.
This is the official code repository for the paper 'Cross-modality Data Augmentation for End-to-End Sign Language Translation', accepted at Findings of EMNLP 2023.
Pretrained language models and related optimization techniques developed by Huawei Noah's Ark Lab.
Code for the ICLR'22 paper "Improving Non-Autoregressive Translation Models Without Distillation"
Visual Alignment Constraint for Continuous Sign Language Recognition (ICCV 2021)
A tool for holistic analysis of language generation systems
SLTUNET: A Simple Unified Model for Sign Language Translation (ICLR 2023)
This project provides a set of tools to help data scientists and machine learning engineers process and optimize their datasets and models more effectively. The toolkit handles challenges including, but not limited to, data imbalance, leveraging unlabeled data, sample-difficulty filtering, and dynamic augmentation of the training set.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
A repository for pretraining from scratch and SFT-ing a small-parameter Chinese LLaMa2; a single 24 GB GPU is enough to obtain a chat-llama2 with basic Chinese question-answering ability.