Stars
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化,模型拥有1B参数,支持中英文。
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Bringing BERT into modernity via both architecture changes and scaling
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
Pytorch implementation of TrAISformer---A generative transformer for AIS trajectory prediction (https://arxiv.org/abs/2109.03958).
PRGC: Potential Relation and Global Correspondence Based Joint Relational Triple Extraction
Train a 1B LLM with 1T tokens from scratch by personal
released code for our EMNLP22 paper: UniRel: Unified Representation and Interaction for Joint Relational Triple Extraction
ACL2024: TTM-RE Memory-Augmented Document-Level Relation Extraction