Highlights
- Pro
Stars
This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR25]
Code for "AVG-LLaVA: A Multimodal Large Model with Adaptive Visual Granularity"
An open-source implementation for training LLaVA-NeXT.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A simple Python wrapper for YouTube Data API ✨ 🍰 ✨ .
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
A playbook for systematically maximizing the performance of deep learning models.
[IEEE Transactions on Medical Imaging/TMI] This repo is the official implementation of "LViT: Language meets Vision Transformer in Medical Image Segmentation"
Codes for # ACL2019 paper "Progressive Self-Supervised Attention Learning for Aspect-Level Sentiment Analysis", which contains TNet-Att(+AS) and MN(+AS)
Code for "Asynchronous bidirectional decoding for neural machine translation" (AAAI, 2018)
Code for "Multi-Domain Neural Machine Translation with Word-Level Domain Context Discrimination"(EMNLP2019)
Code for "Dynamic Context-guided Capsule Network for Multimodal Machine Translation" (ACM MM2020)
Neural Collective Entity Linking Based on Recurrent Random Walk Network Learning. Code from IJCAI 2019 paper.
Code for "Variational Neural Machine Translation" (EMNLP2016)