-
UQ & SUSTech
- Shenzhen
-
02:17
(UTC +08:00)
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
[TMI'24] "Masked Deformation Modeling for Volumetric Brain MRI Self-supervised Pre-training".
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
When do we not need larger vision models?
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
📄 适合中文的简历模板收集(LaTeX,HTML/JS and so on)由 @hoochanlon 维护
MINT-1T: A one trillion token multimodal interleaved dataset.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Official repository for the paper "FedLPPA: Learning Personalized Prompt and Aggregation for Federated Weakly-supervised Medical Image Segmentation".
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
ACG2vec (Anime Comics Games to vector) are committed to creating a playground that combines ACG and Deep learning.(文本语义检索、以图搜图、语义搜图、图片超分辨率、推荐系统)
【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
[PR'24] "LDDMM-Face: Large deformation diffeomorphic metric learning for cross-annotation face alignment".
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
An Open-source Toolkit for LLM Development
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
✨✨Latest Advances on Multimodal Large Language Models
Open-Sora: Democratizing Efficient Video Production for All
Official implementation of SET: Superpixel Embeded Transformer for Skin Lesion Segmentation (MedIA2024)
[MICCAI2024] A Parameter and Memory Efficient Transfer Learning Method
A collection of resources on applications of multi-modal learning in medical imaging.
The official repository for "One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts"
An open source implementation of CLIP.
This repository contains code used to prepare the LUMIERE Glioblastoma dataset.
[ICCV 2023] CLIP-Driven Universal Model; Rank first in MSD Competition.