Starred repositories
[AAAI2024] Official implementation of SurgicalSAM
Repository for "SAM-Path: A Segment Anything Model for Semantic Segmentation in Digital Pathology" (MedAGI2023, MICCAI2023 workshop)
Self-Prompting Polyp Segmentation in Colonoscopy Using Hybrid YOLO-SAM 2 Model
Unofficial implementation of YOLO-World + EfficientSAM for ComfyUI
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), MobileSAM!!
Segment Anything in Medical Images
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Medical SAM 2: Segment Medical Images As Video Via Segment Anything Model 2
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
A curated list of awesome self-supervised learning methods in videos
A professional list of Papers, Tutorials, and Surveys on AI for Time Series in top AI conferences and journals.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
LAVIS - A One-stop Library for Language-Vision Intelligence
Paper List for Contrastive Learning for Natural Language Processing
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences from C4 using a tagged corruption model. The approach and the d…
Automatically Update NLP Papers Daily using GitHub Actions (ref: https://github.com/Vincentqyw/cv-arxiv-daily)
A collection of recent papers on building autonomous agents. Two topics are included: RL-based and LLM-based agents.
Port of OpenAI's Whisper model in C/C++
A collection of ChatGPT and GPT-3.5 instruction-based prompts for generating and classifying text.
Code for the paper "Language Models are Unsupervised Multitask Learners"
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
✅ Solutions to LeetCode in Go, 100% test coverage, runtime beats 100% / LeetCode solutions