Lists (1)
Sort Name descending (Z-A)
Stars
Pytorch code for paper From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models
Multimodal Whole Slide Foundation Model for Pathology
We present a comprehensive and deep review of the HFM in challenges, opportunities, and future directions. The released paper: https://arxiv.org/abs/2404.03264
BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks
中文nlp解决方案(大模型、数据、模型、训练、推理)
PyTorch code and models for the DINOv2 self-supervised learning method.
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge managemen…
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Dear ImGui: Bloat-free Graphical User interface for C++ with minimal dependencies
SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese
A lightweight utility that makes the Windows taskbar translucent/transparent.
Combining deep neural networks with PCA and k-NN classification for abdominal organ recognition in ultrasound images.
Organ Classification on Abdominal Ultrasound using Javascript