Stars
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
润学全球官方指定GITHUB,整理润学宗旨、纲领、理论和各类润之实例;解决为什么润,润去哪里,怎么润三大问题; 并成为新中国人的核心宗教,核心信念。
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
A bepinex mod to enhance game experience (maybe)
Implementation of Graph Convolutional Networks in TensorFlow
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
salaniz / pycocoevalcap
Forked from tylin/coco-captionPython 3 support for the MS COCO caption evaluation tools
Simple image captioning model
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
A Transformer model that caption images
Image Captioning using CNN and Transformer.
computer vision projects | 计算机视觉相关好玩的AI项目(Python、C++、embedded system)
Attention Is All You Need | a PyTorch Tutorial to Transformers
LAVIS - A One-stop Library for Language-Vision Intelligence
Graph Neural Network Library for PyTorch
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Open source, roguelike deck-building card game template
Must-read papers on graph neural networks (GNN)
深度学习入门教程, 优秀文章, Deep Learning Tutorial
A faster pytorch implementation of faster r-cnn
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
How Powerful are Graph Neural Networks?
Image captioning model with Resnet50 encoder and LSTM decoder