Stars
✨✨Latest Advances on Multimodal Large Language Models
Visual Instruction Tuning towards General-Purpose Multimodal Model: A Survey
Collection of AWESOME vision-language models for vision tasks
📺 A place to discover the latest machine learning courses on YouTube.
The Little Prince - Embeddings using OpenAI's text-embedding-ada model. This app allows you to speak with The Little Prince book.
This python script uses OpenAI API Text To Speech TTS Voice to convert Epub books to Audiobooks with ability to save progress and resume it.
A simple tool to download audio books from Tokybook.com
"Be careful about reading health books. Some fine day you'll die of a misprint." ― Markus Herz
"There is no friend as loyal as a book." ― Ernest Hemingway
经济学人(含音频)、纽约客、卫报、连线、大西洋月刊等英语杂志免费下载,支持epub、mobi、pdf格式, 每周更新
The official GitHub page for the survey paper "A Survey of Large Language Models".
「3D视觉(三维重建、SLAM、AR/VR) + 传统图像处理 + 计算机视觉(偏AI) 」重要知识点和面试问题。
在 oxford hand 数据集上对 YOLOv3 做模型剪枝(network slimming)
Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。
Papers for CNN, object detection, keypoint detection, semantic segmentation, medical image processing, SLAM, etc.
🚗 The 1st Place Submission to AICity Challenge 2020 re-id track (Baidu-UTS submission)
Lists the papers related to imbalance problems in object detection [TPAMI]