- California
- scottjingtt.github.io
Stars
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Implementation of popular ML algorithms from scratch
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
An Open-source Toolkit for LLM Development
Pathology Language and Image Pre-Training (PLIP) is the first vision and language foundation model for Pathology AI (Nature Medicine). PLIP is a large-scale pre-trained model that can be used to ex…
A playbook for systematically maximizing the performance of deep learning models.
On-device speech-to-text engine powered by deep learning
A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. Leveraging OpenAI's GPT-3.5, Pinecone, FAISS, and Cel…
(TPAMI 2024) A Survey on Open Vocabulary Learning
【LLMs九层妖塔】分享 LLMs在自然语言处理(ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等)、信息检索(langchain)、语言合成、语言识别、多模态等领域(Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等)等 实战与经验。
EVA Series: Visual Representation Fantasies from BAAI
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
Data preprocessing for IUPUI-CSRC Pedestrian Situated Intent (PSI) benchmark dataset.
Contains scripts for the PSI competition.
Official Page for PSI benchmark.
[ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling
Official Repository of ChatCaptioner
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Notebooks using the Hugging Face libraries 🤗
LAVIS - A One-stop Library for Language-Vision Intelligence
PyTorch code and models for the DINOv2 self-supervised learning method.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything