-
Ph.D. Student in University of Science and Technology of China (USTC)
- Singapore
-
16:27
(UTC +08:00) - https://scholar.google.com/citations?user=qWOFgUcAAAAJ&hl=zh-CN
Highlights
- Pro
Stars
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
pytorch handbook是一本开源的书籍,目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门,其中包含的Pytorch教程全部通过测试保证可以成功运行
Instruct-tune LLaMA on consumer hardware
本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
LAVIS - A One-stop Library for Language-Vision Intelligence
Best Practices, code samples, and documentation for Computer Vision.
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…
Reference models and tools for Cloud TPUs.
Learn python3 in one picture.
A simplified implemention of Faster R-CNN that replicate performance from origin paper
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
Early stopping for PyTorch
Replication of simple CV Projects including attention, classification, detection, keypoint detection, etc.
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiase…
Repository for ShowUI: One Vision-Language-Action Model for GUI Visual Agent
Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
This is the PyTorch implementation of VGG network trained on CIFAR10 dataset
我的强化学习笔记和学习材料:book: still updating ... ...
MoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition;
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
Spatial-Temporal Transformer for Dynamic Scene Graph Generation, ICCV2021
Open-vocabulary Object Segmentation with Diffusion Models