-
VideoLLaMA3 Public
Forked from DAMO-NLP-SG/VideoLLaMA3Frontier Multimodal Foundation Models for Image and Video Understanding
Jupyter Notebook Apache License 2.0 UpdatedJan 26, 2025 -
VideoLLaMA2 Public
Forked from DAMO-NLP-SG/VideoLLaMA2VideoLLaMA 2: Improving Video-LLMs with Convolutional Spatial-Temporal Aggregation and Stronger Audio Capability
-
Inf-CLIP Public
Forked from DAMO-NLP-SG/Inf-CLIP💣💣 The official codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A super memory-efficiency CLIP training scheme.
Python Apache License 2.0 UpdatedJan 16, 2025 -
OpenVIS Public
Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.
-
-
-
CMM Public
Forked from DAMO-NLP-SG/CMM✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
Python UpdatedOct 17, 2024 -
img2dataset Public
Forked from rom1504/img2datasetEasily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Python MIT License UpdatedAug 24, 2024 -
llm_interview_note Public
Forked from wdndev/llm_interview_note主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedAug 16, 2024 -
Open-Sora-Plan Public
Forked from PKU-YuanGroup/Open-Sora-PlanThis project aim to reproducing Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
-
TrackEval Public
Forked from JonathonLuiten/TrackEvalHOTA (and other) evaluation metrics for Multi-Object Tracking (MOT).
-
-
[cvpr2023] implementation of out-of-candidate rectification methods
-
-
Tissue-Image-Segmentation Public
Re-implementation of popular tissue image segmentation models
-
Computer-Vision-Course Public
2021 Fall Computer Vision (Jian Zhang)
-
mmsegmentation Public
Forked from open-mmlab/mmsegmentationOpenMMLab Semantic Segmentation Toolbox and Benchmark.
-
mmcv Public
Forked from open-mmlab/mmcvOpenMMLab Computer Vision Foundation
Python Apache License 2.0 UpdatedJul 22, 2021 -
graduate_project Public
This is my undergraduate period graduate project.
Python UpdatedJun 1, 2021 -
-
-
-
-
yolov3_wider_darknet Public
I have trained on WIDER FACE dataset by using darknet to get .weights file.Then I convert it to ckpt or pb file to use it.