- University of British Columbia
- Toronto, ON
- https://www.cs.ubc.ca/~rgoyal14/
Stars
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper
Most popular metrics used to evaluate object detection algorithms.
higher is a PyTorch library allowing users to obtain higher-order gradients over losses spanning training loops rather than individual training steps.
A deep learning library for video understanding research.
willprice / GulpIO2
Forked from achaiah/GulpIO
Binary storage format for deep learning on videos.
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
temporal action detection: benchmark results, features download etc.
Enhance your application with the ability to see and interact with humans using any RGB camera.
Make drawing and labeling bounding boxes easy as cake
Learn the skills required to sysadmin a remote Linux server from the commandline.
Activity Recognition Algorithms for the Charades Dataset
[ECCV 2018] Official code for "Graph R-CNN for Scene Graph Generation"
A jupyter notebook serverextension providing config interfaces for nbextensions.
"Scene Graph Generation by Iterative Message Passing" code repository
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiase…
Implementation for the CVPR2019 paper "Graphical Contrastive Losses for Scene Graph Generation"
A video database bridging human actions and human-object relationships
Code for training temporal fully-connected CRF models in Torch
[CVPR'19] [PyTorch] Gated Spatio Temporal Energy Graph
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
Long-Term Feature Banks for Detailed Video Understanding