-
MossVLN Public
Forked from OpenMICG/MossVLNObservation Driven Memory Synergistic Planning for Continuous Vision-Language Navigation
Python Apache License 2.0 UpdatedJun 14, 2024 -
CoCoMeD Public
Forked from OpenMICG/CoCoMeDConsistency Conditioned Memory Augmented Dynamic Diagnosis Model for Medical Visual Question Answering
-
awesome-chatgpt-prompts-zh Public
Forked from PlexPt/awesome-chatgpt-prompts-zhChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。
-
multimodal Public
Forked from facebookresearch/multimodalTorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
Python BSD 3-Clause "New" or "Revised" License UpdatedDec 15, 2022 -
awesome-vision-language-pretraining-papers Public
Forked from yuewang-cuhk/awesome-vision-language-pretraining-papersRecent Advances in Vision and Language PreTrained Models (VL-PTMs)
UpdatedApr 7, 2022 -
-
ALPRO Public
Forked from salesforce/ALPROAlign and Prompt: Video-and-Language Pre-training with Entity Prompts
Python BSD 3-Clause "New" or "Revised" License UpdatedFeb 12, 2022 -
TimeSformer Public
Forked from facebookresearch/TimeSformerThe official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
Python Other UpdatedJan 28, 2022 -
just-ask Public
Forked from antoyang/just-ask[ICCV 2021 Oral] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Jupyter Notebook Apache License 2.0 UpdatedOct 6, 2021 -
ClipBERT Public
Forked from jayleicn/ClipBERT[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
-
activitynet-qa Public
Forked from MILVLG/activitynet-qaAn VideoQA dataset based on the videos from ActivityNet
Python MIT License UpdatedNov 22, 2020 -
hcrn-videoqa Public
Forked from thaolmk54/hcrn-videoqaImplementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
-
HME-VideoQA Public
Forked from fanchenyou/HME-VideoQAHeterogeneous Memory Enhanced Multimodal Attention Model for VideoQA
Python UpdatedSep 21, 2019 -
cvpr2019 Public
Forked from extreme-assistant/CVPR2024-Paper-Code-Interpretationcvpr2019 papers,极市团队整理
UpdatedMay 6, 2019 -
Gated-Spatio-Temporal-Energy-Graph Public
Forked from yaohungt/Gated-Spatio-Temporal-Energy-Graph[CVPR'19] [PyTorch] Gated Spatio Temporal Energy Graph
Python UpdatedApr 2, 2019 -
DenseVideoCaptioning Public
Forked from JaywongWang/DenseVideoCaptioningOfficial Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2018, with code, model and prediction results.
Python MIT License UpdatedMar 13, 2019 -
ns-vqa Public
Forked from kexinyi/ns-vqaNeural-symbolic visual question answering
Python UpdatedJan 13, 2019 -
awesome-question-answering Public
Forked from dapurv5/awesome-question-answeringResources, datasets, papers on Question Answering
UpdatedJan 8, 2019 -
video-caption.pytorch Public
Forked from xiadingZ/video-caption.pytorchpytorch implementation of video captioning
Python MIT License UpdatedMay 18, 2018 -
Layered-Memory-Network Public
Forked from bowong/Layered-Memory-NetworkA Layered Memory Network for MovieQA
Python UpdatedApr 27, 2018 -
cn-deep-learning Public
Forked from udacity/cn-deep-learninghttps://cn.udacity.com/course/deep-learning-nanodegree-foundation--nd101/
Jupyter Notebook MIT License UpdatedApr 4, 2018 -
IndRNN_Theano_Lasagne Public
Forked from Sunnydreamrain/IndRNN_Theano_LasagneThis code is to implement the IndRNN.
Python UpdatedApr 2, 2018 -
indrnn Public
Forked from batzner/indrnnTensorFlow implementation of Independently Recurrent Neural Networks
Python Apache License 2.0 UpdatedMar 31, 2018 -
deep-learning Public
Forked from udacity/deep-learningRepo for the Deep Learning Nanodegree Foundations program.
Jupyter Notebook MIT License UpdatedMar 29, 2018 -
-
film Public
Forked from ethanjperez/filmFiLM: Visual Reasoning with a General Conditioning Layer
Python Other UpdatedFeb 23, 2018 -
Dynamic-Memory-Networks-in-TensorFlow Public
Forked from barronalex/Dynamic-Memory-Networks-in-TensorFlowDynamic Memory Network implementation in TensorFlow
Python MIT License UpdatedFeb 16, 2018 -
SENet Public
Forked from hujie-frank/SENetSqueeze-and-Excitation Networks
Cuda Apache License 2.0 UpdatedFeb 15, 2018 -
Tensorflow-Tutorial Public
Forked from MorvanZhou/Tensorflow-TutorialTensorflow tutorial from basic to hard
Python MIT License UpdatedFeb 13, 2018 -
AIND-CV-FacialKeypoints Public
Forked from udacity/AIND-CV-FacialKeypointsAIND, computer vision capstone project. This repo contains starting code for an end-to-end facial keypoint recognition system that relies on a combination of computer vision and deep learning techn…
Jupyter Notebook MIT License UpdatedFeb 2, 2018