- Boston University
- Boston
- cs-people.bu.edu/wdqin
Stars
[ICCV 2023] Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"
Implementation for Machine-Generated Text Localization (ACL 2024 Findings)
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Official implementation of History Aware Multimodal Transformer for Vision-and-Language Navigation (NeurIPS'21).
EMNLP 2021: Visual Goal-Step Inference using wikiHow
Vision-Language Navigation with Random Environmental Mixup
Code for Paper "Improving Cross-Modal Alignment in Vision Language Navigation via Syntactic Information", NAACL 2021
Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation"
Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
Faster R-CNN model in PyTorch, pretrained on Visual Genome with ResNet-101
The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`
AI Research Platform for Reinforcement Learning from Real Panoramic Images.
An easy implementation of Faster R-CNN (https://arxiv.org/pdf/1506.01497.pdf) in PyTorch.