-
Amazon Alexa AI.
- San Jose
-
PaddleSeg Public
Forked from PaddlePaddle/PaddleSegEasy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …
Python Apache License 2.0 UpdatedMar 17, 2023 -
img2dataset Public
Forked from rom1504/img2datasetEasily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Python MIT License UpdatedMar 8, 2023 -
Dreambooth-Stable-Diffusion Public
Forked from XavierXiao/Dreambooth-Stable-DiffusionImplementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Jupyter Notebook MIT License UpdatedNov 17, 2022 -
stable-diffusion-1 Public
Forked from justinpinkney/stable-diffusionJupyter Notebook MIT License UpdatedOct 26, 2022 -
stable-diffusion Public
Forked from CompVis/stable-diffusionJupyter Notebook Other UpdatedSep 11, 2022 -
unilm Public
Forked from microsoft/unilmLarge-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Python MIT License UpdatedSep 4, 2022 -
botocore Public
Forked from boto/botocoreThe low-level, core functionality of boto 3.
Python Apache License 2.0 UpdatedJul 11, 2022 -
all-in-one Public
Forked from showlab/all-in-one[Arxiv2022] All in One: Exploring Unified Video-Language Pre-training
Python UpdatedJun 7, 2022 -
CLIP4Clip Public
Forked from ArrowLuo/CLIP4ClipAn official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Python MIT License UpdatedJun 1, 2022 -
ViTCAP Public
Implementation for CVPR 2022 paper " Injecting Semantic Concepts into End-to-End Image Captionin".
-
pytorchvideo Public
Forked from facebookresearch/pytorchvideoA deep learning library for video understanding research.
Python Apache License 2.0 UpdatedMay 20, 2022 -
SwinBERT Public
Forked from microsoft/SwinBERTResearch code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
Python MIT License UpdatedMay 11, 2022 -
-
ASU-Thesis-Format Public
Forked from Mahdisadjadi/ASU-Thesis-FormatASU Thesis Format
TeX UpdatedMar 20, 2022 -
denseflow Public
Forked from open-mmlab/denseflowExtracting optical flow and frames
C++ MIT License UpdatedMar 1, 2022 -
-
video-swin-transformer-pytorch Public
Forked from haofanwang/video-swin-transformer-pytorchVideo Swin Transformer - PyTorch
Python MIT License UpdatedJan 4, 2022 -
-
WebQA Public
Forked from WebQnA/WebQAShell Creative Commons Zero v1.0 Universal UpdatedOct 15, 2021 -
color-aware-style-transfer Public
Forked from mahmoudnafifi/color-aware-style-transferReference code for the paper CAMS: Color-Aware Multi-Style Transfer.
Jupyter Notebook UpdatedAug 27, 2021 -
SparseR-CNN Public
Forked from PeizeSun/SparseR-CNNEnd-to-End Object Detection with Learnable Proposal, CVPR2021
-
Video2Commonsense Public
Video captioning baseline models on Video2Commonsense Dataset.
-
-
LocalizingMoments Public
Forked from LisaAnne/LocalizingMomentsGithub for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"
OpenEdge ABL UpdatedOct 31, 2020 -
markdown-content Public
Forked from aerobatic/markdown-contentMarkdown content for the www.aerobatic.io website
UpdatedOct 2, 2020 -
info-ground Public
Forked from BigRedT/info-groundLearning phrase grounding from captioned images through InfoNCE bound on mutual information
Python Other UpdatedAug 22, 2020 -
maskrcnn-benchmark Public
Forked from amsword/maskrcnn-benchmarkFast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
-
IMRAM Public
Forked from HuiChen24/IMRAMcode for our CVPR2020 paper "IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval"
Python UpdatedJul 25, 2020 -
-