-
08:22
(UTC -12:00) - https://www.woaipdf.cn/
Lists (1)
Sort Name ascending (A-Z)
Stars
A curated list of awesome research papers, projects, code, dataset, workshops etc. related to virtual try-on.
💮 amazing QRCode generator in Python (supporting animated gif) - Python amazing 二维码生成器(支持 gif 动态图片二维码)
Dear PyGui: A fast and powerful Graphical User Interface Toolkit for Python with minimal dependencies
A ComfyUI custom node designed for advanced image background removal and object segmentation, utilizing multiple models including RMBG-2.0, INSPYRENET, BEN, SAM, and GroundingDINO.
Unofficial implementation of BRIA RMBG Model for ComfyUI
🍞🎨 Full-featured photo image editor using canvas. It is really easy, and it comes with great filters.
Download images from Google, Bing, Baidu. 谷歌、百度、必应图片下载.
FFmpeg auto generated unsafe bindings for C#/.NET and Core (Linux, MacOS and Mono).
The swiss army knife of lossless video/audio editing
Script for Aegisub to cut video and voice files | 在Aegisub中用字幕切割视频和音频文件
Auto-Editor: Efficient media analysis and rendering
A simple GUI to show shot boundary detection based on TransNet V2.
[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding
TransNet V2: Shot Boundary Detection Neural Network
End-to-End Object Detection with Transformers
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)
Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
[EMNLP2024 Demo], [ICASSP 2025] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
[NeurIPS 2021] Moment-DETR code and QVHighlights dataset
Crawl a site to generate knowledge files to create your own custom GPT from a URL
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.