Skip to content
View vztu's full-sized avatar
🦝
Feeding Raccoons
🦝
Feeding Raccoons

Highlights

  • Pro

Block or report vztu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

AutoTrust, a groundbreaking benchmark designed to assess the trustworthiness of DriveVLMs. This work aims to enhance public safety by ensuring DriveVLMs operate reliably across critical dimensions.

Python 38 2 Updated Dec 24, 2024

Wan: Open and Advanced Large-Scale Video Generative Models

Python 3,882 315 Updated Feb 27, 2025
Python 42 1 Updated Oct 17, 2024

CARLA2Real is a tool that enhances the photorealism of the CARLA simulator in real-time, leveraging the Enhancing Photorealism Enhancement model proposed by Intel Labs.

Python 15 2 Updated Feb 7, 2025

FlashMLA: Efficient MLA Decoding Kernel for Hopper GPUs

C++ 10,523 669 Updated Feb 27, 2025

A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models.

Python 30 Updated Feb 19, 2025

[ICLR'25] Official Implementation of STAMP: Scalable Task And Model-agnostic Collaborative Perception

Python 17 1 Updated Feb 4, 2025

OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model.

Python 515 67 Updated Feb 19, 2025

Official released code for VQA² series models

Python 30 1 Updated Jan 31, 2025

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,051 61 Updated Feb 7, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 8,804 716 Updated Feb 20, 2025

TrustEval: A modular and extensible toolkit for comprehensive trust evaluation of generative foundation models (GenFMs)

Python 76 5 Updated Feb 25, 2025

The Most Comprehensive Survey of Video Quality Assessment to Date.

64 1 Updated Dec 24, 2024

Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"

Python 1,042 60 Updated Feb 14, 2025

③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.

Python 368 24 Updated Aug 12, 2024

[ECCV 2024] codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Python 3,582 302 Updated Dec 12, 2024

A fork to add multimodal model training to open-r1

Python 889 49 Updated Feb 8, 2025

[TIP2024] MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers

Python 45 3 Updated Dec 6, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,411 2,158 Updated Feb 1, 2025

Fully open reproduction of DeepSeek-R1

Python 21,660 1,919 Updated Feb 27, 2025

The official implementation of "ReF-LDM: A Latent Diffusion Model for Reference-based Face Image Restoration" [NeurIPS 2024]

Python 17 Updated Feb 25, 2025

Driver Attention Prediction in Accidental Scenarios

Python 96 14 Updated Dec 11, 2024

[ACM MM 2020] CCD dataset for traffic accident anticipation.

103 10 Updated Sep 2, 2023

[CVPR2024] NeuRAD: Neural Rendering for Autonomous Driving

Python 381 28 Updated Jan 22, 2025

Pretraining a foundation model for generalizable fluorescence microscopy-based image restoration

Python 47 3 Updated Apr 24, 2024

HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection

Jupyter Notebook 8 2 Updated Jan 6, 2025

🏆 [CVPRW 2024] COVER: A Comprehensive Video Quality Evaluator. 🥇 Winner solution for Video Quality Assessment Challenge at the 1st AIS 2024 workshop @ CVPR 2024

Python 51 4 Updated Jul 18, 2024

Python Library to evaluate VLM models' robustness across diverse benchmarks

Jupyter Notebook 191 14 Updated Feb 27, 2025
Next
Showing results