Zhengzhong Tu vztu

🦝

Feeding Raccoons

Assistant Professor of CS at TAMU

247 followers · 260 following

@texas A&M University @google @google-research @UTAustin
College Station, TX
https://vztu.github.io
@_vztu
in/zhengzhongtu

Starred repositories

taco-group / AutoTrust

AutoTrust, a groundbreaking benchmark designed to assess the trustworthiness of DriveVLMs. This work aims to enhance public safety by ensuring DriveVLMs operate reliably across critical dimensions.

Python 38 2 Updated Dec 24, 2024

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 3,882 315 Updated Feb 27, 2025

sNiper-Qian / pianomime

Python 42 1 Updated Oct 17, 2024

stefanos50 / CARLA2Real

CARLA2Real is a tool that enhances the photorealism of the CARLA simulator in real-time, leveraging the Enhancing Photorealism Enhancement model proposed by Intel Labs.

Python 15 2 Updated Feb 7, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA Decoding Kernel for Hopper GPUs

C++ 10,523 669 Updated Feb 27, 2025

taco-group / Re-Align

A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models.

Python 30 Updated Feb 19, 2025

taco-group / STAMP

[ICLR'25] Official Implementation of STAMP: Scalable Task And Model-agnostic Collaborative Perception

Python 17 1 Updated Feb 4, 2025

taco-group / OpenEMMA

OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model.

Python 515 67 Updated Feb 19, 2025

Q-Future / Visual-Question-Answering-for-Video-Quality-Assessment

Official released code for VQA² series models

Python 30 1 Updated Jan 31, 2025

rhymes-ai / Allegro

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,051 61 Updated Feb 7, 2025

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 8,804 716 Updated Feb 20, 2025

TrustGen / TrustEval-toolkit

TrustEval: A modular and extensible toolkit for comprehensive trust evaluation of generative foundation models (GenFMs)

Python 76 5 Updated Feb 25, 2025

jkwang28 / OSDFace

132 5 Updated Dec 8, 2024

taco-group / Video-Quality-Assessment-A-Comprehensive-Survey

The Most Comprehensive Survey of Video Quality Assessment to Date.

64 1 Updated Dec 24, 2024

Junyi42 / monst3r

Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"

Python 1,042 60 Updated Feb 14, 2025

Q-Future / Q-Align

③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.

Python 368 24 Updated Aug 12, 2024

XPixelGroup / DiffBIR

[ECCV 2024] codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Python 3,582 302 Updated Dec 12, 2024

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 889 49 Updated Feb 8, 2025

taco-group / MWFormer

[TIP2024] MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers

Python 45 3 Updated Dec 6, 2024

deepseek-ai / DeepSeek-V3

Python 89,686 14,455 Updated Feb 24, 2025

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,411 2,158 Updated Feb 1, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 21,660 1,919 Updated Feb 27, 2025

ChiWeiHsiao / ref-ldm

The official implementation of "ReF-LDM: A Latent Diffusion Model for Reference-based Face Image Restoration" [NeurIPS 2024]

Python 17 Updated Feb 25, 2025

JWFangit / LOTVS-DADA

Driver Attention Prediction in Accidental Scenarios

Python 96 14 Updated Dec 11, 2024

Cogito2012 / CarCrashDataset

[ACM MM 2020] CCD dataset for traffic accident anticipation.

103 10 Updated Sep 2, 2023

georghess / neurad-studio

[CVPR2024] NeuRAD: Neural Rendering for Autonomous Driving

Python 381 28 Updated Jan 22, 2025

cxm12 / UNiFMIR

Pretraining a foundation model for generalizable fluorescence microscopy-based image restoration

Python 47 3 Updated Apr 24, 2024

taco-group / HFMF

HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection

Jupyter Notebook 8 2 Updated Jan 6, 2025

taco-group / COVER

🏆 [CVPRW 2024] COVER: A Comprehensive Video Quality Evaluator. 🥇 Winner solution for Video Quality Assessment Challenge at the 1st AIS 2024 workshop @ CVPR 2024

Python 51 4 Updated Jul 18, 2024

facebookresearch / unibench

Python Library to evaluate VLM models' robustness across diverse benchmarks

Jupyter Notebook 191 14 Updated Feb 27, 2025
