  • Huazhong University of Science and Technology
  • Wuhan, China


A generative world for general-purpose robotics & embodied AI learning.

Python 21,350 1,678 Updated Jan 3, 2025

Liquid: Language Models are Scalable Multi-modal Generators

55 Updated Dec 12, 2024

[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Python 254 6 Updated Sep 11, 2024

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Python 22 Updated Dec 9, 2024

(NeurIPS 2024) Learning to Visual Question Answering, Asking and Assessment

Python 79 2 Updated Nov 7, 2024

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 641 16 Updated Dec 30, 2024

(ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator

Python 114 Updated Oct 17, 2024

Official repo for "VisionZip: Longer is Better but Not Necessary in Vision Language Models"

Python 203 8 Updated Dec 28, 2024

A taxonomy of industrial anomaly detection methods and datasets (updating).

73 6 Updated Oct 31, 2024

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Python 9,309 725 Updated Aug 5, 2024

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,239 179 Updated Dec 31, 2024

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,096 219 Updated Dec 3, 2024

Train transformer language models with reinforcement learning.

Python 10,496 1,355 Updated Dec 29, 2024

LLaMA 2 implemented from scratch in PyTorch

Python 273 51 Updated Sep 25, 2023

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 4,259 368 Updated Dec 22, 2024

[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation

Python 316 22 Updated Jul 17, 2024

[Arxiv] Towards Generic Anomaly Detection and Understanding: Large-scale Visual-linguistic Model (GPT-4V) Takes the Lead.

117 6 Updated Nov 24, 2023

[CVPR 2023] Unofficial re-implementation of "WinCLIP: Zero-/Few-Shot Anomaly Classification and Segmentation".

Python 283 25 Updated Mar 16, 2024

[IEEE TII 2023] Collaborative Discrepancy Optimization for Reliable Image Anomaly Localization

Python 66 7 Updated May 13, 2023

Official implementation of "Segment Any Anomaly without Training via Hybrid Prompt Regularization (SAA+)".

Jupyter Notebook 761 76 Updated Dec 20, 2023

[ECCV 2024] The Official Implementation for "AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection"

Python 171 8 Updated Dec 26, 2024
Python 22 1 Updated Sep 27, 2024

Refine high-quality datasets and visual AI models

Python 9,029 584 Updated Jan 3, 2025

[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)

Python 890 50 Updated Dec 2, 2024

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 595 63 Updated Aug 30, 2024

Using SparseInst as a Detector for Video Instance Segmentation

Python 3 Updated Sep 10, 2024

ICCV'2023 | CTVIS: Consistent Training for Online Video Instance Segmentation

Python 71 4 Updated Oct 15, 2023

DROID Policy Learning and Evaluation

Python 152 12 Updated Dec 21, 2024

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 443 63 Updated Dec 23, 2024