Skip to content
View uyzhang's full-sized avatar
🌏
🌏
  • Haidian district, Beijing
  • 09:18 (UTC +08:00)

Highlights

  • Pro

Block or report uyzhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

O1 Replication Journey: A Strategic Progress Report – Part I

1,689 51 Updated Nov 30, 2024

The related works and background techniques about Openai o1

170 6 Updated Nov 9, 2024

Jittor implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Python 15 3 Updated Oct 19, 2024

MINT-1T: A one trillion token multimodal interleaved dataset.

782 20 Updated Jul 31, 2024

Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks

Python 1,525 212 Updated Dec 17, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,116 837 Updated Dec 16, 2024

Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

Python 326 14 Updated Nov 17, 2024

A curated list of papers in Test-time Adaptation, Test-time Training and Source-free Domain Adaptation

468 45 Updated Jun 23, 2024

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 1,063 64 Updated Jul 14, 2024

[SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization

JavaScript 606 49 Updated Sep 10, 2024

A Monorepo for My ( 🌏 Personal Website / 🔬 Academic Profile ) and { Other Related Projects }

TypeScript 1,012 27 Updated Dec 16, 2024

风云天气是Android 平台开源天气 App,采用Kotlin、Room、OKHttp3、 协程等框架实现。

Kotlin 2,060 257 Updated Sep 14, 2023

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,800 117 Updated Oct 30, 2024

Refine high-quality datasets and visual AI models

Python 8,990 575 Updated Dec 18, 2024

[ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.

Python 62 1 Updated Jul 27, 2024

PixelLM is an effective and efficient LMM for pixel-level reasoning and understanding. PixelLM is accepted by CVPR 2024.

Python 190 5 Updated Jun 3, 2024

The official Meta Llama 3 GitHub site

Python 27,531 3,138 Updated Aug 12, 2024

LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning

Python 104 9 Updated Apr 16, 2024

Official repository for "SODA: Bottleneck Diffusion Models for Representation Learning"

23 Updated Mar 21, 2024

OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,337 50 Updated Dec 11, 2024

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Jupyter Notebook 1,406 79 Updated Jun 28, 2024

JDiffusion is a diffusion model library for generating images or videos based on Diffusers and Jittor.

Python 243 4 Updated Jul 16, 2024

Recent weakly supervised semantic segmentation paper

281 22 Updated Oct 9, 2024

Pytorch❤️ Keras 😋😋

Jupyter Notebook 1,823 239 Updated Oct 28, 2024

The official gpt4free repository | various collection of powerful language models

Python 62,737 13,439 Updated Dec 17, 2024

🎨 Python Echarts Plotting Library

Python 14,977 2,852 Updated Nov 6, 2024

Official Implementation for CVPR 2024 paper: CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor

101 3 Updated Jun 23, 2024

The Fast Cross-Platform Package Manager

C++ 7,024 364 Updated Dec 12, 2024

TikTok 主页/合辑/直播/视频/图集/原声;抖音主页/视频/图集/实况/收藏/直播/原声/合集/评论/账号/搜索/热榜数据采集工具

Python 8,249 1,319 Updated Dec 16, 2024
Next