Skip to content
View zhezh's full-sized avatar
  • Microsoft Research Asia (Research Intern)

Block or report zhezh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.

Python 2,876 188 Updated Mar 6, 2025

An open-source framework for training large multimodal models.

Python 3,835 297 Updated Aug 31, 2024

Represent, send, store and search multimodal data

Python 3,023 234 Updated Feb 25, 2025

深度学习经典、新论文逐段精读

29,341 2,590 Updated Nov 17, 2024

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Jupyter Notebook 2,658 357 Updated Mar 2, 2025

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Jupyter Notebook 1,557 96 Updated Feb 16, 2024

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 985 100 Updated Jul 29, 2024

全国各省市停贷通知汇总

HTML 20,726 2,173 Updated Jul 13, 2024

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Python 4,086 379 Updated Aug 13, 2024

Free Google Translator API 免费的Google翻译

Python 191 57 Updated Mar 11, 2023

🌐 Google 翻译 Mac 客户端

TypeScript 793 109 Updated Jul 26, 2021

A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).

657 48 Updated Jan 7, 2024

Repo for external large-scale work

Python 6,516 727 Updated Apr 27, 2024

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Python 1,546 151 Updated Mar 3, 2025

Azure HPC/AI VM Images

Shell 101 80 Updated Feb 28, 2025

A data augmentations library for audio, image, text, and video.

Python 4,990 303 Updated Feb 28, 2025

一个多模态内容理解算法框架,其中包含数据处理、预训练模型、常见模型以及模型加速等模块。

Python 309 54 Updated Oct 26, 2021

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

1,151 104 Updated Aug 19, 2022

Run upstream VS Code on a remote machine with access through a modern web browser from any device, anywhere.

TypeScript 5,230 462 Updated Mar 5, 2025

VS Code in the browser

TypeScript 70,104 5,794 Updated Mar 6, 2025

Bridging Vision and Language Model

Python 282 31 Updated Mar 27, 2023

This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.

Python 227 37 Updated Jul 4, 2022

Reading list for research topics in multimodal machine learning

6,300 872 Updated Aug 20, 2024

我终于能用谷歌搜中文了……

7,474 281 Updated Feb 29, 2024

A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]

Python 3,942 640 Updated May 6, 2024

CenterFusion: Center-based Radar and Camera Fusion for 3D Object Detection

Python 565 147 Updated Oct 25, 2022

Use the browser's online image format converter, no need to upload files, you can convert jpeg, jpg, png, gif, webp, svg, ico, bmp files to jpeg, png, webp animation, gif, base64,avif,mozjpeg. 使用浏览…

JavaScript 1,867 283 Updated Mar 21, 2023

Official implementation of "VoxelPose: Towards Multi-Camera 3D Human Pose Estimation in Wild Environment"

Python 508 94 Updated Jul 24, 2023

中国提供计算机视觉(CV)算法岗位的公司名单,欢迎大家提交issues进行补充

982 91 Updated Jul 1, 2024
Next