Skip to content
View peraktong's full-sized avatar

Block or report peraktong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

DeepEP: an efficient expert-parallel communication library

Cuda 7,206 642 Updated Mar 14, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 6,953 547 Updated Feb 24, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,756 2,200 Updated Feb 1, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Jupyter Notebook 7,697 496 Updated Mar 18, 2025

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,009 43 Updated Feb 23, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 10,367 1,140 Updated Mar 17, 2025

The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 1,781 112 Updated Mar 12, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 24,413 2,123 Updated Mar 17, 2025

Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

Python 1,390 169 Updated Feb 7, 2025

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 11,548 2,437 Updated Feb 10, 2025

数字人资料整理

748 86 Updated Jan 8, 2025

A large-scale benchmark and learning environment.

Python 1,319 264 Updated Jan 25, 2025

Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation

Python 3,340 255 Updated Jan 21, 2025

Code for SIGGRAPH 2020 paper "RigNet: Neural Rigging for Articulated Characters"

Python 1,427 193 Updated Nov 4, 2024

GLM-4-Voice | 端到端中英语音对话模型

Python 2,761 225 Updated Dec 5, 2024

The Memory layer for AI Agents

Python 26,355 2,496 Updated Mar 18, 2025

Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.

Python 15,374 1,604 Updated Mar 18, 2025

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

Python 234 11 Updated Apr 22, 2024

[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation

Python 5,702 482 Updated Sep 9, 2024
Python 438 46 Updated Jul 19, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 37,236 4,406 Updated Aug 19, 2024

Command-line program to download videos from YouTube.com and other video sites

Python 134,709 10,245 Updated Mar 11, 2025

A series of code large language models developed by PKU-KCL

Python 1,625 119 Updated Jul 18, 2024

[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Python 3,319 266 Updated Dec 27, 2024

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python 51,855 5,693 Updated Mar 18, 2025

real time face swap and one-click video deepfake with only a single image

Python 44,678 6,595 Updated Mar 15, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,865 508 Updated Mar 18, 2025

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 12,524 1,586 Updated Feb 29, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 7,266 561 Updated Feb 26, 2025
Next