ex-GREE, ex-Softbank
- tokyo
- UTC +09:00
- kimaris.vercel.app
- https://orcid.org/0009-0001-9554-0098
Stars
Build ultra fast, tiny, and cross-platform desktop apps with TypeScript.
A Multipurpose toolkit for managing, editing and creating models.
[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
[SIGGRAPH Asia & TOG 2024] This is the official implementation of our SIGGRAPH Asia journal article: TEXGen: a Generative Diffusion Model for Mesh Textures
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
[CVPR 2024] EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Out-of-the-box (OOTB) GUI Agent for Windows and macOS
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
An extension for Forge WebUI that implements IC-Light
Automatic detection, masking, and inpainting with a detection model.
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation
Implementation of MeshGPT, SOTA mesh generation using attention, in PyTorch
From anything to mesh like human artists. Official impl. of "MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization"
Code release for https://image-dream.github.io/
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)
[NeurIPS 2023] Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
A Scalable Pipeline for Making Steerable Multi-Task Mid-Level Vision Datasets from 3D Scans [ICCV 2021]
Animate124: Animating One Image to 4D Dynamic Scene