yamano1212

9 followers · 14 following

Stars

yutaojiang1 / Diffuse3D

Official pytorch implementation of "Diffuse3D: Wide-Angle 3D Photography via Bilateral Diffusion"

Python 24 Updated Nov 2, 2023

sony / genwarp

Python 256 20 Updated Sep 26, 2024

xingyi-li / 3d-cinemagraphy

[CVPR 2023] 3D Cinemagraphy from a Single Image

Python 274 14 Updated May 5, 2024

yerfor / Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Python 996 119 Updated Oct 18, 2024

satoshiiizuka / siggraphasia2019_remastering

Code for the paper "DeepRemaster: Temporal Source-Reference Attention Networks for Comprehensive Video Enhancement". http://iizuka.cs.tsukuba.ac.jp/projects/remastering/

Python 486 103 Updated Jan 6, 2022

microsoft / Bringing-Old-Photos-Back-to-Life

Bringing Old Photos Back to Life (CVPR 2020 oral)

Python 15,318 2,037 Updated Oct 26, 2023

yangdongchao / RSTnet

Real-time Speech-Text Foundation Model Toolkit (wip)

Python 128 12 Updated Oct 14, 2024

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,506 600 Updated Feb 9, 2025

Thytu / Agentarium

Open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for designing complex, interactive environments where agents can act,…

Python 898 73 Updated Jan 30, 2025

gusye1234 / nano-graphrag

A simple, easy-to-hack GraphRAG implementation

Python 2,411 233 Updated Jan 15, 2025

humanlayer / humanlayer

HumanLayer enables AI agents to communicate with humans in tool-based and async workflows. Guarantee human oversight of high-stakes function calls with approval workflows across slack, email and mo…

Python 589 49 Updated Feb 6, 2025

KanjiVG / kanjivg

Kanji vector graphics

Python 1,109 188 Updated Feb 19, 2025

Lux-AI-Challenge / Lux-Design-S3

Repository for the Lux AI Challenge, season 3 @NeurIPS 24. Hosted on @kaggle

Python 302 63 Updated Feb 5, 2025

pashanitw / W2V2-BERT-ASR-Training

Python 13 1 Updated Mar 25, 2024

pashanitw / xeus-finetune

Python 10 1 Updated Aug 20, 2024

h1karu-s / pretraining_LayoutLMv3_PubLayNet

Jupyter Notebook 23 1 Updated Mar 7, 2023

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance

Python 7,040 536 Updated Dec 25, 2024

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.

Python 67,608 7,250 Updated Feb 18, 2025

kohya-ss / sd-scripts

Python 5,709 931 Updated Feb 18, 2025

aiola-lab / whisper-medusa

Whisper with Medusa heads

Python 821 50 Updated Feb 11, 2025

ostris / ai-toolkit

Various AI scripts. Mostly Stable Diffusion stuff.

Python 4,018 454 Updated Feb 18, 2025

XLabs-AI / x-flux

Python 1,877 134 Updated Nov 8, 2024

apple / ml-mdm

Train high-quality text-to-image diffusion models in a data & compute efficient manner

Python 475 36 Updated Feb 12, 2025

piddnad / DDColor

[ICCV 2023] DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders

Jupyter Notebook 1,213 127 Updated Dec 31, 2024

kongzhecn / OMG

[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models

Python 684 45 Updated Jul 2, 2024

bghira / SimpleTuner

A general fine-tuning kit geared toward diffusion models.

Python 2,088 198 Updated Feb 10, 2025

instantX-research / InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,410 833 Updated Jul 18, 2024

kousw / experimental-consistory

Python 109 6 Updated Mar 3, 2024

tosiyuki / LLaVA-JP

LLaVA-JP is a Japanese VLM trained with the LLaVA method

Python 59 13 Updated Jul 3, 2024

LLaVA-VL / LLaVA-NeXT

Python 3,416 311 Updated Feb 13, 2025