jarkkotulensalo

1 follower · 1 following

Lists (1)

🔮 Future ideas

1 repository

Stars

NVlabs / Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 3,219 192 Updated Jan 30, 2025

Stability-AI / stable-point-aware-3d

SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images

Python 630 66 Updated Jan 22, 2025

opendatalab / MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.

Python 25,171 1,904 Updated Jan 27, 2025

zsyOAOA / InvSR

Arbitrary-steps Image Super-resolution via Diffusion Inversion

Python 866 56 Updated Jan 8, 2025

Stability-AI / sd3.5

Python 938 67 Updated Jan 8, 2025

microsoft / OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,628 446 Updated Jan 27, 2025

FilippoO2 / Quantized-Training-of-SD3

Quantized training of Stable Diffusion 3 Medium to significantly reduce memory usage.

Python 12 2 Updated Jul 10, 2024

THUDM / CogVideo

Text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,486 984 Updated Jan 22, 2025

sayakpaul / diffusers-torchao

End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).

Python 314 9 Updated Jan 1, 2025

rensortino / ColorizeNet

Forked from lllyasviel/ControlNet

Let us control diffusion models for colorization!

Python 36 7 Updated Sep 6, 2023

ManuelFay / ImageSearcher

This repository aims to implement an Image Search engine powered by the CLIP model.

Python 40 2 Updated Jul 15, 2022

VAST-AI-Research / TripoSR

Python 4,839 562 Updated Aug 16, 2024

Vaibhavs10 / optimise-my-whisper

Jupyter Notebook 196 17 Updated May 27, 2024

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 30,693 3,063 Updated Jan 7, 2025

metavoiceio / metavoice-src

Foundational model for human-like, expressive TTS

Python 4,005 674 Updated Jul 30, 2024

andrewnguonly / Lumos

A RAG LLM co-pilot for browsing the web, powered by local LLMs

TypeScript 1,467 107 Updated Jan 26, 2025

threestudio-project / threestudio

A unified framework for 3D content generation.

Jupyter Notebook 6,518 503 Updated Dec 16, 2024

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 45,118 4,818 Updated Jan 22, 2025

ExponentialML / Text-To-Video-Finetuning

Finetune ModelScope's Text To Video model using Diffusers 🧨

Python 678 107 Updated Dec 14, 2023

microsoft / generative-ai-for-beginners

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 69,834 36,052 Updated Jan 24, 2025

guoyww / AnimateDiff

Official implementation of AnimateDiff.

Python 10,931 884 Updated Jul 31, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,269 2,341 Updated Aug 12, 2024
