Skip to content
View jarkkotulensalo's full-sized avatar

Block or report jarkkotulensalo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 3,219 192 Updated Jan 30, 2025

SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images

Python 630 66 Updated Jan 22, 2025

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 25,171 1,904 Updated Jan 27, 2025

Arbitrary-steps Image Super-resolution via Diffusion Inversion

Python 866 56 Updated Jan 8, 2025
Python 938 67 Updated Jan 8, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,628 446 Updated Jan 27, 2025

Quantized training of Stable Diffusion 3 Medium to significantly reduce memory usage.

Python 12 2 Updated Jul 10, 2024

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,486 984 Updated Jan 22, 2025

End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).

Python 314 9 Updated Jan 1, 2025

Let us control diffusion models for colorization!

Python 36 7 Updated Sep 6, 2023

This repository aims to implement an Image Search engine powered by the CLIP model.

Python 40 2 Updated Jul 15, 2022
Jupyter Notebook 196 17 Updated May 27, 2024

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 30,693 3,063 Updated Jan 7, 2025

Foundational model for human-like, expressive TTS

Python 4,005 674 Updated Jul 30, 2024

A RAG LLM co-pilot for browsing the web, powered by local LLMs

TypeScript 1,467 107 Updated Jan 26, 2025

A unified framework for 3D content generation.

Jupyter Notebook 6,518 503 Updated Dec 16, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 45,118 4,818 Updated Jan 22, 2025

Finetune ModelScope's Text To Video model using Diffusers 🧨

Python 678 107 Updated Dec 14, 2023

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 69,834 36,052 Updated Jan 24, 2025

Official implementation of AnimateDiff.

Python 10,931 884 Updated Jul 31, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,269 2,341 Updated Aug 12, 2024