dmitrymailk

Follow

🎈

Focusing

dmitrymailk

🎈

Focusing

Follow

https://vk.com/dimweb

19 followers · 30 following

https://t.me/dim_web

Achievements

Achievements

Highlights

Pro

Starred repositories

thewh1teagle / kokoro-onnx

TTS with kokoro and onnx runtime

Python 1,384 118 Updated Jan 29, 2025

ray-project / llmperf

LLMPerf is a library for validating and benchmarking LLMs

Python 713 121 Updated Dec 9, 2024

tdrussell / diffusion-pipe

A pipeline parallel training script for diffusion models.

Python 477 44 Updated Jan 26, 2025

TencentARC / BrushEdit

The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"

Python 497 25 Updated Dec 26, 2024

deepbeepmeep / Cosmos1GP

Forked from NVIDIA/Cosmos

Cosmos1GP for the GPU Poor by DeepBeepMeep

Python 38 1 Updated Jan 21, 2025

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 13,633 1,483 Updated Jan 27, 2025

SonyResearch / micro_diffusion

Official repository for our work on micro-budget training of large-scale diffusion models.

Python 1,210 47 Updated Jan 12, 2025

open-mmlab / Live2Diff

Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.

Python 180 15 Updated Jul 22, 2024

NJU-PCALab / STAR

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Python 823 44 Updated Jan 22, 2025

ChenDarYen / NitroFusion

NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training

266 19 Updated Jan 6, 2025

chengzeyi / Comfy-WaveSpeed

[WIP] The all in one inference optimization solution for ComfyUI, universal, flexible, and fast.

Python 682 22 Updated Feb 2, 2025

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,349 455 Updated Jan 28, 2025

vidstack / player

UI components and hooks for building video/audio players on the web. Robust, customizable, and accessible. Modern alternative to JW Player and Video.js.

TypeScript 2,579 142 Updated Feb 2, 2025

Nuked88 / DreamingAI

Repo of the YT Channel

Python 27 4 Updated Feb 14, 2024

lucasgelfond / webgpu-sam2

Segment Anything 2, 100% in the browser (with WebGPU!)

TypeScript 99 5 Updated Dec 18, 2024

ggerganov / llama.cpp

LLM inference in C/C++

C++ 72,747 10,482 Updated Feb 2, 2025

pytransitions / transitions

A lightweight, object-oriented finite state machine implementation in Python with many extensions

Python 5,903 533 Updated Aug 23, 2024

pytransitions / transitions-gui

A frontend for transitions state machines

Python 69 7 Updated Sep 1, 2024

dottxt-ai / outlines

Structured Text Generation

Python 10,546 556 Updated Jan 31, 2025

nmandic78 / AI-VoiceAssistant

A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local LLM via llama.cpp or OpenAI API. Includes clipboard integrat…

Python 9 1 Updated Dec 2, 2024

huggingface / audio-transformers-course

The Hugging Face Course on Transformers for Audio

MDX 362 105 Updated Jan 23, 2025

dmitrymailk / auto_remaster

Python 5 Updated Jan 22, 2025

logtd / ComfyUI-LTXTricks

A set of ComfyUI nodes providing additional control for the LTX Video model

Python 433 19 Updated Dec 21, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 39,700 4,455 Updated Jan 18, 2025

cumulo-autumn / StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 9,904 726 Updated Dec 4, 2024

brunodev85 / winlator

Android application for running Windows applications with Wine and Box86/Box64

C 10,711 566 Updated Jan 6, 2025

orzhan / robotic-arm-gpt

Python 2 1 Updated Dec 19, 2024

kijai / ComfyUI-KJNodes

Various custom nodes for ComfyUI

Python 814 88 Updated Feb 1, 2025

dwojtasik / PyHook

Python hook for ReShade processing

C++ 36 4 Updated Mar 25, 2023

Vchitect / VEnhancer

Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation

Python 492 27 Updated Sep 16, 2024

Starred topics

vue

Unity

Terminal

SQL

Python

PWA

OpenGL

Natural language processing

Machine learning

Linux

See all starred topics