dmitrymailk

🎈

Focusing

https://vk.com/dimweb

19 followers · 30 following

https://t.me/dim_web

Starred repositories

282 results for source starred repositories

thewh1teagle / kokoro-onnx

TTS with kokoro and onnx runtime

Python 1,387 119 Updated Feb 3, 2025

ray-project / llmperf

LLMPerf is a library for validating and benchmarking LLMs

Python 714 121 Updated Dec 9, 2024

tdrussell / diffusion-pipe

A pipeline parallel training script for diffusion models.

Python 477 43 Updated Jan 26, 2025

TencentARC / BrushEdit

The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"

Python 497 25 Updated Dec 26, 2024

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 13,636 1,484 Updated Jan 27, 2025

SonyResearch / micro_diffusion

Official repository for our work on micro-budget training of large-scale diffusion models.

Python 1,211 47 Updated Jan 12, 2025

open-mmlab / Live2Diff

Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.

Python 180 15 Updated Jul 22, 2024

NJU-PCALab / STAR

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Python 824 44 Updated Jan 22, 2025

ChenDarYen / NitroFusion

NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training

266 19 Updated Jan 6, 2025

chengzeyi / Comfy-WaveSpeed

[WIP] The all in one inference optimization solution for ComfyUI, universal, flexible, and fast.

Python 682 22 Updated Feb 2, 2025

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,350 456 Updated Jan 28, 2025

vidstack / player

UI components and hooks for building video/audio players on the web. Robust, customizable, and accessible. Modern alternative to JW Player and Video.js.

TypeScript 2,582 142 Updated Feb 2, 2025

Nuked88 / DreamingAI

Repo of the YT Channel

Python 27 4 Updated Feb 14, 2024

lucasgelfond / webgpu-sam2

Segment Anything 2, 100% in the browser (with WebGPU!)

TypeScript 99 5 Updated Dec 18, 2024

ggerganov / llama.cpp

LLM inference in C/C++

C++ 72,758 10,485 Updated Feb 2, 2025

pytransitions / transitions

A lightweight, object-oriented finite state machine implementation in Python with many extensions

Python 5,903 533 Updated Aug 23, 2024

pytransitions / transitions-gui

A frontend for transitions state machines

Python 69 7 Updated Sep 1, 2024

dottxt-ai / outlines

Structured Text Generation

Python 10,548 556 Updated Jan 31, 2025

nmandic78 / AI-VoiceAssistant

A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local LLM via llama.cpp or OpenAI API. Includes clipboard integrat…

Python 9 1 Updated Dec 2, 2024