WeberJulian


Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Python 1,425 71 Updated Dec 16, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 34,618 2,625 Updated Dec 16, 2024

The Mojo Programming Language

Mojo 23,429 2,596 Updated Dec 14, 2024

Grok open release

Python 49,726 8,342 Updated Aug 30, 2024

JavaScript 172 19 Updated Dec 1, 2023

Faster Whisper transcription with CTranslate2

Python 12,984 1,085 Updated Dec 12, 2024

LLM inference in C/C++

C++ 69,317 9,979 Updated Dec 16, 2024

French instruction-following and chat models

Jupyter Notebook 501 47 Updated Dec 5, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 72,911 8,697 Updated Dec 1, 2024

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Python 115 15 Updated Jul 14, 2022

A walkthrough of transformer architecture code

Jupyter Notebook 318 39 Updated Feb 20, 2024

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Python 557 158 Updated Aug 19, 2023

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,903 193 Updated Dec 9, 2024

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

C++ 2,304 278 Updated Mar 11, 2024

A tokenizer, text cleaner, and phonemizer for many human languages.

Python 288 37 Updated Nov 15, 2024

Text-to-Speech in JavaScript using eSpeak

C++ 1,297 293 Updated Jan 30, 2020

[Does not work anymore!] Script to enable systemd support on current Ubuntu WSL2 images

Shell 1,566 385 Updated Sep 17, 2023

🐸 - A general purpose model trainer, as flexible as it gets

Python 200 119 Updated Mar 7, 2024

Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2

112 21 Updated May 20, 2019

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Jupyter Notebook 920 81 Updated Nov 4, 2024

An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.

Python 81 17 Updated May 20, 2023

Simple (maybe too simple) config management through Python data classes. We use it for machine learning.

Python 98 34 Updated Apr 12, 2023

This repository contains the source code for the paper First Order Motion Model for Image Animation

Jupyter Notebook 14,602 3,228 Updated Nov 14, 2024

TensorFlow port of first-order motion model. TF Lite and TF.js compatible, supports the original's checkpoints and implements in-graph kp processing, but inference only (no training).

Python 34 9 Updated Jun 22, 2021

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Python 831 157 Updated Oct 10, 2023

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,126 4,429 Updated Aug 16, 2024

A GPU-based vocoder that converts audio to mel-spectrograms and back using WaveGlow.

Python 16 3 Updated Nov 30, 2020

Code for Neural Architecture Search without Training (ICML 2021)

Python 463 63 Updated Aug 6, 2021

Notepad++ official repository

C++ 23,315 4,650 Updated Dec 14, 2024