Skip to content
View naymaraq's full-sized avatar
  • David Karamyan
  • Yerevan

Block or report naymaraq

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
30 stars written in Python
Clear filter

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,155 8,734 Updated Dec 1, 2024

The official Meta Llama 3 GitHub site

Python 27,569 3,143 Updated Aug 12, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,489 2,573 Updated Dec 21, 2024

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

Python 5,284 324 Updated Dec 20, 2024

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,655 303 Updated Oct 28, 2024

Reference implementations of MLPerf™ training benchmarks

Python 1,628 563 Updated Oct 17, 2024

Complete YOLO v3 TensorFlow implementation. Support training on your own dataset.

Python 1,552 579 Updated Sep 16, 2022

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Python 1,458 153 Updated Dec 2, 2024

Tools for handling speech data in machine learning projects.

Python 963 221 Updated Dec 19, 2024

Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".

Python 849 55 Updated Oct 28, 2024

LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/l…

Python 787 57 Updated Dec 3, 2024

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

Python 517 73 Updated Sep 25, 2024

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 484 29 Updated Oct 25, 2024

A fast and lightweight python-based CTC beam search decoder for speech recognition.

Python 434 90 Updated Jul 13, 2023

Efficient LLM Inference over Long Sequences

Python 322 14 Updated Dec 6, 2024

NeMo text processing for ASR and TTS

Python 288 91 Updated Dec 11, 2024

LLM KV cache compression made easy

Python 273 14 Updated Dec 20, 2024

[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition

Python 214 32 Updated Jun 22, 2023

A toolkit for processing speech data and creating speech datasets

Python 98 21 Updated Dec 21, 2024

Python package for combining diarization system outputs.

Python 80 13 Updated Oct 12, 2023

Official repository of NeXt-TDNN for speaker verification

Python 63 6 Updated Oct 10, 2024
Python 51 3 Updated Feb 8, 2024

Tensorflow QANet with ELMo

Python 15 3 Updated Mar 13, 2019

SLT 2024 Challenge: Post-ASR-Speaker-Tagging

Python 14 1 Updated Jun 16, 2024

Supervised/Unsupervised Alignment of Clear/Anonymized X-Vector with Procrustes/Wasserstein Procrustes

Python 7 Updated Jan 19, 2023

Tokenizer for Armenian Language

Python 6 Updated Apr 25, 2020

[Yerevan 24] Authorship Style Transfer with Policy Optimization

Python 2 2 Updated Jul 4, 2024