Skip to content
View JuanPZuluaga's full-sized avatar

Highlights

  • Pro

Block or report JuanPZuluaga

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Prepend universal audio attack segment to mute Whisper

Python 16 4 Updated Dec 5, 2024
Python 3 Updated Jun 19, 2024

This repository contains the SpeechBrain Benchmarks

Python 103 39 Updated Dec 9, 2024

Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

Python 19,459 1,366 Updated Dec 12, 2024

MLX: An array framework for Apple silicon

C++ 17,918 1,034 Updated Dec 20, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 40,148 4,258 Updated Jul 28, 2024

Reference code for the paper The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation.

Python 1 Updated Dec 7, 2023

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 12,937 1,384 Updated Dec 18, 2024

Repository for Accent Recognition (Hackathon @SLT2022)

Jupyter Notebook 24 9 Updated May 12, 2024

SHAS: Approaching optimal Segmentation for End-to-End Speech Translation

Python 37 4 Updated Feb 9, 2023

text window manager, shell multiplexer, integrated DevOps environment

Shell 1,258 123 Updated May 2, 2024

Acceptance rates for the major AI conferences

Jupyter Notebook 4,300 303 Updated Dec 10, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,484 2,572 Updated Dec 20, 2024

A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications

Python 52 5 Updated Mar 24, 2023

Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IWSLT2022.

16 5 Updated Nov 30, 2022

A PyTorch-based Speech Toolkit

Python 9,082 1,414 Updated Dec 9, 2024

Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering

Python 18 1 Updated Oct 19, 2023

Tools and Modeling Code for the MASSIVE dataset

Python 542 57 Updated Nov 28, 2022
Python 35 6 Updated Jul 10, 2024
Jupyter Notebook 6 2 Updated Dec 5, 2022

BUT-FIT at SemEval-2019 Task 7: Determining the Rumour Stance with Pre-Trained Deep Bidirectional Transformers

Python 9 6 Updated Jun 26, 2023

NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations

Python 778 196 Updated May 19, 2024

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

Python 655 98 Updated Nov 1, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 34,687 2,629 Updated Dec 20, 2024
Python 965 305 Updated Dec 19, 2024

Moved to https://github.com/k2-fsa/icefall

Python 144 42 Updated Oct 13, 2022

Tools for handling speech data in machine learning projects.

Python 963 221 Updated Dec 19, 2024

NYU Deep Learning Spring 2020

Jupyter Notebook 6,705 2,221 Updated Sep 10, 2024
Next