-
IDIAP
- Switzerland
- https://juanpzuluaga.github.io/
- @PabloGomez3
Highlights
- Pro
Stars
Prepend universal audio attack segment to mute Whisper
This repository contains the SpeechBrain Benchmarks
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Reference code for the paper The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Repository for Accent Recognition (Hackathon @SLT2022)
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
text window manager, shell multiplexer, integrated DevOps environment
Acceptance rates for the major AI conferences
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IWSLT2022.
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
BUT-FIT at SemEval-2019 Task 7: Determining the Rumour Stance with Pre-Trained Deep Bidirectional Transformers
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Tools for handling speech data in machine learning projects.