**Interspeech 2022** "SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks". Speech processing with the prompting paradigm.

Python 97 8 Updated Aug 25, 2023

ga642381 / SpeechGen

"SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts"

74 5 Updated Jun 9, 2023

dair-ai / Transformers-Recipe

🧠 A study guide to learn about Transformers

1,545 146 Updated Jun 3, 2023

mlfoundations / open_clip

An open source implementation of CLIP.

Python 10,631 1,003 Updated Dec 4, 2024

j3soon / arxiv-utils

Meaningful titles for tabs and PDF downloads! Also supports tab search.

JavaScript 293 19 Updated Oct 20, 2024

shengyp / doing_the_PhD

2,061 263 Updated Nov 22, 2024

nat / natbot

Drive a browser with GPT-3

Python 1,914 279 Updated Jun 9, 2024

ggerganov / whisper.cpp

Port of OpenAI's Whisper model in C/C++

C++ 36,476 3,735 Updated Dec 22, 2024

chorowski-lab / hCPC

Implementation of multi-level Contrastive Predictive Coding (CPC) methods

Python 19 3 Updated Jan 12, 2023

arenjansen / ZRTools

Zero-Resource Speech Discovery, Search, and Evaluation Tools

C 29 17 Updated Aug 6, 2015

boazbk / tcs

Book in preparation: introduction to theoretical computer science

TeX 919 185 Updated Mar 18, 2024

lumaku / ctc-segmentation

Segment an audio file and obtain utterance alignments. (Python package)

Python 324 29 Updated May 15, 2024

zhaoyanpeng / xcfg

X (weighted / probabilistic) Context-Free Grammars

Python 25 2 Updated Jan 30, 2024

SpeechColab / GigaSpeech

Large, modern dataset for speech recognition

Shell 653 62 Updated Feb 26, 2024

dpressel / mint

MinT: Minimal Transformer Library and Tutorials

Python 251 14 Updated Jul 26, 2022

bobwan1995 / cliora

Official codebase for the ICLR oral paper "Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling"

Yuan Tseng roger-tseng
