Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 2,871 365 Updated May 8, 2024

swordlidev / Efficient-Multimodal-LLMs-Survey

Efficient Multimodal Large Language Models: A Survey

296 13 Updated Aug 16, 2024

AIS-Clemson / VisionGPT

LLM-Assisted Real-Time Anomaly Detection for Safe Visual Navigation

Python 25 1 Updated Apr 25, 2024

facebookresearch / fastText

Library for fast text representation and classification.

HTML 26,002 4,726 Updated Mar 22, 2024

davabase / transcriber_app

Real time speech to text transcription app.

Python 392 74 Updated Jan 14, 2023

davabase / whisper_real_time

Real time transcription with OpenAI Whisper.

Python 2,466 415 Updated Jun 1, 2024

ipl-uw / AIC23_Track1_UWIPL_ETRI

Official repository of the 1st place solution for the 7th NVIDIA AI City Challenge (2023) Track 1: Multi-Camera People Tracking

Python 73 16 Updated Jan 25, 2024

MycroftAI / mimic-recording-studio

Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2

JavaScript 502 117 Updated Apr 28, 2023

padmalcom / ttsdatasetcreator

Python 23 15 Updated Jan 5, 2023

daanzu / speech-training-recorder

Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.

Python 40 9 Updated Aug 15, 2021

davidsandberg / facenet

Face recognition using Tensorflow

Python 13,878 4,816 Updated Jul 24, 2023

ageitgey / face_recognition

The world's simplest facial recognition api for Python and the command line

Python 53,811 13,523 Updated Aug 21, 2024

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,739 6,434 Updated Oct 18, 2024

khanld / ASR-Wav2vec-Finetune

⚡ Finetune Wa2vec 2.0 For Speech Recognition

Python 121 24 Updated Nov 7, 2023

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,889 27,412 Updated Dec 29, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,488 8,781 Updated Dec 1, 2024

aravindpai / Speech-Recognition

This repository contains the code for the speech recognition in python

Jupyter Notebook 92 124 Updated Dec 12, 2023

timesler / facenet-pytorch

Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models

Python 4,628 961 Updated Aug 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Aqib Mumtaz aqibmumtaz

Achievements

Achievements

Organizations

Block or report aqibmumtaz

Stars

pipecat-ai / pipecat-flows

microsoft / BitNet

llmware-ai / llmware

tomgoldstein / loss-landscape

pipecat-ai / pipecat

PharMolix / OpenBioMed

taokz / BiomedGPT

bndr / pipreqs

Vision-CAIR / MiniGPT-4

OpenGVLab / VideoMAEv2

Uberi / speech_recognition

facebookresearch / jepa

facebookresearch / ijepa