Skip to content
View aqibmumtaz's full-sized avatar
  • SlimLogix
  • Lahore, Pakistan

Organizations

@Slimlogix

Block or report aqibmumtaz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open source conversation framework and visual editor for structured Pipecat dialogues

Python 43 2 Updated Dec 29, 2024

Official inference framework for 1-bit LLMs

C++ 12,488 875 Updated Dec 20, 2024

Unified framework for building enterprise RAG pipelines with small, specialized models

Python 8,344 1,521 Updated Dec 20, 2024

Code for visualizing the loss landscape of neural nets

Python 2,872 406 Updated Apr 5, 2022

Open Source framework for voice and multimodal conversational AI

Python 4,037 404 Updated Dec 24, 2024
Python 725 84 Updated Jun 17, 2024

BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks

Python 576 67 Updated Oct 25, 2024

pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward.

Python 6,932 404 Updated Jul 6, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,524 2,924 Updated Sep 2, 2024

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Python 550 64 Updated Oct 8, 2024

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Python 8,505 2,408 Updated Dec 29, 2024

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,733 259 Updated Aug 9, 2024

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 2,871 365 Updated May 8, 2024

Efficient Multimodal Large Language Models: A Survey

296 13 Updated Aug 16, 2024

LLM-Assisted Real-Time Anomaly Detection for Safe Visual Navigation

Python 25 1 Updated Apr 25, 2024

Library for fast text representation and classification.

HTML 26,002 4,726 Updated Mar 22, 2024

Real time speech to text transcription app.

Python 392 74 Updated Jan 14, 2023

Real time transcription with OpenAI Whisper.

Python 2,466 415 Updated Jun 1, 2024

Official repository of the 1st place solution for the 7th NVIDIA AI City Challenge (2023) Track 1: Multi-Camera People Tracking

Python 73 16 Updated Jan 25, 2024

Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2

JavaScript 502 117 Updated Apr 28, 2023

Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.

Python 40 9 Updated Aug 15, 2021

Face recognition using Tensorflow

Python 13,878 4,816 Updated Jul 24, 2023

The world's simplest facial recognition api for Python and the command line

Python 53,811 13,523 Updated Aug 21, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,739 6,434 Updated Oct 18, 2024

⚡ Finetune Wa2vec 2.0 For Speech Recognition

Python 121 24 Updated Nov 7, 2023

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,889 27,412 Updated Dec 29, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,488 8,781 Updated Dec 1, 2024

This repository contains the code for the speech recognition in python

Jupyter Notebook 92 124 Updated Dec 12, 2023

Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models

Python 4,628 961 Updated Aug 2, 2024
Next