Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving quality of synthesis, as well as small foot print TTS integrat…

Perl 1 Updated Nov 8, 2023

k-m-irfan / Fastspeech2_HS_Flask_API

Flask API implementation of the Text to Speech Model developed my Speech Lab, IIT Madras

Perl 2 Updated Nov 17, 2023

k-m-irfan / k-m-irfan

About.me

3 Updated Jan 10, 2024

k-m-irfan / Humanoid_Robot_Arm_Controller

An Intuitive Humanoid Robot Arm Controller for Teleoperation.

Python 1 Updated Nov 1, 2024

Amshaker / GroupMamba

Official implementation of paper titled "GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model"

Python 65 4 Updated Jan 23, 2025

mbzuai-oryx / LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 827 62 Updated Jul 10, 2024

mominabbass / LinC

Code for "Enhancing In-context Learning via Linear Probe Calibration"

Python 35 Updated Apr 24, 2024

Amshaker / MAVOS

[WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory

Python 52 2 Updated Jan 23, 2025

mbzuai-oryx / MobiLlama

MobiLlama : Small Language Model tailored for edge devices

Python 619 48 Updated Mar 3, 2024

mbzuai-oryx / PALO

(WACV 2025) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hindi, Bengali and Urdu.

Python 81 5 Updated Sep 10, 2024

mbzuai-oryx / BiMediX

Bilingual Medical Mixture of Experts LLM

28 1 Updated Nov 23, 2024

axolotl-ai-cloud / axolotl

Go ahead and axolotl questions

Python 8,477 939 Updated Feb 5, 2025

wuhy68 / Parameter-Efficient-MoE

Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks

Python 139 18 Updated Sep 20, 2024

mbzuai-oryx / groundingLMM

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 821 41 Updated Nov 23, 2024

mbzuai-oryx / XrayGPT

[BIONLP@ACL 2024] XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.

Python 482 58 Updated Aug 8, 2024

mbzuai-oryx / ClimateGPT

[EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabic languages.

Python 77 10 Updated Sep 24, 2024

mbzuai-oryx / Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,283 111 Updated Aug 27, 2024

muzairkhattak / ViFi-CLIP

[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".

Python 263 19 Updated Apr 3, 2024

hanoonaR / object-centric-ovd

[NeurIPS 2022] Official repository of paper titled "Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection".

Jupyter Notebook 287 20 Updated Oct 12, 2022

muzairkhattak / multimodal-prompt-learning

[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".

Python 702 57 Updated Jul 24, 2023

mmaaz60 / mvits_for_class_agnostic_od

[ECCV'22] Official repository of paper titled "Class-agnostic Object Detection with Multi-modal Transformer".

Python 306 25 Updated May 9, 2023

gokulkarthik / Deformable-DETR

Forked from fundamentalvision/Deformable-DETR

An Empirical Study Of Self-supervised Learning Approaches For Object Detection With Transformers - CV703 Course Project

Python 6 Updated Jan 23, 2023

LanarsInc / hotel-booking-concept-flutter

Hotel Booking Concept is a promo sample application inspired by

Dart 110 35 Updated Sep 20, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sahal Shaji sahalshajim

Achievements

Achievements

Organizations

Block or report sahalshajim

Stars

k-m-irfan / simplified_mediapipe_face_landmarks

k-m-irfan / MER_dataset_cleaning

k-m-irfan / mediapipe_FaceMesh

k-m-irfan / microexpression_recognition

k-m-irfan / Fun_With_Image_Processing

k-m-irfan / voice_translation_workshop

k-m-irfan / Fastspeech2_HS