Skip to content
View sahalshajim's full-sized avatar

Organizations

@mbzuai-oryx

Block or report sahalshajim

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Extracts essential Mediapipe face landmarks and arranges them in a sequenced order.

Python 27 2 Updated Jul 19, 2022

Methods I used to clean the Micro_Expressions dataset

Jupyter Notebook 3 1 Updated Jun 24, 2022

Mediapipe Face Mesh

Python 14 Updated Jun 24, 2022

Realtime micro-expression recognition using OpenCV and PyTorch

Python 57 12 Updated Jun 24, 2022

Fun With Image Processing - Course

4 4 Updated Dec 26, 2022

Template files for workshop "NLP FOR VISION AND SPEECH IMPAIRED" by IIT Madras Research Park | Empower Conference

Python 1 Updated Oct 5, 2023

Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving quality of synthesis, as well as small foot print TTS integrat…

Perl 1 Updated Nov 8, 2023

Flask API implementation of the Text to Speech Model developed my Speech Lab, IIT Madras

Perl 2 Updated Nov 17, 2023

About.me

3 Updated Jan 10, 2024

An Intuitive Humanoid Robot Arm Controller for Teleoperation.

Python 1 Updated Nov 1, 2024

Official implementation of paper titled "GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model"

Python 65 4 Updated Jan 23, 2025

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 827 62 Updated Jul 10, 2024

Code for "Enhancing In-context Learning via Linear Probe Calibration"

Python 35 Updated Apr 24, 2024

[WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory

Python 52 2 Updated Jan 23, 2025

MobiLlama : Small Language Model tailored for edge devices

Python 619 48 Updated Mar 3, 2024

(WACV 2025) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hindi, Bengali and Urdu.

Python 81 5 Updated Sep 10, 2024

Bilingual Medical Mixture of Experts LLM

28 1 Updated Nov 23, 2024

Go ahead and axolotl questions

Python 8,477 939 Updated Feb 5, 2025

Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks

Python 139 18 Updated Sep 20, 2024

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 821 41 Updated Nov 23, 2024

[BIONLP@ACL 2024] XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.

Python 482 58 Updated Aug 8, 2024

[EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabic languages.

Python 77 10 Updated Sep 24, 2024

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,283 111 Updated Aug 27, 2024

[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".

Python 263 19 Updated Apr 3, 2024

[NeurIPS 2022] Official repository of paper titled "Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection".

Jupyter Notebook 287 20 Updated Oct 12, 2022

[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".

Python 702 57 Updated Jul 24, 2023

[ECCV'22] Official repository of paper titled "Class-agnostic Object Detection with Multi-modal Transformer".

Python 306 25 Updated May 9, 2023

An Empirical Study Of Self-supervised Learning Approaches For Object Detection With Transformers - CV703 Course Project

Python 6 Updated Jan 23, 2023

Hotel Booking Concept is a promo sample application inspired by

Dart 110 35 Updated Sep 20, 2021