mmaaz60

Muhammad Maaz mmaaz60

An Electrical Engineer with experience in Computer Vision software development. Skilled in Machine Learning, Deep Learning and Computer Vision.

148 followers · 4 following

mmaaz60/README.md

Hi there 👋

🔭 I’m currently working on multi-modal transformers and multi-task learning
🌱 I’m currently learning to play Table Tennis 🏓
📫 How to reach me: [email protected]

Pinned

mbzuai-oryx/Video-ChatGPT Public

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python · 1.3k stars · 111 forks

mbzuai-oryx/groundingLMM Public

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python · 854 stars · 45 forks

mbzuai-oryx/VideoGPT-plus Public

Official repository of the paper "VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding".

Python · 264 stars · 17 forks

mbzuai-oryx/LLaVA-pp Public

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python · 835 stars · 62 forks

mbzuai-oryx/PALO Public

(WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hindi, Bengali and Urdu.

Python · 84 stars · 5 forks

EdgeNeXt Public

[CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications".

Python · 363 stars · 41 forks

528 contributions in the last year

Contribution activity

March 2025

7 contributions in private repositories Mar 11 – Mar 17