Stars
This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.
Train a deeper LSTM and normalized CNN Visual Question Answering model. This current code can get 58.16 on OpenEnded and 63.09 on Multiple-Choice on test-standard.
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
Using VGG16 to extract features from image to train ML model.
Transfer Learning Using VGG-19 on MNIST dataset
Introduction to APIs - v2
This is all my notebooks, lab solutions, and assignments for the DeepLearning.AI Natural Language Processing Specialization on Coursera.
This repository contains demos I made with the Transformers library by HuggingFace.
🏅 Collection of Kaggle Solutions and Ideas 🏅
AIOZ AI - Overcoming Data Limitation in Medical Visual Question Answering (MICCAI 2019)
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Codes to complement YouTube videos and blog posts on Medium.
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-e…
Neural Networks: Zero to Hero
Bayesian active learning library for research and industrial usecases.