Skip to content
View aghazahedim's full-sized avatar

Block or report aghazahedim

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,144 2,327 Updated Aug 12, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,174 991 Updated Nov 18, 2024

Visual self-questioning for large vision-language assistant.

Python 39 3 Updated Oct 1, 2024

AQUA dataset and VIKING model for the task of Art Visual Question Answering

Python 23 4 Updated Jun 4, 2021

Description and pointers of laion datasets

HTML 241 9 Updated Nov 5, 2022

The National Gallery of Art Open Data Program

Shell 370 67 Updated Jul 8, 2024

VIP cheatsheets for Stanford's CME 106 Probability and Statistics for Engineers

690 219 Updated Sep 9, 2020

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 4,974 662 Updated Aug 5, 2024

CVPR 2024 "Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers"

Jupyter Notebook 16 2 Updated Sep 11, 2024

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 10,994 1,586 Updated Jan 19, 2025

Class Activation Mapping

MATLAB 1,854 465 Updated Sep 13, 2022

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, B…

Python 483 36 Updated Apr 21, 2024

TranSalNet: Towards perceptually relevant visual saliency prediction. Neurocomputing (2022)

Python 56 9 Updated Jul 25, 2024

pytorch implementation of the different DeepGaze models

Jupyter Notebook 121 21 Updated Jun 15, 2023

This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, human visual search.

44 1 Updated Dec 3, 2024

Google Research

Jupyter Notebook 34,712 7,990 Updated Jan 18, 2025

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention

Python 819 75 Updated Apr 17, 2024

Summary of related papers on visual attention. Related code will be released based on Jittor gradually.

Python 2,760 410 Updated Oct 20, 2024

Refine high-quality datasets and visual AI models

Python 9,092 592 Updated Jan 20, 2025

MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning

Python 134 5 Updated Jun 20, 2023

Materials for Hawley's Deep Learning & AI Ethics course

Jupyter Notebook 39 13 Updated Nov 6, 2024

A collection of PyTorch notebooks for learning and practicing deep learning

Jupyter Notebook 135 125 Updated Dec 30, 2019
Python 16 2 Updated Aug 1, 2024
Jupyter Notebook 26 5 Updated Nov 24, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 15,358 2,645 Updated Dec 18, 2024

Examples and guides for using the OpenAI API

MDX 61,236 9,810 Updated Jan 17, 2025

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 52,563 5,115 Updated Jan 9, 2025

Must-read papers on prompt-based tuning for pre-trained language models.

4,130 382 Updated Jul 17, 2023

📺 Discover the latest machine learning / AI courses on YouTube.

16,183 1,936 Updated Jan 22, 2024

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""

Python 3,721 280 Updated Nov 25, 2024
Next