Skip to content
View MalvinaNikandrou's full-sized avatar

Highlights

  • Pro

Block or report MalvinaNikandrou

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PyTorch implementation of Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration (NeurIPS2024 Spotlight).

Python 3 Updated Dec 17, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,804 2,608 Updated Jan 11, 2025

Machine Learning and Computer Vision Engineer - Technical Interview Questions

3,169 523 Updated Jan 4, 2025

A repository to prepare you for your machine learning interview, involving most of the questions asked by all the tech giants and local companies. Do this to Ace your Machine Learning Engineer Inte…

462 113 Updated Jul 28, 2024

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 37,119 5,382 Updated Jan 11, 2025

Codebase for CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts

2 Updated Oct 20, 2024

PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡

Python 4,379 664 Updated Aug 16, 2024

Data and Code for Paper "From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models"

2 Updated Jul 4, 2024

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

Python 377 30 Updated Jan 2, 2025

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,900 112 Updated Jul 29, 2024

Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling

Python 2 Updated Dec 3, 2024

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

Jupyter Notebook 2,023 245 Updated Jan 3, 2025

Famous Vision Language Models and Their Architectures

Markdown 551 29 Updated Sep 8, 2024

Curated list of data science interview questions and answers

3,836 895 Updated Sep 29, 2024

The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mobile phone. The dataset is presented with a teachable object …

Python 98 23 Updated Aug 13, 2024

Multilingual Image Captioning Evaluation

Python 2 Updated May 10, 2024

(WACV 2025) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hindi, Bengali and Urdu.

Python 81 5 Updated Sep 10, 2024

inference code for SOTA closed and open vision-language models

Python 4 Updated Oct 17, 2024

A Python Perceptual Image Hashing Module

Python 3,484 336 Updated Oct 9, 2024

A curated list of papers & resources linked to concept learning

10 Updated Aug 9, 2023

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,267 184 Updated Jan 9, 2025

A paper list of some recent Mamba-based CV works.

247 14 Updated Jan 10, 2025

MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingual text perception and comprehension capabilities across nine…

Python 49 2 Updated Dec 13, 2024

Track emissions from Compute and recommend ways to reduce their impact on the environment.

Python 1,224 183 Updated Jan 11, 2025

😸 💬 A module to compute textual lexical richness (aka lexical diversity).

Python 98 19 Updated Aug 27, 2023

FRP Fork

Go 142 20 Updated Dec 2, 2024

A playbook for systematically maximizing the performance of deep learning models.

27,808 2,299 Updated Jun 18, 2024

Code for paper: VL-ICL Bench: The Devil in the Details of Benchmarking Multimodal In-Context Learning

Python 34 2 Updated Jan 9, 2025

Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03299)

Python 523 69 Updated Nov 28, 2023
Next