View hanajibsa's full-sized avatar

[ICLR 2025] CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs

Python 12 Updated Feb 20, 2025

Preference Learning for LLaVA

Python 41 Updated Nov 9, 2024

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization

Python 85 5 Updated Jan 30, 2024
Python 320 9 Updated Jan 27, 2024

Collection of AWESOME vision-language models for vision tasks

2,612 202 Updated Mar 24, 2025

A Deep Learning NLP repository that uses TensorFlow to cover everything from text preprocessing to downstream tasks for recent models such as Topic Models, BERT, GPT, and LLMs.

Jupyter Notebook 542 278 Updated Sep 6, 2024

Developing practical AI applications with LLMs

Jupyter Notebook 143 112 Updated Aug 29, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,391 1,009 Updated Nov 18, 2024

Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".

Python 271 27 Updated May 23, 2023

Weakly-supervised learning pipeline for histopathology images. Publications: Biomarker prediction in colorectal cancer (CRC)

Jupyter Notebook 68 15 Updated Feb 6, 2024

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,117 451 Updated Mar 22, 2025

Examples of XCIST applications

Python 11 1 Updated Feb 3, 2025

Siamese and triplet networks with online pair/triplet mining in PyTorch

Python 3,132 635 Updated Apr 29, 2023

Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.

Python 50 6 Updated Mar 30, 2022

S3D Text-Video model trained on HowTo100M using MIL-NCE

Python 194 21 Updated Jul 3, 2020

ImageBind One Embedding Space to Bind Them All

Python 8,567 801 Updated Jul 31, 2024

Awesome-LLM: a curated list of Large Language Model

22,369 1,843 Updated Mar 26, 2025

[NeurIPS 2024] Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis

Python 139 3 Updated Dec 2, 2024

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 3,522 502 Updated Aug 6, 2024

Codes of Learning Prior Feature and Attention Enhanced Image Inpainting (ECCV2022)

Jupyter Notebook 85 4 Updated Mar 5, 2024

PyTorch implementation of MAE https://arxiv.org/abs/2111.06377

Python 7,667 1,253 Updated Jul 23, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 5,130 679 Updated Aug 5, 2024

Tackling the Generative Learning Trilemma with Denoising Diffusion GANs https://arxiv.org/abs/2112.07804

Python 726 82 Updated Dec 2, 2022

Contrastive unpaired image-to-image translation, faster and lighter training than cyclegan (ECCV 2020, in PyTorch)

Python 2,318 424 Updated Sep 5, 2023

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,873 327 Updated Jul 14, 2024

Image-to-Image Translation in PyTorch

Python 23,795 6,411 Updated May 14, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 28,122 3,509 Updated Jul 23, 2024

ADN: Artifact Disentanglement Network for Unsupervised Metal Artifact Reduction

Python 179 40 Updated Dec 10, 2019