Skip to content
View hanajibsa's full-sized avatar

Block or report hanajibsa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR 2025] CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs

Python 9 Updated Feb 20, 2025

Preference Learning for LLaVA

Python 40 Updated Nov 9, 2024

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization

Python 83 6 Updated Jan 30, 2024
Python 318 9 Updated Jan 27, 2024

Collection of AWESOME vision-language models for vision tasks

2,582 200 Updated Dec 3, 2024

tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림 태스크들을 정리한 Deep Learning NLP 저장소입니다.

Jupyter Notebook 541 277 Updated Sep 6, 2024

LLM을 활용한 실전 AI 애플리케이션 개발

Jupyter Notebook 134 106 Updated Aug 29, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,357 1,008 Updated Nov 18, 2024

Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".

Python 271 27 Updated May 23, 2023

Weakly-supervised learning pipeline for histopathology images. Publications: Biomarker prediction in colorectal cancer (CRC)

Jupyter Notebook 68 15 Updated Feb 6, 2024

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,979 446 Updated Jan 12, 2025

Examples of XCIST applications

Python 11 1 Updated Feb 3, 2025

Siamese and triplet networks with online pair/triplet mining in PyTorch

Python 3,130 635 Updated Apr 29, 2023

Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.

Python 50 6 Updated Mar 30, 2022

S3D Text-Video model trained on HowTo100M using MIL-NCE

Python 195 21 Updated Jul 3, 2020

ImageBind One Embedding Space to Bind Them All

Python 8,549 801 Updated Jul 31, 2024

Awesome-LLM: a curated list of Large Language Model

22,134 1,819 Updated Mar 15, 2025

[NeurIPS 2024] Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis

Python 139 3 Updated Dec 2, 2024

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 3,485 498 Updated Aug 6, 2024

Codes of Learning Prior Feature and Attention Enhanced Image Inpainting (ECCV2022)

Jupyter Notebook 84 4 Updated Mar 5, 2024

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 7,636 1,253 Updated Jul 23, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 5,105 678 Updated Aug 5, 2024

Tackling the Generative Learning Trilemma with Denoising Diffusion GANs https://arxiv.org/abs/2112.07804

Python 723 82 Updated Dec 2, 2022

Contrastive unpaired image-to-image translation, faster and lighter training than cyclegan (ECCV 2020, in PyTorch)

Python 2,309 425 Updated Sep 5, 2023

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,861 325 Updated Jul 14, 2024

Image-to-Image Translation in PyTorch

Python 23,730 6,403 Updated May 14, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 27,936 3,492 Updated Jul 23, 2024

ADN: Artifact Disentanglement Network for Unsupervised Metal Artifact Reduction

Python 178 40 Updated Dec 10, 2019
Next