seoulsky-field
  • Republic of Korea, Seoul
  • 00:00 (UTC +09:00)


Official repository of "Visual-RFT: Visual Reinforcement Fine-Tuning"

Python 1,235 56 Updated Mar 12, 2025

This repository provides a valuable reference for researchers in multimodality. Start your exploration of RL-based reasoning MLLMs here!

383 20 Updated Mar 14, 2025

[TMLR 2025🔥] A survey of autoregressive models in vision.

431 14 Updated Mar 12, 2025

MedRAX: Medical Reasoning Agent for Chest X-ray

Python 533 107 Updated Mar 11, 2025

🤗 smolagents: a barebones library for agents. Agents write Python code to call tools and orchestrate other agents.

Python 14,748 1,294 Updated Mar 14, 2025

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Python 47 Updated Feb 28, 2025

A Python package for evaluating radiology report generation using multiple standard and medical-specific metrics.

Python 3 1 Updated Mar 11, 2025

[MICCAI 2024] RadiomicsFill-Mammo: Synthetic Mammogram Mass Manipulation with Radiomics Features

Python 7 Updated Oct 21, 2024

Preference Learning for LLaVA

Python 39 Updated Nov 9, 2024

[ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745

Python 238 9 Updated Oct 30, 2024

Taming Stable Diffusion for Lip Sync!

Python 2,940 439 Updated Mar 14, 2025

A course on aligning smol models.

Jupyter Notebook 5,575 1,931 Updated Jan 24, 2025

MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization

Python 25 2 Updated Feb 11, 2025
Python 18 4 Updated Nov 12, 2024

NeurIPS'24 DB (Spotlight) | Instruction Tuning Large Language Models to Understand Electronic Health Records

Jupyter Notebook 30 1 Updated Dec 21, 2024
Python 44 1 Updated Feb 26, 2025

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 2,073 140 Updated Feb 16, 2025

[MM2024, oral] "Self-Supervised Visual Preference Alignment" https://arxiv.org/abs/2404.10501

Python 51 3 Updated Jul 26, 2024

Paper reproduction of Google's SCoRE (Training Language Models to Self-Correct via Reinforcement Learning)

Jupyter Notebook 136 22 Updated Sep 21, 2024

Repository for implementing the Back-in-Time Diffusion method for detecting medical deepfakes in CT and MRI scans, including training and evaluation tools.

Jupyter Notebook 5 1 Updated Sep 3, 2024

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models (arXiv 2023 / CVPR 2024)

Python 333 9 Updated Sep 24, 2024

MC-CoT implementation code

Python 12 1 Updated Oct 31, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,566 364 Updated Mar 14, 2025

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,192 378 Updated Jan 27, 2025

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization

Python 82 6 Updated Jan 30, 2024
Python 3,536 328 Updated Feb 24, 2025

[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources for Medical LLMs (Medical LLMs Tree, Tables, and Papers)

1,442 130 Updated Mar 10, 2025

Train transformer language models with reinforcement learning.

Python 12,499 1,688 Updated Mar 14, 2025

This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

Python 102 8 Updated Oct 16, 2024