View erfanshayegani's full-sized avatar
😗
Building a Foundation


Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,507 1,353 Updated Feb 1, 2025

Witness the aha moment of VLM with less than $3.

Python 2,765 214 Updated Feb 21, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 7,800 553 Updated Feb 20, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,141 2,124 Updated Feb 1, 2025

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,020 70 Updated Feb 19, 2025

Famous Vision Language Models and Their Architectures

Markdown 647 34 Updated Feb 9, 2025

What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets

Python 206 20 Updated Nov 16, 2024

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,843 68 Updated Jan 22, 2025

This is the official GitHub repo for our paper: "BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models".

HTML 14 1 Updated Jul 3, 2024
Python 14 1 Updated Oct 14, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 40,348 5,398 Updated Feb 20, 2025

Recipes to train reward model for RLHF.

Python 1,185 84 Updated Feb 9, 2025

[ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models.

Python 55 2 Updated Jan 19, 2025
C# 62 18 Updated Nov 7, 2024

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

Python 3,644 271 Updated Aug 10, 2024

Official release of the ProGen models

Python 638 118 Updated Aug 4, 2023

Official code for "Large Language Models as Optimizers"

Python 499 58 Updated Dec 4, 2024

Enhancing Large Vision Language Models with Self-Training on Image Comprehension.

Python 63 4 Updated May 31, 2024

Code for the NeurIPS 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"

Python 41 3 Updated Jan 15, 2025

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 917 39 Updated Feb 21, 2025

Biological foundation modeling from molecular to genome scale

Jupyter Notebook 1,308 161 Updated Dec 18, 2024
Python 14 Updated Oct 30, 2024

Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)

Python 127 13 Updated Oct 1, 2024

DrugGPT: A GPT-based Strategy for Designing Potential Ligands Targeting Specific Proteins

Python 111 15 Updated Feb 18, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,423 1,743 Updated Feb 19, 2025

LLM Unlearning

Python 142 19 Updated Oct 20, 2023

This repository contains the code and data for the paper "SelfIE: Self-Interpretation of Large Language Model Embeddings" by Haozhe Chen, Carl Vondrick, and Chengzhi Mao.

Python 44 3 Updated Dec 9, 2024

⏰ Computer Architecture and Security Conference Deadline Countdowns (Based on AI Deadlines)

JavaScript 5 Updated Dec 20, 2024

Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper

TypeScript 4,845 765 Updated Sep 28, 2024
Python 7 3 Updated Mar 13, 2024