-
University of California, Riverside - Sharif University of Technology
- California, USA 🌴 🇺🇸
- https://erfanshayegani.github.io/
- in/erfan-shayegani-5362321b2
- @Erf_Shayegani
Highlights
- Pro
Stars
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Witness the aha moment of VLM with less than $3.
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Janus-Series: Unified Multimodal Understanding and Generation Models
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
Famous Vision Language Models and Their Architectures
What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
This is the official Gtihub repo for our paper: "BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models".
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Recipes to train reward model for RLHF.
[ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models.
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
official code for "Large Language Models as Optimizers"
Enhancing Large Vision Language Models with Self-Training on Image Comprehension.
Code for Neurips 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"
Official repository for "AM-RADIO: Reduce All Domains Into One"
Biological foundation modeling from molecular to genome scale
Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)
DrugGPT: A GPT-based Strategy for Designing Potential Ligands Targeting Specific Proteins
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
This repository contains the code and data for the paper "SelfIE: Self-Interpretation of Large Language Model Embeddings" by Haozhe Chen, Carl Vondrick, and Chengzhi Mao.
⏰ Computer Architecture and Security Conference Deadline Countdowns (Based on AI Deadlines)
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper