Stars
Make huge neural nets fit in memory
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Does Refusal Training in LLMs Generalize to the Past Tense? [NeurIPS 2024 Safe Generative AI Workshop (Oral)]
Code accompanying the paper "Massive Activations in Large Language Models"
Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training
This repository provides the code and model checkpoints for the AIMv1 and AIMv2 research projects.
PyTorch code and models for the DINOv2 self-supervised learning method.
Official PyTorch implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, finetuning, evaluating, and serving LLMs in JAX/Flax.
Code and data accompanying our arXiv paper "Faithful Chain-of-Thought Reasoning".
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Test-Time Adaptation via Conjugate Pseudo-Labels
Tools for understanding how transformer predictions are built layer-by-layer
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
A prize for finding tasks that cause large language models to show inverse scaling
DeblurSR: Event-Based Motion Deblurring Under the Spiking Representation (AAAI 2024)
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Instruct-tune LLaMA on consumer hardware