Skip to content
View Eric-mingjie's full-sized avatar

Block or report Eric-mingjie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Make huge neural nets fit in memory

Python 2,746 271 Updated Apr 26, 2020

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,342 224 Updated Dec 12, 2024

Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI

Python 267 19 Updated Nov 7, 2024

Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

Python 19,665 1,384 Updated Dec 24, 2024

Does Refusal Training in LLMs Generalize to the Past Tense? [NeurIPS 2024 Safe Generative AI Workshop (Oral)]

Python 58 8 Updated Oct 13, 2024
Python 21 Updated Jul 22, 2024
103 1 Updated Mar 14, 2024

Code accompanying the paper "Massive Activations in Large Language Models"

Python 130 8 Updated Mar 4, 2024

Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training

Python 427 31 Updated Feb 28, 2024

Landing Page for TOFU

Python 105 27 Updated Dec 20, 2024

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,116 54 Updated Nov 22, 2024
Python 170 12 Updated Sep 26, 2024

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 9,505 848 Updated Aug 7, 2024

Solve puzzles. Learn CUDA.

Jupyter Notebook 10,116 960 Updated Sep 1, 2024

Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"

Python 53 8 Updated Jun 26, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 38,078 6,112 Updated Dec 9, 2024

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 2,429 260 Updated Aug 13, 2024

A simple and effective LLM pruning approach.

Python 688 96 Updated Aug 9, 2024

Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".

Python 157 11 Updated May 7, 2024
Python 1,514 131 Updated Apr 27, 2023

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,620 132 Updated Aug 4, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 4,948 463 Updated Jun 22, 2024

Test-Time Adaptation via Conjugate Pseudo-Labels

Python 39 3 Updated May 25, 2023

Tools for understanding how transformer predictions are built layer-by-layer

Python 448 48 Updated Jun 2, 2024

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,606 350 Updated Dec 7, 2024

A prize for finding tasks that cause large language models to show inverse scaling

601 26 Updated Oct 11, 2023

DeblurSR: Event-Based Motion Deblurring Under the Spiking Representation (AAAI 2024)

Python 26 1 Updated Nov 8, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,991 4,172 Updated Dec 20, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,729 2,225 Updated Jul 29, 2024
Next