Skip to content
View maks5507's full-sized avatar

Highlights

  • Pro

Block or report maks5507

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

YaFSDP: Yet another Fully Sharded Data Parallel

Python 864 45 Updated Dec 24, 2024

Cramming the training of a (BERT-type) language model into limited compute.

Python 1,304 99 Updated Jun 13, 2024

🗃️ A Python ORM-like interface for the Clingo Answer Set Programming (ASP) reasoner

Python 53 5 Updated Jul 1, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,595 491 Updated Dec 15, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,743 520 Updated Dec 14, 2024

Fine-tuning LLMs using QLoRA

Jupyter Notebook 243 53 Updated Jun 8, 2024

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,276 401 Updated Sep 13, 2024

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 3,814 290 Updated Dec 30, 2024

distributed trainer for LLMs

Python 554 79 Updated May 20, 2024

Efficient Attention for Long Sequence Processing

Python 90 11 Updated Dec 17, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,070 5,035 Updated Jan 3, 2025

Fast and memory-efficient exact attention

Python 14,882 1,407 Updated Jan 2, 2025

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Jupyter Notebook 1,547 95 Updated Feb 16, 2024

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,137 827 Updated Jun 10, 2024

A part of options trading educational platform project implemented for HackNYU 2023.

Python 2 1 Updated Feb 27, 2023

The most lightweight python docker image possible

Python 51 7 Updated Dec 10, 2024

Resources, datasets, papers on Question Answering

676 193 Updated Mar 17, 2023

Tools for checking ACL paper submissions

Python 604 48 Updated Oct 20, 2024

MT Evaluation in Many Languages via Zero-Shot Paraphrasing

Python 102 23 Updated Jul 25, 2024

Facebook Low Resource (FLoRes) MT Benchmark

Python 714 124 Updated Nov 20, 2023

Multi-container environment with Hadoop, Spark and Hive

Shell 3 5 Updated Jan 31, 2023
Python 3 1 Updated Mar 25, 2021

Theano code for experiments in the paper "A Hybrid Convolutional Variational Autoencoder for Text Generation."

Python 205 45 Updated Oct 5, 2018

How to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.

Python 134 12 Updated Jun 30, 2022

ELSA combines extractive and abstractive approaches to the automatic text summarization

Python 4 Updated Apr 6, 2021

A PyTorch implementation of Transformer in "Attention is All You Need"

Python 104 29 Updated Dec 6, 2020
Python 57 12 Updated Sep 13, 2022

Tools for extracting tables and results from Machine Learning papers

Python 397 55 Updated Nov 28, 2022

Quantile-based approach to estimating cognitive text complexity

Python 5 Updated Sep 15, 2020