Stars
9 stars, written in Python
A high-throughput and memory-efficient inference and serving engine for LLMs
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Efficient Triton Kernels for LLM Training
SEEN: Structured Event Enhancement Network for Explainable Need Detection of Information Recall Assistance
Learning to Generate Explanation from e-Hospital Services for Medical Suggestion
Contrastive learning of per-round participant representations in thread-based debates.
Analysis Model of Discourse Relations within a Document (AMDRD)