Skip to content
View PteroMaplePT's full-sized avatar

Block or report PteroMaplePT

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
9 stars written in Python
Clear filter

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 31,913 4,849 Updated Dec 15, 2024

Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

Python 19,293 1,354 Updated Dec 12, 2024

Efficient Triton Kernels for LLM Training

Python 3,830 230 Updated Dec 15, 2024

SEEN: Structured Event Enhancement Network for Explainable Need Detection of Information Recall Assistance

Python 4 Updated Aug 21, 2022

Learning to Generate Explanation from e-Hospital Services for Medical Suggestion

Python 3 Updated Nov 3, 2022

Contrastively learning participant representations per round in thread-based debates.

Python 2 Updated Oct 25, 2023

Analysis Model of Discourse Relations within a Document(AMDRD)

Python 2 1 Updated Aug 11, 2023
Python 1 1 Updated Jun 20, 2021