SQUEEZED ATTENTION: Accelerating Long Prompt LLM Inference

Python 43 4 Updated Nov 20, 2024

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,877 1,390 Updated Feb 1, 2025

[EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!

Python 370 58 Updated Sep 4, 2024

This repository includes some of the presentations and tutorials I have made

5 Updated Jan 16, 2024

[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Python 176 12 Updated Mar 25, 2024

[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

Python 1,618 124 Updated Jul 10, 2024

Lighter web automation with Python

Python 7,551 459 Updated Feb 20, 2025

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Python 335 30 Updated Aug 13, 2024

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

9,746 755 Updated May 31, 2024

UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform…

Python 2,238 198 Updated Aug 18, 2024

Gradient-based adaptive sampling algorithms for self-supervising PINNs

Python 24 2 Updated May 8, 2023

Fork of seldridge/rocket-rocc-examples with tests for a systolic array based matmul accelerator

C 55 42 Updated Feb 15, 2025

A modular, automatable, tunable mapper for accelerator programming

Python 8 1 Updated Apr 27, 2022

Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure.

Go 520 18 Updated Jun 7, 2023

[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

Python 249 19 Updated Feb 12, 2023

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,753 163 Updated Aug 18, 2024

GLioblastoma Image Analysis for integrating brain tumor growth models with medical imaging

C++ 17 5 Updated Mar 30, 2023

A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and we are continuously improving the project. Welcome to PR the works (p…

2,002 216 Updated Nov 1, 2024

Perform data science on data that remains in someone else's server

Python 9,626 1,999 Updated Feb 23, 2025

Large datasets for conversational AI

Python 1,325 173 Updated Nov 16, 2019

A list of publicly available audio data that anyone can download for ASR or other speech activities

Shell 203 22 Updated Aug 6, 2021

[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition

Jupyter Notebook 31 2 Updated Oct 11, 2021

The People’s Speech Dataset

Jupyter Notebook 102 12 Updated Jan 11, 2024

Python 43 5 Updated Jan 30, 2024

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

Python 239 34 Updated Jan 29, 2023

Facebook AI Research's Automatic Speech Recognition Toolkit

C++ 6,408 1,012 Updated Nov 23, 2024

torch-optimizer -- collection of optimizers for PyTorch

Python 3,088 300 Updated Mar 22, 2024

Tutorial notebooks for hls4ml

Jupyter Notebook 319 142 Updated Mar 3, 2025

Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM.

Python 427 81 Updated May 15, 2023

Jupyter Notebook 7 Updated Aug 27, 2020