Skip to content
View vasucp1207's full-sized avatar
🦀
🦀

Highlights

  • Pro

Organizations

@jotaijs @biomejs @fury-lang

Block or report vasucp1207

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

C/C++ frontend for MLIR. Also features polyhedral optimizations, parallel optimizations, and more!

C++ 513 127 Updated Oct 2, 2024

FFMPEG Assembly Language Lessons

1,757 48 Updated Jan 27, 2025

A nobuild content delivery network(CDN) for modern web development.

Go 3,324 158 Updated Feb 22, 2025

Denograd is a dependency-free ML library in Typescript for model inference and training with support to WebGPU and other runtimes.

TypeScript 49 Updated Feb 22, 2025

Machine learning compiler based on MLIR for Sophgo TPU.

C++ 672 167 Updated Feb 8, 2025

a simple end to end example of taking a ML graph (TF2 / PyTorch) and running it on a device [cpu, gpu]

MLIR 30 11 Updated Jan 20, 2021

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 2,973 508 Updated Feb 22, 2025

OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.

C 6,593 1,533 Updated Feb 22, 2025

GPU-accelerated compiler

Futhark 340 10 Updated Mar 20, 2024

a categorical deep learning compiler

Python 195 6 Updated May 7, 2024

Compiler for LightGBM gradient-boosted trees, based on LLVM. Speeds up prediction by ≥10x.

Python 394 33 Updated Dec 4, 2024

A massively parallel, high-level programming language

Rust 18,356 458 Updated Feb 22, 2025

Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch

Cuda 805 170 Updated Jul 19, 2023

A polyhedral compiler for expressing fast and portable data parallel algorithms

C++ 932 133 Updated Nov 20, 2024

A model compilation solution for various hardware

MLIR 405 44 Updated Feb 20, 2025

A verified compiler for a lazy functional language

Standard ML 34 4 Updated Feb 14, 2025

[wip] Deep Learning Compiler based on Polyhedral Compiler, Light-weight IRs, and Optimizing Pattern Matcher.

Common Lisp 185 11 Updated Feb 16, 2025

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 11,230 1,648 Updated Aug 8, 2024

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

C++ 1,433 533 Updated Feb 19, 2025

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 12,039 3,517 Updated Feb 21, 2025

NuMojo is a library for numerical computing in Mojo 🔥 similar to numpy in Python.

Mojo 133 18 Updated Feb 20, 2025

A high-performance numeric computation library.

Rust 111 29 Updated Feb 17, 2025

A Machine Learning framework from scratch in Pure Mojo 🔥

Mojo 441 29 Updated Jan 18, 2025

Inference Llama 2 in one file of pure 🔥

Mojo 2,107 140 Updated May 21, 2024

graph based intermediate representation and backend for optimising compilers

C 499 58 Updated Jan 8, 2025

In-memory x86-64 assembler for JIT compiler.

Rust 70 8 Updated Dec 10, 2024

Fast Mustache template engine implementation in pure Rust.

Rust 320 29 Updated Feb 13, 2025
Next