Facebook
Menlo Park
flash-attention Public
Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License Updated Feb 23, 2025
cutlass Public
Forked from NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
C++ Other Updated Apr 12, 2024
colfax-cutlass-kernels Public
Forked from ColfaxResearch/cutlass-kernels
C++ MIT License Updated Mar 22, 2024
pytorch Public
Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other Updated Oct 23, 2023
triton Public
Forked from triton-lang/triton
Development repository for the Triton language and compiler
C++ MIT License Updated Oct 1, 2023
AITemplate_incubator Public
Forked from facebookincubator/AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Python Apache License 2.0 Updated May 18, 2023
FBGEMM Public
Forked from pytorch/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
C++ Other Updated Nov 11, 2022
fairscale Public
Forked from facebookresearch/fairscale
PyTorch extensions for high performance and large scale training.
Python Other Updated Nov 23, 2021
glow Public
Forked from pytorch/glow
Compiler for Neural Network hardware accelerators
C++ Apache License 2.0 Updated May 7, 2021
javahomework Public
Automatically exported from code.google.com/p/javahomework
Java Updated Jan 6, 2016