-
vidur Public
Forked from microsoft/vidur
A large-scale simulation framework for LLM inference
Python MIT License Updated Oct 1, 2024 -
amd_matrix_instruction_calculator Public
Forked from ROCm/amd_matrix_instruction_calculator
A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators
Python MIT License Updated Jan 2, 2024 -
neuralangelo Public
Forked from NVlabs/neuralangelo
Official implementation of "Neuralangelo: High-Fidelity Neural Surface Reconstruction" (CVPR 2023)
Python Other Updated Aug 21, 2023 -
instant-ngp Public
Forked from NVlabs/instant-ngp
Instant neural graphics primitives: lightning fast NeRF and more
Cuda Other Updated Aug 10, 2023 -
rocBLAS-Examples Public
Forked from ROCm/rocBLAS-Examples
Examples illustrating usage of the rocBLAS library
C++ MIT License Updated Jul 17, 2023 -
FlexGen Public
Forked from FMInference/FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
Python Apache License 2.0 Updated Jun 6, 2023 -
LMFlow Public
Forked from OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Language Model for All.
Python Apache License 2.0 Updated Apr 6, 2023 -
ControlNet Public
Forked from lllyasviel/ControlNet
Let us control diffusion models
Python Apache License 2.0 Updated Feb 22, 2023 -
lammps Public
Forked from lammps/lammps
Public development project of the LAMMPS MD software package
C++ GNU General Public License v2.0 Updated Dec 24, 2021 -
rocHPCG Public
Forked from ROCm/rocHPCG
HPCG benchmark based on ROCm platform
C++ BSD 3-Clause "New" or "Revised" License Updated Jun 30, 2021 -
scipy Public
Forked from scipy/scipy
Scipy library main repository
Python BSD 3-Clause "New" or "Revised" License Updated Feb 13, 2020 -
HIP-Performance-Optmization-on-VEGA64 Public
Forked from fsword73/HIP-Performance-Optmization-on-VEGA64
14 basic topics for VEGA64 performance optimization
C++ Updated Nov 8, 2019 -
rocFFT Public
Forked from ROCm/rocFFT
Next generation FFT implementation for ROCm
C++ MIT License Updated Aug 14, 2018 -
k8s-device-plugin Public
Forked from ROCm/k8s-device-plugin
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
Go Apache License 2.0 Updated Jul 6, 2018 -
MEGADOCK Public
Forked from akiyamalab/MEGADOCK
An ultra-high-performance protein-protein docking for heterogeneous supercomputers
C++ GNU General Public License v3.0 Updated May 31, 2018 -
caffe Public
Forked from weiliu89/caffe
Caffe: a fast open framework for deep learning.
C++ Other Updated Jan 2, 2018 -
rocm_bandwidth_test Public
Forked from ROCm/rocm_bandwidth_test
Bandwidth test for ROCm
C++ Other Updated Dec 1, 2017 -
resnet-imagenet-caffe Public
Forked from ethanhe42/resnet-imagenet-caffe
train resnet on imagenet from scratch with caffe
Shell Updated Oct 11, 2017 -
caffe_toolkit Public
Forked from binLearning/caffe_toolkit
Caffe toolkit, including installing Caffe, creating various networks.
Python Updated Sep 30, 2017 -
pva-faster-rcnn Public
Forked from sanghoon/pva-faster-rcnn
Demo code for PVANET
Python Other Updated Nov 21, 2016 -
SqueezeNet Public
Forked from forresti/SqueezeNet
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters
BSD 2-Clause "Simplified" License Updated Sep 5, 2016 -
detectionAnalysis Public
Forked from weiliu89/detectionAnalysis
http://web.engr.illinois.edu/~dhoiem/projects/detectionAnalysis/
MATLAB Updated Mar 5, 2016 -
neuraltalk2 Public
Forked from karpathy/neuraltalk2
Efficient Image Captioning code in Torch, runs on GPU
Lua Updated Dec 5, 2015 -
video-indexing Public
Forked from yuhan210/video-indexing
Automatic indexing on live streaming videos with deep neural network
HTML Updated Nov 23, 2015 -
neuraltalk Public
Forked from karpathy/neuraltalk
NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.
Python Updated Nov 20, 2015 -
caffe-lstm Public
Forked from junhyukoh/caffe-lstm
LSTM implementation on Caffe
C++ Other Updated Nov 10, 2015