- NVIDIA
- Santa Clara, CA
- UTC -08:00
- huanghua1994.github.io
- in/hua-huang-146a1b104
Highlights
- Pro
TransformerEngine Public
Forked from NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
Python Apache License 2.0 Updated Dec 7, 2024
HPC_Playground Public
Some HPC experiment codes
huanghua1994.github.io Public
Forked from academicpages/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
JavaScript MIT License Updated Nov 17, 2024
CRP-SpMM Public
Forked from scalable-matrix/CRP-SpMM
Communication-Reduced Parallel SpMM
C Updated Apr 11, 2024
H2Pack Public
Forked from scalable-matrix/H2Pack
H2 Matrix Package
H2Pack-Matlab Public
Forked from xinxing02/H2Pack-Matlab
Matlab prototype code for H2Pack
M-SPARC Public
Forked from SPARC-X/M-SPARC
MATLAB GNU General Public License v3.0 Updated Oct 28, 2022
polar Public
Forked from ecrc/polar
Distributed-memory, double-precision, polar decomposition (QDWH/ZOLO-PD) of a dense matrix, svd (QDWH/ZOLOPD-SVD) of a dense matrix
C BSD 3-Clause "New" or "Revised" License Updated Sep 5, 2022
simint-generator Public
Forked from simint-chem/simint-generator
Code generator for simint vectorized integrals
C Other Updated Jul 22, 2022
ctf Public
Forked from cyclops-community/ctf
Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays
C++ Other Updated Mar 16, 2022
pvfmm Public
Forked from dmalhotra/pvfmm
A parallel kernel-independent FMM library for particle and volume potentials
C++ GNU Lesser General Public License v3.0 Updated Feb 25, 2022
ASTER Public
Acceleration with Simd inTrinsic, Easy and Reusable
YATDFT Public
Yet Another Tiny DFT
multi-gpu-programming-models Public
Forked from NVIDIA/multi-gpu-programming-models
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
Cuda BSD 3-Clause "New" or "Revised" License Updated Dec 4, 2020
llvm Public
Forked from intel/llvm
Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.
Updated Dec 3, 2020
High Performance ParalleX runtime system
C BSD 3-Clause "New" or "Revised" License Updated Nov 21, 2020
yaksa Public
Forked from pmodels/yaksa
Yaksa: High-performance Noncontiguous Data Management
C Other Updated Oct 20, 2020
kokkos-remote-spaces Public
Forked from kokkos/kokkos-remote-spaces
This repository contains remote memory spaces, which implement shared memory semantics across multiple processes.
C++ Other Updated Sep 10, 2020
SPARC Public
Forked from SPARC-X/SPARC
Simulation Package for Ab-initio Real-space Calculations
C GNU General Public License v3.0 Updated Jul 1, 2020
smash Public
Forked from cmsi/smash
Massively parallel software for quantum chemistry calculations
Fortran Apache License 2.0 Updated Apr 5, 2020
CQC_Playground Public
Some computational quantum chemistry toy codes / prototype codes, mainly in MATLAB
MATLAB GNU Lesser General Public License v2.1 Updated Mar 24, 2020
YATSCF-DF Public archive
No longer maintained, please use https://github.com/huanghua1994/YATDFT
C GNU Lesser General Public License v2.1 Updated Jan 2, 2020
YATSCF Public archive
No longer maintained, please use https://github.com/huanghua1994/YATDFT
C GNU Lesser General Public License v2.1 Updated Jan 2, 2020
DFTfun Public archive
Forked from xiangrufan/DFTfun_A_density_functional_theory_solver
A matlab implementation of density functional theory, for demonstrative purpose