NVIDIA
- Shanghai, China
Stars
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Magic: The Gathering Arena draft tool that utilizes 17Lands data
AITemplate is a Python framework which renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
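2024 recommendations for VPN and censorship-circumvention software in China, with pitfalls to avoid; stable and easy to use. Compares SSR "airport" services, Lantern, V2ray, Laowang VPN, self-hosted VPS proxies, and other circumvention tools, with the latest recommended VPN downloads for China and for accessing ChatGPT.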
A complete daily plan for studying to become a Google software engineer :)
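openctp provides CTPAPI-compatible interfaces for channels including CTP stock options, Zhongtai Securities XTP, Huaxin Securities Qidian TORA, Orient Securities OST, East Money Securities EMT, Interactive Brokers TWS, Esunny TAP, and Liangtou QDP, so CTP programs can connect seamlessly to each stock counter. openctp also provides a simulation environment based on the TTS trading system, likewise with CTPAPI-compatible interfaces; it supports not only all domestic futures and options products but also simulated trading of A-share stocks, funds, bonds, and stock options, and can replace Simn…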
Development repository for the Triton language and compiler
A retargetable MLIR-based machine learning compiler and runtime toolkit.
A list of awesome compiler projects and papers for tensor computation and deep learning.
A flexible and efficient deep neural network (DNN) compiler that generates a high-performance executable from a DNN model description.
Reference implementations of MLPerf™ inference benchmarks
A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors which can be used for ANN search.
Easy-to-use, modular, and extendible package of deep-learning-based CTR models.
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…
A set of cmake modules to assist in building code
Dive into CPython internals, trying to illustrate every detail of CPython implementation
Automatic architecture search and hyperparameter optimization for PyTorch
A guideline for building practical production-level deep learning systems to be deployed in real world applications.
A modular configuration of Vim and Neovim
HugeCTR is a high-efficiency GPU framework designed for training Click-Through-Rate (CTR) estimation models.
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
GoogleTest - Google Testing and Mocking Framework
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators