NNVM is a reusable computational graph optimization and compilation stack for deep learning systems. NNVM provides modules to:
- Represent deep learning workloads from front-end frameworks via a graph IR.
- Optimize computation graphs to improve performance.
- Compile into executable modules and deploy to different hardware backends with minimum dependency.
NNVM is designed to add new frontend, operators and graph optimizations in a decentralized fashion without changing the core interface. NNVM is part of TVM stack, which provides an end to end IR compilation stack for deploying deep learning workloads into different hardware backends
- TinyFlow on how you can use NNVM to build a TensorFlow like API.
- Apache MXNet uses NNVM as a backend.