Name		Name	Last commit message	Last commit date
Latest commit History 185 Commits
cmake		cmake
dmlc-core @ 04f9195		dmlc-core @ 04f9195
docs		docs
examples		examples
include/nnvm		include/nnvm
make		make
python		python
src		src
tests		tests
tutorials		tutorials
tvm @ 5b8a8d0		tvm @ 5b8a8d0
.gitignore		.gitignore
.gitmodules		.gitmodules
.travis.yml		.travis.yml
CMakeLists.txt		CMakeLists.txt
Jenkinsfile		Jenkinsfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md

Repository files navigation

NNVM: Graph IR Stack for Deep Learning Systems

NNVM is a reusable computational graph compilation stack for deep learning systems. It provides modules to:

Represent deep learning workloads from front-end frameworks via a graph IR.
Optimize computation graphs to improve performance.
Compile into executable modules and deploy to different hardware backends with minimum dependency.

NNVM is designed to add new frontend, operators and graph optimizations in a decentralized fashion without changing the core interface. It is part of TVM stack. The compiler toolchain can target hardware backends supported by TVM. The compiled module can be deployed to server, mobile, embedded devices and browsers with minimum dependency, in languages including c++, python, javascript, java, objective-c.

The following code snippet demonstrates the general workflow of nnvm.

import tvm
from tvm.contrib import graph_runtime, rpc
import nnvm.frontend
import nnvm.compiler

# GET model from frameworks
# change xyz to supported framework name.
graph, params = nnvm.frontend.from_xyz(...)

# OPTIMIZE and COMPILE the graph to get a deployable module
# target can be "opencl", "llvm", "metal" or any target supported by tvm
target = "cuda"
graph, lib, params = nnvm.compiler.build(graph, target, {"data", data_shape}, params=params)

# DEPLOY and run on gpu(0)
module = graph_runtime.create(graph, lib, tvm.gpu(0))
module.set_input(**params)
module.run(data=data_array)
output = tvm.nd.empty(out_shape, ctx=tvm.gpu(0))
module.get_output(0, output)

# DEPLOY to REMOTE mobile/rasp/browser with minimum tvm rpc runtime
# useful for quick experiments on mobile devices
remote = rpc.connect(remote_host, remote_port)
lib.export_library("mylib.so")
remote.upload("mylib.so")
rlib = rpc.load_module("mylib.so")
# run on remote device
rmodule = graph_runtime.create(graph, rlib, remote.gpu(0))
rmodule.set_input(**params)
rmodule.run()

Links

TinyFlow on how you can use NNVM to build a TensorFlow like API.
Apache MXNet uses NNVM as a backend.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NNVM: Graph IR Stack for Deep Learning Systems

Links

About

Releases

Packages

Languages

License

padmaramesh/nnvm

Folders and files

Latest commit

History

Repository files navigation

NNVM: Graph IR Stack for Deep Learning Systems

Links

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages