This repo provides a set of reference AI models for use with the Zhouyi SDK.
Model download over SFTP (suggested client: FileZilla)
Host: sftp://sftp01.armchina.com
Account: zhouyi.armchina
Password: 114r3cJd
Zhouyi NPU IP is a high-efficiency, high-performance AI processor that supports a wide range of AI applications, including vision, audio/speech, and NLP. It offers fixed-function accelerators that speed up common operations such as convolution and activation, together with flexible programmability to support new or customized operations.
Zhouyi NPU IP has been deployed in a number of market segments, including surveillance, automotive, and IoT. With proven performance and scalability, Zhouyi NPU IP covers 0.25 TOPS to hundreds of TOPS, targeting markets from edge computing to the data center.
Zhouyi SDK is a full-stack platform that gives users fast time-to-market development and deployment capability.
- Support for various frameworks
  Supports TensorFlow, TensorFlow Lite, Caffe, PyTorch, and ONNX; support can be extended to other frameworks through IR conversion.
- Graph optimization
  Graph optimization techniques such as conv+batchnorm+relu fusion reduce network complexity so that the graph runs fast and efficiently on Zhouyi hardware.
- Quantization
  Supports models trained with quantization-aware training as well as post-training quantization. Provides advanced quantization techniques natively, along with an interface for user-customized quantization parameters.
- IR (intermediate representation)
  The Zhouyi IR is public and can serve as a standard for other frameworks to support.
- Customization
  Provides a plugin interface that makes it easy for users to develop custom operations and integrate them into the SDK.
- Simulator
  Provides an instruction-level simulator that makes it easy for users to evaluate a model before porting it to hardware.
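
To illustrate the conv+batchnorm+relu fusion mentioned under graph optimization, here is a minimal pure-Python sketch of the general idea: the batchnorm scale and shift are folded into the convolution weights and bias, so one convolution followed by relu replaces three operations. This is not the Zhouyi SDK implementation. All function names (`conv1d`, `batchnorm`, `relu`, `fold_batchnorm`) and the toy data are hypothetical, and a 1-D, single-input-channel convolution is assumed to keep the example short.

```python
import math


def conv1d(x, weights, bias):
    """Valid 1-D cross-correlation: one input channel, one kernel per output channel."""
    k = len(weights[0])
    return [
        [sum(w[j] * x[i + j] for j in range(k)) + b for i in range(len(x) - k + 1)]
        for w, b in zip(weights, bias)
    ]


def batchnorm(y, gamma, beta, mean, var, eps=1e-5):
    """Inference-time batchnorm using fixed per-channel statistics."""
    return [
        [g * (v - m) / math.sqrt(s + eps) + bt for v in channel]
        for channel, g, bt, m, s in zip(y, gamma, beta, mean, var)
    ]


def relu(y):
    return [[max(0.0, v) for v in channel] for channel in y]


def fold_batchnorm(weights, bias, gamma, beta, mean, var, eps=1e-5):
    """Fold batchnorm into the conv parameters (hypothetical helper).

    Per output channel: scale = gamma / sqrt(var + eps)
      w' = w * scale
      b' = (b - mean) * scale + beta
    """
    folded_w, folded_b = [], []
    for w, b, g, bt, m, s in zip(weights, bias, gamma, beta, mean, var):
        scale = g / math.sqrt(s + eps)
        folded_w.append([wi * scale for wi in w])
        folded_b.append((b - m) * scale + bt)
    return folded_w, folded_b


# Toy data (made up for illustration): 5 input samples, 2 output channels, kernel size 3.
x = [0.5, -1.0, 2.0, 0.0, 1.5]
weights = [[1.0, 0.0, -1.0], [0.5, 0.5, 0.5]]
bias = [0.1, -0.2]
gamma, beta = [1.5, 0.8], [0.0, 0.3]
mean, var = [0.2, 0.4], [1.0, 0.25]

# Unfused graph: conv -> batchnorm -> relu
reference = relu(batchnorm(conv1d(x, weights, bias), gamma, beta, mean, var))

# Fused graph: a single conv with folded parameters -> relu
fused_w, fused_b = fold_batchnorm(weights, bias, gamma, beta, mean, var)
fused = relu(conv1d(x, fused_w, fused_b))

# The two graphs should agree up to floating-point rounding.
max_diff = max(abs(a - b) for r, f in zip(reference, fused) for a, b in zip(r, f))
print("max difference between unfused and fused outputs:", max_diff)
```

Because the folding happens once, ahead of time, the fused graph does no per-element batchnorm arithmetic at inference, which is why this kind of fusion reduces network complexity.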