Skip to content

Latest commit

 

History

History
 
 

Scripts

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 

This directory contains different script helping using different components of CNTK.

CNTK Text format Converters

Two Python Scripts for converting Data to CNTK Text format for using as an input for CNTK Text Format Reader (see https://github.com/microsoft/cnTK/wiki/CNTKTextFormat-Reader).

txt2ctf.py 

Converts a set of dictionary files and a plain text file to CNTK Text format. Run python txt2ctf.py -h to see usage instructions. See the comments in the beginning of the script file for the specific usage example.

uci2ctf.py

Converts data stored in a text file in UCI format to CNTK Text format. Run python uci2ctf.py -h to see usage instructions and example. Also see a usage example below:

python Scripts/uci2ctf.py --input_file Examples/Image/MNIST/Data/Train-28x28.txt --features_start 1 --features_dim 784 --labels_start 0 --labels_dim 1 --num_labels 10  --output_file Examples/Image/MNIST/Data/Train-28x28_cntk_text.txt

input_file – original dataset in the (columnar) UCI format features_start – index of the first feature column (start parameter in the UCIFastReader config, see https://github.com/Microsoft/CNTK/wiki/UCI-Fast-Reader) features_dim – number of feature columns (dim parameter in the UCIFastReader config) labels_start - index of the first label column labels_dim – number of label columns num_labels – number of possible label values (labelDim parameter in the UCIFastReader config) output_file – path and filename of the resulting dataset.