From the root of pytorch repo, run:
python -m benchmarks.tensorexpr --help
to show documentation.
An example of an actual command line that one might use as a starting point:
python -m benchmarks.tensorexpr --device gpu --mode fwd --jit-mode trace --cuda-fuser=te