Google's XLA (Accelerated Linear Algebra) compiler/backend for Nx. It supports just-in-time (JIT) compilation to GPU (both CUDA and ROCm) and TPUs.
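For a sense of what this looks like in practice, here is a minimal sketch of JIT-compiling a numerical definition through EXLA (a hedged illustration; it assumes EXLA is installed as described below and that the EXLA.jit/1 helper is available):

defmodule Softmax do
  import Nx.Defn

  # A numerical definition (defn) that EXLA can compile to XLA
  # and run on CPU, GPU, or TPU.
  defn softmax(t), do: Nx.exp(t) / Nx.sum(Nx.exp(t))
end

# JIT-compile the function once, then call it like a regular function.
jitted = EXLA.jit(&Softmax.softmax/1)
jitted.(Nx.iota({4}))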
In order to use EXLA, you will need Elixir installed. Then create an Elixir project via the mix build tool:
$ mix new my_app
Then you can add EXLA as a dependency in your mix.exs:
def deps do
  [
    {:exla, "~> 0.5"}
  ]
end
If you are using Livebook or IEx, you can instead run:
Mix.install([
  {:exla, "~> 0.5"}
])
Once installed, you must configure Nx to use EXLA by default. Check out the "Configuration" section in the docs to learn how to do so.
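For example, one common way to do this is in your config/config.exs (treat the exact keys below as an assumption to double-check against the Nx and EXLA docs):

import Config

# Use EXLA as the default backend for Nx tensors and as the default
# compiler for numerical definitions (defn).
config :nx, :default_backend, EXLA.Backend
config :nx, :default_defn_options, [compiler: EXLA]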
EXLA relies on the XLA package to provide the necessary XLA binaries. Whenever possible it tries to download precompiled builds, but you may need to build from source if there is no version matching your target environment. For more details, including GPU/TPU support, see the usage section.
- Missing dependencies
  - Some Erlang installs do not include some of the dependencies needed to compile the EXLA NIF. You may need to install erlang-dev separately.
- Incompatible protocol buffer versions
  - Error message: "this file was generated by an older version of protoc which is incompatible with your Protocol Buffer headers".
  - If you have protoc installed on your machine, it may conflict with the protoc precompiled inside XLA. Uninstall, unlink, or remove protoc from your path to continue.
- Missing CUDA symbols
  - In some cases, you might be compiling a CPU-only version of :xla in an environment that has CUDA available. For these cases, you can set the EXLA_CPU_ONLY environment variable to any value to disable custom CUDA functionality in EXLA.
For cross-compilation, you need to set the XLA_TARGET_PLATFORM environment variable to the correct target platform value (e.g. aarch64-linux-gnu for the Raspberry Pi 4).
EXLA uses NIFs (C-interface code called from Elixir) for part of its functionality. If for any reason these fail to compile or load, troubleshooting can be tricky.
We recommend following the steps below:
- If the error appeared after upgrading EXLA, ensure that you have the proper dependency versions given by XLA. Afterwards, compile with mix compile after setting EXLA_FORCE_REBUILD to clean up cached files:
  - EXLA_FORCE_REBUILD=partial: removes only the libexla.so caches (both local and global ones).
  - EXLA_FORCE_REBUILD=true: removes the libexla.so caches and also the intermediate .o compilation artifacts retained from previous builds.
Additional notes on compilation:
- Besides the XLA dependency versions, ensuring gcc (or your compiler of choice), libc, and make are compatible is also important.
- Remember to save the compilation logs from this step for further debugging.
- It is a good idea to save the cache/<version>/libexla.so file so that the team can inspect its contents if needed.
- If the error persists, look for the ** (RuntimeError) Failed to load NIF library. exception on application start-up. This exception should provide more information on what the issue is when loading the NIF. Share these logs in an issue on GitHub so that the Nx team can investigate further.
EXLA is a regular Elixir project; therefore, to run it locally:
mix deps.get
mix test
By default, EXLA passes ["-jN"] as a Make argument, where N is System.schedulers_online() - 2, with a minimum of 1. config :exla, :make_args, ... can be used to override this default setting.
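For instance, to limit compilation to four parallel jobs (the value here is purely illustrative), you could add the following to config/config.exs:

config :exla, :make_args, ["-j4"]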
In order to run tests on a specific device, use the EXLA_TARGET environment variable, which is a dev-only variable for this project (it has no effect when using EXLA as a dependency). For example, EXLA_TARGET=cuda or EXLA_TARGET=rocm. Make sure to also specify XLA_TARGET to fetch or compile a proper version of the XLA binary.
The easiest way to build is with Docker. For GPU support, you'll also need to set up the NVIDIA Container Toolkit.
To build, clone this repo, select your preferred Dockerfile, and run:
docker build --rm -t exla:host . # Host Docker image
docker build --rm -t exla:cuda10.2 . # CUDA 10.2 Docker image
docker build --rm -t exla:rocm . # ROCm Docker image
Then to run (without CUDA):
docker run -it \
-v $PWD:$PWD \
-e TEST_TMPDIR=$PWD/tmp/bazel_cache \
-e BUILD_CACHE=$PWD/tmp/xla_extension_cache \
-w $PWD \
--rm exla:host bash
With CUDA enabled:
Note: XLA_TARGET should match your CUDA version. See: https://github.com/elixir-nx/xla#xla_target
docker run -it \
-v $PWD:$PWD \
-e TEST_TMPDIR=$PWD/tmp/bazel_cache \
-e BUILD_CACHE=$PWD/tmp/xla_extension_cache \
-e XLA_TARGET=cuda102 \
-e EXLA_TARGET=cuda \
-w $PWD \
--gpus=all \
--rm exla:cuda10.2 bash
With ROCm enabled:
docker run -it \
-v $PWD:$PWD \
-e TEST_TMPDIR=$PWD/tmp/bazel_cache \
-e BUILD_CACHE=$PWD/tmp/xla_extension_cache \
-e XLA_TARGET=rocm \
-e EXLA_TARGET=rocm \
-w $PWD \
--device=/dev/kfd \
--device=/dev/dri \
--group-add video \
--rm exla:rocm bash
Inside the container you can interact with the API from IEx using:
iex -S mix
Or you can run an example:
mix run examples/regression.exs
To run tests:
mix test
Copyright (c) 2020 Sean Moriarity
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.