-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Deployment and running of Spark and Dask. #113
Comments
This is not related to EESSI (or the software stack in general) but the underlying system and the (network) policies used there. This is not something that EESSI can influence, aside from maybe documenting what tweaking might be required. |
NERSC does seem to support this, https://docs.nersc.gov/analytics/dask/ not sure what their setup is though. |
A suggested network policy would be helpful to have. Would initially suggest an internal zone for the compute nodes that has many open ports (if using TCP/IP), but which is separated from communication to the outside world. |
Sure, many sites support Dask, I'm just saying that it is dependent on network policies (and these can even differ between login and compute nodes). |
Big data software such as Spark and Dask uses ip addresses and fixed ports that are usually not provided by cluster managers such as SLURM. Deployment of these should be made possible on EESSI, and documentation on their use provided. While it is helpful to have these as modules, self installation in user home directory should also be enabled.
The text was updated successfully, but these errors were encountered: