v1.2.0
This release marks the point of several important bug fixes. For performance and scalability, a long-running issue with nested RPCs causing deadlocks in large-scale workloads has been fixed. Previously, Hermes would wait for an entire task to execute before an RPC completed. This resulted in the requirement of having at least 1 RPC thread for each node Hermes was running on, becoming problematic at scales of larger than a few hundred nodes. Now RPCs are used only for the transfer of tasks, and do not wait for their completion. Hermes can now run with a single RPC thread per node, regardless of scale. In addition, we have changed our github actions to rely on Dockerhub. This improves the performance of actions dramatically while giving the benefit of having a maintained container. Lastly, we have made some changes to the Hermes spack. We now rely on thallium with cereal to maintain compatability with future mochi releases.
What's Changed
- Remove docker for now. Update readme. by @lukemartinlogan in #623
- Fix flushing and data op by @lukemartinlogan in #629
- Fix deadlock with data stager and data op by @lukemartinlogan in #630
- Remove print statements by @lukemartinlogan in #631
- Dev by @lukemartinlogan in #632
- Improvements to RAM utilization, concurrency control, and RPCs by @lukemartinlogan in #653
- Fix graceful runtime stop function by @lukemartinlogan in #654
- Dev by @lukemartinlogan in #657
- Fix the deadlock caused in stop that occurred for single-node cases by @lukemartinlogan in #658
- Change spack installation for hermes by @lukemartinlogan in #661
- Change the wiki to be our GRC website by @lukemartinlogan in #662
- Data staging will open and close files immediately by @lukemartinlogan in #664
- Add back original ReorganizeBlob by @lukemartinlogan in #666
- Point to GRC website for installation and building instructions by @lukemartinlogan in #667
- Add path regexing by @lukemartinlogan in #668
- Deployment updates by @lukemartinlogan in #673
- Always run container on workflow dispatch by @lukemartinlogan in #675
- Make spack force cereal by @lukemartinlogan in #676
Full Changelog: v1.1.1...v1.2.0