Raiju

Pre-release
@colinpwahl colinpwahl released this 02 May 15:43
· 14 commits to main since this release
c608fba

Dragon 0.9 Release Summary

This release improves scalability and performance when launching 10k or more processes and greatly improves distributed dictionary performance. Highlights include:

  • Improvements to ProcessGroup to provide better user experience and performance
  • Improved launch time for large numbers of processes by enabling batch launch
  • New implementation for distributed dictionary that improves performance and scalability
  • Support for placement of processes via Policy API
  • Bug fix for launching a Pool of pools

Two sets of packages are below. The ones with "CRAYEX" in the name include the RDMA-based transport feature and are for Cray EX systems only. The other packages use the TCP-based transport and will work on generic clusters and on single-node systems such as laptops. Note that the TCP-based transport package may not scale beyond 16 nodes for some use cases.