Skip to content

Commpath#117

Open
rountree wants to merge 34 commits intollnl:develfrom
rountree:commpath
Open

Commpath#117
rountree wants to merge 34 commits intollnl:develfrom
rountree:commpath

Conversation

@rountree
Copy link
Collaborator

@rountree rountree commented Oct 24, 2025

Renames location environment variables, fields, and enums to commpath.

Builds with --with-rm=[flux|serial|slurm]. Can't build slurm-plugin, but that's not due to these changes.

@rountree
Copy link
Collaborator Author

rountree commented Nov 5, 2025

Found a failure case. Hold off on review for now.

Previously,

chosen_realized_cachepath was copied into
set_intercept_readlink_cachepath()

chosen_realized_cachepath and chosen_parsed_cachepath were copied
into set_should_intercept_cachepath()

This PR removes both setter functions and makes the original
pointers global.
Removes chosen_cachepath and cachepath_bitindex from
  spindle_launch.h

Updates initialization of matching variables in ldcs_process_data.

determineValidCachePaths() moved from spindle_be.cc to
  ldcs_audit_server_process.c to get ldcs_process_data visibility.

Added #include "parseloc.h" to ldcs_audit_server_process.c to get
  declaration of determineValidCachePaths().

Relocated "parseloc.h" to src/util so ldcs_audit_server_process.c
  could find it.

Trued up signedness of types caused my making "parseloc.h" more
  visible, e.g., cachepath_bitidx is now uint64_t everywhere.
The three-message-reply response is now a single message with
two strings.  The symbolic version of the cachepath is no longer
communicated as it was not being used.
New name is ldcs_audit_server_md_allreduce_AND().

If we get to the point where we're using other allreduce operations
we can solve the problem of duplicating the op list in md-land and
cobo-land.  For now, we're only using one op in md-land, so the
op can go into the function name.
dependabot bot added 2 commits January 5, 2026 17:21
Bumps [docker/setup-buildx-action](https://github.com/docker/setup-buildx-action) from 3.11.1 to 3.12.0.
- [Release notes](https://github.com/docker/setup-buildx-action/releases)
- [Commits](docker/setup-buildx-action@e468171...8d2750c)

---
updated-dependencies:
- dependency-name: docker/setup-buildx-action
  dependency-version: 3.12.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Bumps [actions/checkout](https://github.com/actions/checkout) from 6.0.0 to 6.0.1.
- [Release notes](https://github.com/actions/checkout/releases)
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md)
- [Commits](actions/checkout@1af3b93...8e8c483)

---
updated-dependencies:
- dependency-name: actions/checkout
  dependency-version: 6.0.1
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Additionally populates /etc/environment just in case ssh is used
to set up the servers.
enable_maintainer_mode
with_default_port
with_default_num_ports
with_localstorage
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The configure script will give a warning for unrecognized arguments, so removing the --with-localstorage causes builds that still specify --with-localstorage to configure without error, but the option is unused. For example, the script in the CI environment that builds Spindle uses --with-localstorage=/tmp, and you can see the warning in the CI output:

#18 0.165 configure: WARNING: unrecognized options: --with-localstorage

https://github.com/llnl/Spindle/actions/runs/21076617750/job/60620001124?pr=117#step:4:826

I wonder if it would make sense to either:

  1. Give an error rather than a warning if --with-localstorage is specified, where the error message says that --with-cachepaths and --with-commpath should be used instead; or,
  2. Still accept --with-localstorage and use it to set the cachepaths and commpath value.

We should also change the Spindle configure line used in CI to use the new arguments instead of --with-localstorage, in containers/spindle-serial-ubuntu/scripts/build_spindle.sh, containers/spindle-flux-ubuntu/scripts/build_spindle.sh, and containers/spindle-slurm-ubuntu/testing/scripts/build_spindle.sh.

Comment on lines +1286 to +1290
// Ignore Spindle fifo files for now.
if ( strncmp( "315", d->d_name, 3 ) == 0 )
continue;
if ( strncmp( "316", d->d_name, 3 ) == 0 )
continue;
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Where do these numbers (315, 316) come from?

@@ -0,0 +1 @@
Hello, world!
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this was just added to force the CI to run and should be removed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants