Fix for standard error and output stream by xop5 · Pull Request #170 · CDCgov/cfa-cloudops

xop5 · 2026-05-03T21:40:04Z

No description provided.

ryanraaschCDC · 2026-05-05T14:18:49Z

        command_line: str,
        mount_pairs: list[dict] | None = None,
        name_suffix: str = "",
+        blob_container: str | None = None,


I'm not sure we need to introduce another parameter to this function. We're already saving the blob container name that is specified in save_logs_to_blob in create_job() and saved to self. This could get passed in directly when we need it.

ryanraaschCDC · 2026-05-05T14:20:19Z

-            )
-            if rel_mnt_path != "ERROR!":
-                rel_mnt_path = "/" + helpers.format_rel_path(rel_path=rel_mnt_path)
+            rel_mnt_path = "/"


do we still need this rel_mnt_path = "/" when we aren't appending the stdout/stderr file names to the task command?

ryanraaschCDC · 2026-05-05T14:24:37Z

@@ -838,16 +840,7 @@ def add_task(
            logger.debug(f"Using provided container name: {container_name}.")

        if self.save_logs_to_blob:


we could maybe get rid of this check on if self.save_logs_to_blob here and just create the OutputFile in the batch_helpers.add_task

ryanraaschCDC · 2026-05-05T14:30:52Z

            job_name=job_name,
            task_id_base=job_name,
            command_line=command_line,
            save_logs_rel_path=rel_mnt_path,


do we use rel_mnt_path anywhere anymore? If not, we could change the save_logs_rel_path parameter name to something like save_log_blob_container that a user can input a blob container directly or if it's None here then we can pull from self.save_logs_to_blob. This can replace the blob_container parameter below since they both do the same thing

ryanraaschCDC · 2026-05-05T14:33:21Z

    logger.debug(f"Stdout will be saved to: '{stdout_file}'")
    logger.debug(f"Stderr will be saved to: '{stderr_file}'")
-    return f"""/bin/bash -c "mkdir -p {_folder}; {command_line} > {stdout_file} 2> {stderr_file}" """
+    return f"""/bin/bash -c "mkdir -p {_folder}; {command_line} > >(tee {stdout_file}) 2> >(tee {stderr_file})" """


if we're using OutputFiles for saving logs to blob then we don't need to do any of the tee commands. We can use the command that the user provides

ryanraaschCDC · 2026-05-05T15:12:11Z

@@ -1309,24 +1324,17 @@ def add_task(
        logger.debug(f"Generated string-based task ID: '{task_id}'")

    if save_logs_rel_path is not None:


could this whole check be removed?

We still need this check so I can modify the command sent to Azure task.

ryanraaschCDC · 2026-05-05T15:16:18Z

-            working_directory="containerImageDefault",
-        ),
+    output_files = None
+    if blob_container and blob_storage_account:


instead of blob_container we check for save_logs_blob_container or whatever the parameter name is that you choose (from the comment above)

CloudClient will pass the self.save_logs_blob_container as value of blob_container parameter to batch_helpers.add_task

ryanraaschCDC · 2026-05-05T15:17:15Z

+            file_pattern="../std*.txt",
+            blob_container=blob_container,
+            blob_account=blob_storage_account,
+            path=job_name,


for the path we may want to save to job_name/task_id. I believe the output of each task is simply a stdout.txt and stderr.txt so we will need a way to distinguish between tasks too

ryanraaschCDC · 2026-05-05T15:18:11Z

 [project]
 name = "cfa.cloudops"
-version = "0.3.22"
+version = "0.3.23"


since we're doing more changes and possibly renaming parameters, I would suggest bumping the version to 0.4.0 for the next minor update

ryanraaschCDC · 2026-05-07T15:44:10Z

when I test the code on my side to input-test with a logs folder of <job_name>_logs, I get stdout and stderr saved here with long title names, rather than the nested job_nam/task_name/stdout.txt that I expected. It makes me think the OutputFiles isn't working and it's the task output redirection doing the work.

Fix for standard error and output stream

fcb6485

xop5 requested a review from ryanraaschCDC May 3, 2026 21:40

xop5 mentioned this pull request May 3, 2026

save stdout and stderr to job + blob #158

Open

ryanraaschCDC approved these changes May 4, 2026

View reviewed changes

Add streaming for blob and standard output

7e13f84

xop5 force-pushed the stream_task_output branch from d90f9d7 to 7e13f84 Compare May 5, 2026 03:57

xop5 requested a review from ryanraaschCDC May 5, 2026 03:59

ryanraaschCDC requested changes May 5, 2026

View reviewed changes

Simplified interface for CloudClient.add_task function

746818a

xop5 force-pushed the stream_task_output branch from 2978f61 to 746818a Compare May 6, 2026 15:54

Refactored add_task_collection

9491fd2

xop5 force-pushed the stream_task_output branch from 786fafa to 9491fd2 Compare May 6, 2026 16:30

Refactored add_task_collection

3281955

xop5 force-pushed the stream_task_output branch from 3100c35 to 3281955 Compare May 6, 2026 16:34

Refactored add_task_collection

cfd0f18

xop5 force-pushed the stream_task_output branch from 1c76c7a to cfd0f18 Compare May 6, 2026 16:38

xop5 and others added 2 commits May 6, 2026 16:31

Changed target folder location in blob to logs_folder

3f6e788

Enable local hook to remove __init__.py files

24943d8

xop5 requested a review from ryanraaschCDC May 6, 2026 20:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for standard error and output stream#170

Fix for standard error and output stream#170
xop5 wants to merge 8 commits intomainfrom
stream_task_output

xop5 commented May 3, 2026

Uh oh!

ryanraaschCDC May 5, 2026

Uh oh!

ryanraaschCDC May 5, 2026

Uh oh!

ryanraaschCDC May 5, 2026

Uh oh!

ryanraaschCDC May 5, 2026

Uh oh!

ryanraaschCDC May 5, 2026

Uh oh!

ryanraaschCDC May 5, 2026

Uh oh!

xop5 May 6, 2026

Uh oh!

ryanraaschCDC May 5, 2026

Uh oh!

xop5 May 6, 2026

Uh oh!

ryanraaschCDC May 5, 2026

Uh oh!

ryanraaschCDC May 5, 2026

Uh oh!

ryanraaschCDC commented May 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		@@ -838,16 +840,7 @@ def add_task(
		logger.debug(f"Using provided container name: {container_name}.")

		if self.save_logs_to_blob:

		@@ -1309,24 +1324,17 @@ def add_task(
		logger.debug(f"Generated string-based task ID: '{task_id}'")

		if save_logs_rel_path is not None:

Conversation

xop5 commented May 3, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ryanraaschCDC commented May 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants