FEAT: Propagate error messages from simulation methods to database by mberz · Pull Request #107 · choras-org/backend

mberz · 2026-06-16T11:55:59Z

Proposed changes

Add support for different StatusCodes and error messages to the CloudExecutor
Handle and propagate errors in the simulation task
- Explicitly raised errors by simulation methods are propagated with specific error message
- Unexpected errors are propagated with generic error message
Set up models and schema for error messages.

Requires choras-org/simulation-backend#18 which introduces custom error classes for the simulation methods.

- Except explicitly raised error from the simulation methods and propagate them into the database - Except unexpected errors and add them to the log, propagate general error message ext

The model and schema for the error messages propagated from the simulation method

Copilot

Pull request overview

This PR adds end-to-end propagation of simulation error messages (including non-zero container exit codes and structured JSON errors) into persisted Simulation/SimulationRun records so the frontend can display meaningful failure reasons.

Changes:

Detect non-zero simulation container exit codes and attempt to extract structured error messages from the result JSON.
Persist errorMessage on Simulation and SimulationRun, and expose it via Marshmallow schemas.
Extend CloudExecutor’s completion stub to carry an exit code and return more informative logs.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 12 comments.

Show a summary per file

File	Description
app/services/simulation_service.py	Adds exit-code checking, structured error extraction, and persists error messages on failures.
app/services/executors/cloud_executor.py	Tracks simulation failure via JSON `"error"` and surfaces an exit code through `_CompletedJob`.
app/schemas/simulation_schema.py	Exposes `errorMessage` in API responses for simulations and runs.
app/models/SimulationRun.py	Adds `errorMessage` column to persist run-level failures.
app/models/Simulation.py	Adds `errorMessage` column to persist simulation-level failures.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

    def _poll_until_complete(
        self,
        remote_json_path: str,
        local_uploads_dir: str,
        remote_app_dir: str,
        remote_sandbox_path: str,
        remote_tar_path: Optional[str] = None,
-    ) -> bool:
-        """Adaptively poll the remote job until all results reach 100 % progress.
+    ) -> tuple[bool, int]:
+        """Adaptively poll the remote job until all results reach 100 % progress or an error occurs.


                success = self._collect_outputs_and_cleanup(
                    remote_app_dir      = remote_app_dir,
                    local_uploads_dir   = local_uploads_dir,
                    remote_sandbox_path = remote_sandbox_path,
                    remote_tar_path     = remote_tar_path,
                )

-                return success
+                return (success, 0)  # Exit code 0 for success


+        success, exit_code = self._poll_until_complete(
            remote_json_path    = f"{remote_app_dir_path}/{json_filename}",
            local_uploads_dir   = local_uploads_dir,
            remote_app_dir      = remote_app_dir_path,
            remote_sandbox_path = f"{self.remote_work_dir}/{sandbox_name}",
            remote_tar_path     = f"{self.remote_work_dir}/{tar_image_name}",
        )

-        return _CompletedJob()
+        # Build log message
+        if exit_code != 0:
+            logs_output = f"Cloud job completed with error (exit code {exit_code})"
+        elif not success:
+            logs_output = "Cloud job completed but cleanup failed"
+        else:
+            logs_output = "Cloud job completed successfully"
+
+        return _CompletedJob(exit_code=exit_code, logs_output=logs_output)


        Returns:
-            bool: ``True`` if outputs were collected and the remote workspace
-                was cleaned up successfully; ``False`` if an error occurred
-                during :meth:`_collect_outputs_and_cleanup`.
+            tuple[bool, int]: A tuple of (success, exit_code) where:
+                - success: ``True`` if outputs were collected and cleaned up successfully
+                - exit_code: 0 for success, 1 for error in simulation
        """


+        except RuntimeError as ex:
+            # These are errors explicitly raised in the simulation-method
+            # including a meaningful error message.
+            # Propagate error messages to the database (frontend).
+            error_msg = str(ex)


        except Exception as ex:
+            # Unexpected errors - log full details but show generic message
+            import traceback
+
+            error_details = traceback.format_exc()


mberz added 3 commits June 16, 2026 13:29

FEAT: Propagate error messages simulation task

4a37e3d

- Except explicitly raised error from the simulation methods and propagate them into the database - Except unexpected errors and add them to the log, propagate general error message ext

FEAT: Add models and schema for error messages

24b5574

The model and schema for the error messages propagated from the simulation method

FEAT: Enable returning error and logs for remote executors

c3bd487

mberz added the enhancement New feature or request label Jun 16, 2026

mberz added this to CHORAS planning Jun 16, 2026

github-project-automation Bot moved this to Backlog in CHORAS planning Jun 16, 2026

mberz requested a review from Copilot June 16, 2026 12:54

Copilot started reviewing on behalf of mberz June 16, 2026 12:55 View session

mberz moved this from Backlog to Implementation in progress in CHORAS planning Jun 16, 2026

Copilot AI reviewed Jun 16, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

FEAT: Propagate error messages from simulation methods to database#107

FEAT: Propagate error messages from simulation methods to database#107
mberz wants to merge 3 commits into
devfrom
feat/propagate_error_messages

mberz commented Jun 16, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

mberz commented Jun 16, 2026

Proposed changes

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants