Parameter fixing and tidying by emlynsg · Pull Request #12 · JTuffy/compute-permit-simulator

emlynsg · 2026-02-25T22:17:21Z

No description provided.

JTuffy · 2026-02-26T22:02:29Z


-        effective_compliant = lab.is_compliant or lab.has_permit
-        violation_found = auditor.audit_finds_violation(effective_compliant)
+        is_actually_compliant = ao.realized_excess <= 0


Nice work making this resolve in one place. I think this makes more sense. I'll add myself a todo to record the channel origin in results

JTuffy · 2026-02-26T22:31:40Z

@@ -1,17 +1,21 @@
 {
-    "name": "Crisis World (Maximum Safety)",


Running this scenario I noticed some steps went by without formal audit. I think this is because 0.3 * 0.1 -> less than 1 expected audit per turn. Separate note on this

Good catch!

JTuffy · 2026-02-26T22:38:16Z

+            p_base = self.config.base_prob + signal * (1.0 - self.config.base_prob)
+        else:
+            p_base = self.config.base_prob
+        return min(1.0, p_base * audit_coefficient)


This might be a matter of opinion, but should the audit coefficient scale the signal and the base_prob? Currently a very low coefficient / high evasion can decimate formal audit base prob. Alternatively, it could affect just the firm's signal, which would preserve audit base prob.

This is a good point, and I think your logic makes complete sense. The only trouble is using the audit coefficient without signals activated. I think this is maybe unlikely (since if we are studying realistic auditing we'd have both turned on). I think in the end you're right, but we will need to be clear in the documentation. I will post to the team about this tonight.

JTuffy · 2026-02-26T22:42:15Z


-        # Combined probability: audit occurs AND catches violation
-        p_catch = p_audit * p_catch_if_audited
+    def compute_catch_probability(self, p_w: float = 0.0, p_m: float = 0.0) -> float:


(minor) some params could be cleaned up

I'll double check this bit

JTuffy · 2026-02-26T22:44:14Z

+    p_total = 1 - (1 - p_audit × p_stage2) × (1 - p_whistleblower) × (1 - p_monitoring)
+    whistleblower and monitoring are rolled independently of the audit.
+
+Detection channel labels (used in AgentOutcome.caught_by):


caught_by might be outdates

Ah yeah I think you're right. I'll double check here too.

JTuffy · 2026-02-26T22:50:00Z

Overall really great job! Had no idea it would be so much work, but these are solid improvements. I made a few small points, nothing too important. Happy for you to merge when you're ready

JTuffy

Great work

emlynsg added 2 commits February 25, 2026 23:16

fixing params

788503b

final fixes

399c64b

JTuffy reviewed Feb 26, 2026

View reviewed changes

JTuffy approved these changes Feb 26, 2026

View reviewed changes

emlynsg added 2 commits February 27, 2026 22:43

small fixes

bdbe5d8

fixes based on Joels comments

cb6791e

JTuffy merged commit 0dbb563 into main Feb 28, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parameter fixing and tidying#12

Parameter fixing and tidying#12
JTuffy merged 4 commits into
mainfrom
final-parameters-tidy

emlynsg commented Feb 25, 2026

Uh oh!

JTuffy Feb 26, 2026

Uh oh!

JTuffy Feb 26, 2026

Uh oh!

emlynsg Feb 27, 2026

Uh oh!

JTuffy Feb 26, 2026 •

edited

Loading

Uh oh!

emlynsg Feb 27, 2026

Uh oh!

JTuffy Feb 26, 2026

Uh oh!

emlynsg Feb 27, 2026

Uh oh!

JTuffy Feb 26, 2026

Uh oh!

emlynsg Feb 27, 2026

Uh oh!

JTuffy commented Feb 26, 2026

Uh oh!

JTuffy left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

emlynsg commented Feb 25, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

JTuffy Feb 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

JTuffy commented Feb 26, 2026

Uh oh!

JTuffy left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

JTuffy Feb 26, 2026 •

edited

Loading