fix: use absolute paths in fle inspect-eval by JackHopkins · Pull Request #363 · JackHopkins/factorio-learning-environment

JackHopkins · 2026-04-05T20:36:51Z

Summary

- Resolve `eval/inspect_integration/eval_set.py` and `agent_task.py` paths using `Path(__file__).parent` instead of hardcoded relative paths, so `fle inspect-eval` works when run from any directory (not just the source repo root)
- Fix bug where multiple `--tasks` values were joined into a single space-separated string argument instead of separate `inspect eval` arguments
- Update help text examples to use `./solver_experiments.py` instead of the old internal relative path
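
The path resolution described in the first item can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the helper name `resolve_task_spec` and the exact directory layout under the package root are assumptions, and `Path.cwd()` stands in for the `Path(__file__).parent`-derived directory so the snippet runs on its own.

```python
from pathlib import Path


def resolve_task_spec(package_dir: Path, script_name: str, task_name: str) -> str:
    """Build an absolute `path@task` spec (hypothetical helper, for illustration)."""
    # Assumed layout: <package_dir>/eval/inspect_integration/<script_name>.
    script = package_dir.resolve() / "eval" / "inspect_integration" / script_name
    return f"{script}@{task_name}"


# In an installed module, package_dir would be derived from Path(__file__).parent.
# Path.cwd() is used here only so the example is self-contained.
spec = resolve_task_spec(Path.cwd(), "eval_set.py", "iron_plate_throughput")
print(spec)
```

Because the script path is anchored to a directory rather than to the caller's working directory, the resulting spec stays valid wherever the command is launched from.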

Test plan

- `cd /tmp && fle inspect-eval -h` succeeds without errors
- `fle inspect-eval --tasks iron_plate_throughput --model openai/gpt-4o-mini --limit 1` prints command with absolute paths
- `fle inspect-eval --tasks task_a,task_b --model openai/gpt-4o-mini` passes each task spec as a separate argument
- `--eval-set-file ./custom.py` still resolves relative to CWD (not changed by this PR)

The `inspect eval` subprocess commands used hardcoded relative paths like `eval/inspect_integration/eval_set.py@task_name`, which only worked when run from the source repo root. This resolves the paths relative to the installed package using `Path(__file__).parent`, following the same pattern already used by `fle_cluster`. Also fixes a bug where multiple `--tasks` were joined into a single space-separated string argument instead of being passed as separate arguments to `inspect eval`.
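
To illustrate the `--tasks` bug, here is a small sketch contrasting the two ways of building the subprocess argument list. The function names are hypothetical and the comma-splitting mirrors the `task_a,task_b` form shown in the test plan; the actual implementation in this PR may differ.

```python
def split_tasks(tasks_arg: str) -> list[str]:
    """Split a comma-separated --tasks value into individual task specs."""
    return [t.strip() for t in tasks_arg.split(",") if t.strip()]


def build_command_buggy(task_specs: list[str], model: str) -> list[str]:
    # Bug pattern: all specs collapse into ONE argv entry containing spaces.
    return ["inspect", "eval", " ".join(task_specs), "--model", model]


def build_command(task_specs: list[str], model: str) -> list[str]:
    # Intended behaviour: each spec becomes its own argv entry.
    return ["inspect", "eval", *task_specs, "--model", model]


specs = split_tasks("task_a,task_b")
buggy = build_command_buggy(specs, "openai/gpt-4o-mini")
fixed = build_command(specs, "openai/gpt-4o-mini")
print(buggy)
print(fixed)
```

When the list is passed to a subprocess without a shell, the buggy form hands `inspect eval` a single argument `"task_a task_b"`, whereas the fixed form hands it two separate task arguments.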

JackHopkins wants to merge 1 commit into main from fix/inspect-eval-absolute-paths
