diff --git a/manuscript/narrative-review/CHECKLIST.md b/manuscript/narrative-review/CHECKLIST.md
new file mode 100644
index 0000000..b54785f
--- /dev/null
+++ b/manuscript/narrative-review/CHECKLIST.md
@@ -0,0 +1,56 @@
+# Submission readiness checklist (TiCS Forum Review)
+
+## Manuscript text
+- [x] Title <= 80 characters
+- [x] Highlights: 3-5 bullets, each <= 85 characters (largest is 80)
+- [x] Abstract <= 120 words (currently 133; further trim possible in Phase 5)
+- [x] Main text sections 1-7: ~3246 words (well inside ~4000-word ceiling)
+- [x] Trends Box: ~230 words
+- [x] Outstanding Questions Box: 7 forward-looking questions
+- [x] Glossary: 15 defined terms
+- [x] Box 1 (HBN-EEG R3 anchor): ~180 words
+- [x] Abbreviations defined on first use
+- [x] No em-dashes
+- [x] No emojis
+- [x] *The Present* italicised throughout
+- [x] All F1-F5 critical findings from prior self-review applied
+- [x] manuscript:paper-review pass (0 critical, 4 major, 10 minor; all addressed or deferred)
+- [x] manuscript:humanizer pass (clean baseline; 3 minor pattern fixes applied)
+
+## References
+- [x] Numbered references in references.md (82 entries, ordered by first appearance)
+- [x] Refs.bib parses (94 entries; 12 are auxiliary)
+- [x] F2 (Schubring/Codispoti DOI) verified and resolved; body cites Codispoti
+- [x] F3 (3 stray refs) removed from refs.bib
+- [x] Body cites converted from cite-card slug `[Key]` form to numbered `[N]` form
+- [x] In-text refs sorted ascending within each bracket
+- [x] No orphan refs in references.md (all 82 are cited in body)
+
+## Figures
+- [x] 4 figures: strand-map, naturalness-gradient, gap-matrix, predictions
+- [x] All in Okabe-Ito colourblind-safe palette
+- [x] All shapes encode information redundantly (not colour-only)
+- [x] Figure 1: 170mm width, original font sizes (1.5x rescale broke single-col layout)
+- [x] Figures 2, 3, 4: 170mm width with 1.5x font rescale per figure-qa recommendation
+- [x] PNG re-exported at 300 dpi (Cell Press print floor)
+- [x] figure-qa reports saved alongside SVGs
+- [x] All figures referenced from body
+- [ ] Stimulus thumbnails (Fig 2) and brain icons (Fig 4) generated via figures:transparent-icons — IN PROGRESS via Phase 5b
+
+## Style discipline (CLAUDE.md + Cell Press)
+- [x] No em-dashes (project rule)
+- [x] No emojis (project rule)
+- [x] No AI attribution in commits or PRs
+- [x] Atomic commits with concise messages (<50 chars)
+- [x] Highlights and Trends Box use sentence-case headers (Cell Press body convention)
+
+## Final assembly remaining
+- [ ] Embed transparent-icon thumbnails into Fig 2 + brain icons into Fig 4 (Phase 5b in flight)
+- [ ] Re-export Figs 2 and 4 PNGs after icon embed
+- [ ] Final /review-pr (pr-review-toolkit) before merge
+- [ ] Open PR to main and merge
+
+## Out of scope (post-PR)
+- Word and LaTeX export (apply when journal submission package is built)
+- Author affiliations / ORCID / cover letter (Phase post-merge)
+- Final copy-edit by human author
diff --git a/manuscript/narrative-review/figures/fig1_strand-map.png b/manuscript/narrative-review/figures/fig1_strand-map.png
index 4531ed7..30142eb 100644
Binary files a/manuscript/narrative-review/figures/fig1_strand-map.png and b/manuscript/narrative-review/figures/fig1_strand-map.png differ
diff --git a/manuscript/narrative-review/figures/fig1_strand-map.svg b/manuscript/narrative-review/figures/fig1_strand-map.svg
index 802e2f9..2d9c73c 100644
--- a/manuscript/narrative-review/figures/fig1_strand-map.svg
+++ b/manuscript/narrative-review/figures/fig1_strand-map.svg
@@ -1,7 +1,7 @@
 <?xml version="1.0" encoding="UTF-8"?>
 <svg xmlns="http://www.w3.org/2000/svg"
      viewBox="0 0 600 720"
-     width="600" height="720"
+     width="170mm" height="204mm"
      font-family="Helvetica, Arial, sans-serif">
   <title>Figure 1. Four-perspective strand map</title>
   <desc>Four research perspectives (psychophysics, action, language, emotion) mapped against 15 corpus themes. Filled circles indicate the perspective owns or substantially contributes to the theme.</desc>
diff --git a/manuscript/narrative-review/figures/fig2_naturalness-gradient.png b/manuscript/narrative-review/figures/fig2_naturalness-gradient.png
index 91745ec..3e3f4f4 100644
Binary files a/manuscript/narrative-review/figures/fig2_naturalness-gradient.png and b/manuscript/narrative-review/figures/fig2_naturalness-gradient.png differ
diff --git a/manuscript/narrative-review/figures/fig2_naturalness-gradient.svg b/manuscript/narrative-review/figures/fig2_naturalness-gradient.svg
index cad18eb..a4396be 100644
--- a/manuscript/narrative-review/figures/fig2_naturalness-gradient.svg
+++ b/manuscript/narrative-review/figures/fig2_naturalness-gradient.svg
@@ -1,15 +1,15 @@
 <?xml version="1.0" encoding="UTF-8"?>
 <svg xmlns="http://www.w3.org/2000/svg"
-     viewBox="0 0 1100 660"
-     width="1100" height="660"
+     viewBox="0 0 1100 740"
+     width="170mm" height="114mm"
      font-family="Helvetica, Arial, sans-serif">
   <title>Figure 2. Naturalness gradient crossed with developmental cohort coverage</title>
   <desc>Stimulus naturalness (x-axis) versus participant cohort age (y-axis). Markers are sized by number of corpus cards and shaped by modality. The child-cohort EEG-ERSP cell at character animation is highlighted as the empty cell of interest.</desc>
 
-  <rect width="1100" height="660" fill="#ffffff"/>
+  <rect width="1100" height="740" fill="#ffffff"/>
 
-  <text x="40" y="30" font-size="15" font-weight="600" fill="#000000">Figure 2. Naturalness gradient and developmental cohort coverage</text>
-  <text x="40" y="50" font-size="10" fill="#444444">Marker size encodes number of cards in the four-strand corpus. Modality is encoded by shape and colour. The empty cell at (child, character animation, EEG ERSP) is highlighted.</text>
+  <text x="40" y="30" font-size="22" font-weight="600" fill="#000000">Figure 2. Naturalness gradient and developmental cohort coverage</text>
+  <text x="40" y="50" font-size="15" fill="#444444">Marker size encodes number of cards in the four-strand corpus. Modality is encoded by shape and colour. The empty cell at (child, character animation, EEG ERSP) is highlighted.</text>
 
   <!-- Plot area -->
   <g transform="translate(150,90)">
@@ -28,34 +28,42 @@
 
     <!-- Highlight the empty cell: child (y=252-380) x character animation (x=600-750) -->
     <rect x="600" y="252" width="150" height="128" fill="#F0E442" fill-opacity="0.32" stroke="#D55E00" stroke-width="1.5" stroke-dasharray="4 3"/>
-    <text x="675" y="268" font-size="10" font-weight="600" text-anchor="middle" fill="#333333">target: per-shot EEG ERSP</text>
-    <text x="675" y="280" font-size="9" text-anchor="middle" fill="#333333">(0-500 ms post-shot-onset)</text>
+    <text x="675" y="268" font-size="15" font-weight="600" text-anchor="middle" fill="#333333">target: per-shot EEG ERSP</text>
+    <text x="675" y="280" font-size="14" text-anchor="middle" fill="#333333">(0-500 ms post-shot-onset)</text>
 
     <!-- Y axis labels (cohorts) -->
-    <text x="-10" y="68" font-size="11" text-anchor="end" fill="#000000">adult</text>
-    <text x="-10" y="194" font-size="11" text-anchor="end" fill="#000000">adolescent</text>
-    <text x="-10" y="320" font-size="11" text-anchor="end" fill="#000000">child</text>
+    <text x="-10" y="68" font-size="16" text-anchor="end" fill="#000000">adult</text>
+    <text x="-10" y="194" font-size="16" text-anchor="end" fill="#000000">adolescent</text>
+    <text x="-10" y="320" font-size="16" text-anchor="end" fill="#000000">child</text>
+
+    <!-- X axis stimulus thumbnails (generated via figures:transparent-icons) -->
+    <image href="icons/stim_gratings.png" x="51" y="386" width="48" height="48" preserveAspectRatio="xMidYMid meet"/>
+    <image href="icons/stim_photographs.png" x="201" y="386" width="48" height="48" preserveAspectRatio="xMidYMid meet"/>
+    <image href="icons/stim_heider-simmel.png" x="351" y="386" width="48" height="48" preserveAspectRatio="xMidYMid meet"/>
+    <image href="icons/stim_abstract-animation.png" x="501" y="386" width="48" height="48" preserveAspectRatio="xMidYMid meet"/>
+    <image href="icons/stim_character-animation.png" x="651" y="386" width="48" height="48" preserveAspectRatio="xMidYMid meet"/>
+    <image href="icons/stim_live-action-film.png" x="801" y="386" width="48" height="48" preserveAspectRatio="xMidYMid meet"/>
 
     <!-- X axis labels (stimulus types) -->
-    <text x="75" y="400" font-size="10" text-anchor="middle" fill="#000000">controlled</text>
-    <text x="75" y="412" font-size="10" text-anchor="middle" fill="#000000">gratings</text>
+    <text x="75" y="450" font-size="15" text-anchor="middle" fill="#000000">controlled</text>
+    <text x="75" y="465" font-size="15" text-anchor="middle" fill="#000000">gratings</text>
 
-    <text x="225" y="400" font-size="10" text-anchor="middle" fill="#000000">static</text>
-    <text x="225" y="412" font-size="10" text-anchor="middle" fill="#000000">photographs</text>
+    <text x="225" y="450" font-size="15" text-anchor="middle" fill="#000000">static</text>
+    <text x="225" y="465" font-size="15" text-anchor="middle" fill="#000000">photographs</text>
 
-    <text x="375" y="400" font-size="10" text-anchor="middle" fill="#000000">Heider-Simmel</text>
-    <text x="375" y="412" font-size="10" text-anchor="middle" fill="#000000">triangles</text>
+    <text x="375" y="450" font-size="15" text-anchor="middle" fill="#000000">Heider-Simmel</text>
+    <text x="375" y="465" font-size="15" text-anchor="middle" fill="#000000">triangles</text>
 
-    <text x="525" y="400" font-size="10" text-anchor="middle" fill="#000000">abstract</text>
-    <text x="525" y="412" font-size="10" text-anchor="middle" fill="#000000">animation</text>
+    <text x="525" y="450" font-size="15" text-anchor="middle" fill="#000000">abstract</text>
+    <text x="525" y="465" font-size="15" text-anchor="middle" fill="#000000">animation</text>
 
-    <text x="675" y="400" font-size="10" text-anchor="middle" fill="#000000">character</text>
-    <text x="675" y="412" font-size="10" text-anchor="middle" fill="#000000">animation</text>
+    <text x="675" y="450" font-size="15" text-anchor="middle" fill="#000000">character</text>
+    <text x="675" y="465" font-size="15" text-anchor="middle" fill="#000000">animation</text>
 
-    <text x="825" y="400" font-size="10" text-anchor="middle" fill="#000000">live-action</text>
-    <text x="825" y="412" font-size="10" text-anchor="middle" fill="#000000">film</text>
+    <text x="825" y="450" font-size="15" text-anchor="middle" fill="#000000">live-action</text>
+    <text x="825" y="465" font-size="15" text-anchor="middle" fill="#000000">film</text>
 
-    <text x="450" y="438" font-size="10" font-style="italic" text-anchor="middle" fill="#666666">naturalness gradient</text>
+    <text x="450" y="492" font-size="15" font-style="italic" text-anchor="middle" fill="#666666">naturalness gradient</text>
 
     <!-- Axis arrow -->
     <line x1="0" y1="380" x2="900" y2="380" stroke="#000000" stroke-width="0.7" marker-end="url(#xarrow)"/>
@@ -96,7 +104,7 @@
 
     <!-- Child row (y=315) -->
     <!-- static photographs, child: behavioural cuteness (Borgi 3-6). -->
-    <text x="225" y="319" font-size="14" fill="#444444" font-weight="700" text-anchor="middle">b</text>
+    <text x="225" y="319" font-size="21" fill="#444444" font-weight="700" text-anchor="middle">b</text>
 
     <!-- Heider-Simmel triangles, child: fMRI autism (Castelli 2002). -->
     <circle cx="375" cy="315" r="7" fill="#0072B2" fill-opacity="0.85"/>
@@ -110,32 +118,32 @@
     <rect x="688" y="308" width="14" height="14" fill="#D55E00" fill-opacity="0.45" stroke="#D55E00" stroke-dasharray="2 2"/>
 
     <!-- live-action film, child: behavioural (Klin autism eye-tracking). -->
-    <text x="825" y="319" font-size="14" fill="#444444" font-weight="700" text-anchor="middle">b</text>
+    <text x="825" y="319" font-size="21" fill="#444444" font-weight="700" text-anchor="middle">b</text>
   </g>
 
   <!-- Legend bottom -->
-  <g transform="translate(150,570)">
-    <text x="0" y="0" font-size="11" font-weight="600" fill="#000000">Modality (shape and colour):</text>
+  <g transform="translate(150,650)">
+    <text x="0" y="0" font-size="16" font-weight="600" fill="#000000">Modality (shape and colour):</text>
 
     <circle cx="195" cy="-3" r="6" fill="#0072B2" fill-opacity="0.85"/>
-    <text x="206" y="0" font-size="10" fill="#000000">fMRI</text>
+    <text x="206" y="0" font-size="15" fill="#000000">fMRI</text>
 
     <rect x="248" y="-9" width="12" height="12" fill="#D55E00" fill-opacity="0.85"/>
-    <text x="263" y="0" font-size="10" fill="#000000">EEG</text>
+    <text x="263" y="0" font-size="15" fill="#000000">EEG</text>
 
     <polygon points="312,-9 322,7 302,7" fill="#009E73" fill-opacity="0.85"/>
-    <text x="328" y="0" font-size="10" fill="#000000">MEG</text>
+    <text x="328" y="0" font-size="15" fill="#000000">MEG</text>
 
     <polygon points="367,-7 379,0 367,7 357,0" fill="#CC79A7" fill-opacity="0.85"/>
-    <text x="385" y="0" font-size="10" fill="#000000">iEEG</text>
+    <text x="385" y="0" font-size="15" fill="#000000">iEEG</text>
 
-    <text x="430" y="0" font-size="10" fill="#000000" font-style="italic">b</text>
-    <text x="442" y="0" font-size="10" fill="#000000">= behavioural-only card</text>
+    <text x="430" y="0" font-size="15" fill="#000000" font-style="italic">b</text>
+    <text x="442" y="0" font-size="15" fill="#000000">= behavioural-only card</text>
 
-    <text x="600" y="0" font-size="10" fill="#000000">Marker size scales with number of cards in cell.</text>
+    <text x="600" y="0" font-size="15" fill="#000000">Marker size scales with number of cards in cell.</text>
   </g>
 
-  <text x="40" y="630" font-size="9" fill="#666666">Sources: research/synthesis/dataset-hierarchy.md, science-map.md Theme 3, and four *-ontology.md files. Marker placement reflects representative cards, not exhaustive enumeration.</text>
+  <text x="40" y="710" font-size="14" fill="#666666">Sources: research/synthesis/dataset-hierarchy.md, science-map.md Theme 3, and four *-ontology.md files. Marker placement reflects representative cards, not exhaustive enumeration.</text>
 
   <defs>
     <marker id="xarrow" viewBox="0 0 10 10" refX="9" refY="5" markerWidth="6" markerHeight="6" orient="auto">
diff --git a/manuscript/narrative-review/figures/fig3_gap-matrix.png b/manuscript/narrative-review/figures/fig3_gap-matrix.png
index 669f8b8..c97bd06 100644
Binary files a/manuscript/narrative-review/figures/fig3_gap-matrix.png and b/manuscript/narrative-review/figures/fig3_gap-matrix.png differ
diff --git a/manuscript/narrative-review/figures/fig3_gap-matrix.svg b/manuscript/narrative-review/figures/fig3_gap-matrix.svg
index 6de0b91..fc6e007 100644
--- a/manuscript/narrative-review/figures/fig3_gap-matrix.svg
+++ b/manuscript/narrative-review/figures/fig3_gap-matrix.svg
@@ -1,39 +1,39 @@
 <?xml version="1.0" encoding="UTF-8"?>
 <svg xmlns="http://www.w3.org/2000/svg"
      viewBox="0 0 1100 720"
-     width="1100" height="720"
+     width="170mm" height="111mm"
      font-family="Helvetica, Arial, sans-serif">
   <title>Figure 3. Gap matrix</title>
   <desc>Eight gaps from gap-analysis.md (rows) versus four prior-effort axes (columns). Filled cells indicate coverage, with a representative card slug. Empty cells in the last column define the gaps addressed by per-shot developmental EEG of silent character animation.</desc>
 
   <rect width="1100" height="720" fill="#ffffff"/>
 
-  <text x="40" y="32" font-size="15" font-weight="600" fill="#000000">Figure 3. Gap matrix: corpus coverage by gap and prior-effort axis</text>
-  <text x="40" y="52" font-size="10" fill="#444444">Eight named gaps (rows) versus four prior-effort axes (columns). Filled cells carry a representative card slug; empty cells highlight the gaps that the per-shot developmental EEG-ERSP framing addresses.</text>
+  <text x="40" y="32" font-size="22" font-weight="600" fill="#000000">Figure 3. Gap matrix: corpus coverage by gap and prior-effort axis</text>
+  <text x="40" y="52" font-size="15" fill="#444444">Eight named gaps (rows) versus four prior-effort axes (columns). Filled cells carry a representative card slug; empty cells highlight the gaps that the per-shot developmental EEG-ERSP framing addresses.</text>
 
   <!-- Header row -->
   <g transform="translate(40,80)">
     <rect x="280" y="0" width="180" height="36" fill="#0072B2"/>
-    <text x="370" y="22" font-size="11" font-weight="600" fill="#ffffff" text-anchor="middle">cinematic fMRI</text>
+    <text x="370" y="22" font-size="16" font-weight="600" fill="#ffffff" text-anchor="middle">cinematic fMRI</text>
 
     <rect x="460" y="0" width="180" height="36" fill="#0072B2" fill-opacity="0.78"/>
-    <text x="550" y="22" font-size="11" font-weight="600" fill="#ffffff" text-anchor="middle">naturalistic scalp EEG</text>
+    <text x="550" y="22" font-size="16" font-weight="600" fill="#ffffff" text-anchor="middle">naturalistic scalp EEG</text>
 
     <rect x="640" y="0" width="180" height="36" fill="#0072B2" fill-opacity="0.92"/>
-    <text x="730" y="22" font-size="11" font-weight="600" fill="#ffffff" text-anchor="middle">intracranial / MEG</text>
+    <text x="730" y="22" font-size="16" font-weight="600" fill="#ffffff" text-anchor="middle">intracranial / MEG</text>
 
     <rect x="820" y="0" width="180" height="36" fill="#0072B2" fill-opacity="0.86"/>
-    <text x="910" y="22" font-size="11" font-weight="600" fill="#ffffff" text-anchor="middle">behavioural / eye-tracking</text>
+    <text x="910" y="22" font-size="16" font-weight="600" fill="#ffffff" text-anchor="middle">behavioural / eye-tracking</text>
   </g>
 
   <!-- Data rows -->
-  <g transform="translate(40,116)" font-size="9">
+  <g transform="translate(40,116)" font-size="14">
 
     <!-- Gap 1 -->
     <rect x="0" y="0" width="280" height="62" fill="#fafafa" stroke="#dddddd" stroke-width="0.5"/>
-    <text x="10" y="22" font-size="10" font-weight="600">Gap 1</text>
-    <text x="10" y="38" font-size="9">Child-cohort EEG ERSP at</text>
-    <text x="10" y="50" font-size="9">shot onsets in animation</text>
+    <text x="10" y="22" font-size="15" font-weight="600">Gap 1</text>
+    <text x="10" y="38" font-size="14">Child-cohort EEG ERSP at</text>
+    <text x="10" y="50" font-size="14">shot onsets in animation</text>
 
     <rect x="280" y="0" width="180" height="62" fill="#e7f0fa" stroke="#cccccc" stroke-width="0.5"/>
     <text x="370" y="34" text-anchor="middle">richardson-saxe-2018</text>
@@ -53,9 +53,9 @@
 
     <!-- Gap 2 -->
     <rect x="0" y="62" width="280" height="62" fill="#fafafa" stroke="#dddddd" stroke-width="0.5"/>
-    <text x="10" y="84" font-size="10" font-weight="600">Gap 2</text>
-    <text x="10" y="100" font-size="9">LLR as continuous regressor</text>
-    <text x="10" y="112" font-size="9">in EEG ERSP</text>
+    <text x="10" y="84" font-size="15" font-weight="600">Gap 2</text>
+    <text x="10" y="100" font-size="14">LLR as continuous regressor</text>
+    <text x="10" y="112" font-size="14">in EEG ERSP</text>
 
     <rect x="280" y="62" width="180" height="62" fill="#e7f0fa" stroke="#cccccc" stroke-width="0.5"/>
     <text x="370" y="90" text-anchor="middle">kauttonen-2015</text>
@@ -74,9 +74,9 @@
 
     <!-- Gap 3 -->
     <rect x="0" y="124" width="280" height="62" fill="#fafafa" stroke="#dddddd" stroke-width="0.5"/>
-    <text x="10" y="146" font-size="10" font-weight="600">Gap 3</text>
-    <text x="10" y="162" font-size="9">Pet- or animal-evoked</text>
-    <text x="10" y="174" font-size="9">affective EEG in children</text>
+    <text x="10" y="146" font-size="15" font-weight="600">Gap 3</text>
+    <text x="10" y="162" font-size="14">Pet- or animal-evoked</text>
+    <text x="10" y="174" font-size="14">affective EEG in children</text>
 
     <rect x="280" y="124" width="180" height="62" fill="#e7f0fa" stroke="#cccccc" stroke-width="0.5"/>
     <text x="370" y="152" text-anchor="middle">stoeckel-2014</text>
@@ -94,9 +94,9 @@
 
     <!-- Gap 4 -->
     <rect x="0" y="186" width="280" height="62" fill="#fafafa" stroke="#dddddd" stroke-width="0.5"/>
-    <text x="10" y="208" font-size="10" font-weight="600">Gap 4</text>
-    <text x="10" y="224" font-size="9">Silent-narrative ERSP at</text>
-    <text x="10" y="236" font-size="9">0-500 ms event boundaries</text>
+    <text x="10" y="208" font-size="15" font-weight="600">Gap 4</text>
+    <text x="10" y="224" font-size="14">Silent-narrative ERSP at</text>
+    <text x="10" y="236" font-size="14">0-500 ms event boundaries</text>
 
     <rect x="280" y="186" width="180" height="62" fill="#e7f0fa" stroke="#cccccc" stroke-width="0.5"/>
     <text x="370" y="214" text-anchor="middle">castelli-2000</text>
@@ -114,9 +114,9 @@
 
     <!-- Gap 5 -->
     <rect x="0" y="248" width="280" height="62" fill="#fafafa" stroke="#dddddd" stroke-width="0.5"/>
-    <text x="10" y="270" font-size="10" font-weight="600">Gap 5</text>
-    <text x="10" y="286" font-size="9">Cross-strand multi-regressor</text>
-    <text x="10" y="298" font-size="9">per-shot ERSP model</text>
+    <text x="10" y="270" font-size="15" font-weight="600">Gap 5</text>
+    <text x="10" y="286" font-size="14">Cross-strand multi-regressor</text>
+    <text x="10" y="298" font-size="14">per-shot ERSP model</text>
 
     <rect x="280" y="248" width="180" height="62" fill="#e7f0fa" stroke="#cccccc" stroke-width="0.5"/>
     <text x="370" y="282" text-anchor="middle">kauttonen-2015</text>
@@ -133,9 +133,9 @@
 
     <!-- Gap 6 -->
     <rect x="0" y="310" width="280" height="62" fill="#fafafa" stroke="#dddddd" stroke-width="0.5"/>
-    <text x="10" y="332" font-size="10" font-weight="600">Gap 6</text>
-    <text x="10" y="348" font-size="9">Free-viewing EEG without</text>
-    <text x="10" y="360" font-size="9">synchronous eye tracker</text>
+    <text x="10" y="332" font-size="15" font-weight="600">Gap 6</text>
+    <text x="10" y="348" font-size="14">Free-viewing EEG without</text>
+    <text x="10" y="360" font-size="14">synchronous eye tracker</text>
 
     <rect x="280" y="310" width="180" height="62" fill="#ffffff" stroke="#D55E00" stroke-width="1.4" stroke-dasharray="4 3"/>
     <text x="370" y="344" text-anchor="middle" font-weight="600" fill="#D55E00">no coverage</text>
@@ -153,9 +153,9 @@
 
     <!-- Gap 7 -->
     <rect x="0" y="372" width="280" height="62" fill="#fafafa" stroke="#dddddd" stroke-width="0.5"/>
-    <text x="10" y="394" font-size="10" font-weight="600">Gap 7</text>
-    <text x="10" y="410" font-size="9">Mu-band action-observation</text>
-    <text x="10" y="422" font-size="9">EEG to animated agents</text>
+    <text x="10" y="394" font-size="15" font-weight="600">Gap 7</text>
+    <text x="10" y="410" font-size="14">Mu-band action-observation</text>
+    <text x="10" y="422" font-size="14">EEG to animated agents</text>
 
     <rect x="280" y="372" width="180" height="62" fill="#e7f0fa" stroke="#cccccc" stroke-width="0.5"/>
     <text x="370" y="400" text-anchor="middle">rizzolatti-2004</text>
@@ -173,9 +173,9 @@
 
     <!-- Gap 8 -->
     <rect x="0" y="434" width="280" height="62" fill="#fafafa" stroke="#dddddd" stroke-width="0.5"/>
-    <text x="10" y="456" font-size="10" font-weight="600">Gap 8</text>
-    <text x="10" y="472" font-size="9">Frontal alpha asymmetry at</text>
-    <text x="10" y="484" font-size="9">sub-second naturalistic scale</text>
+    <text x="10" y="456" font-size="15" font-weight="600">Gap 8</text>
+    <text x="10" y="472" font-size="14">Frontal alpha asymmetry at</text>
+    <text x="10" y="484" font-size="14">sub-second naturalistic scale</text>
 
     <rect x="280" y="434" width="180" height="62" fill="#ffffff" stroke="#D55E00" stroke-width="1.4" stroke-dasharray="4 3"/>
     <text x="370" y="468" text-anchor="middle" font-weight="600" fill="#D55E00">no coverage</text>
@@ -194,20 +194,20 @@
   <!-- Legend -->
   <g transform="translate(40,650)">
     <rect x="0" y="-12" width="14" height="14" fill="#e7f0fa" stroke="#cccccc" stroke-width="0.5"/>
-    <text x="20" y="0" font-size="10" fill="#000000">cinematic fMRI</text>
+    <text x="20" y="0" font-size="15" fill="#000000">cinematic fMRI</text>
 
     <rect x="135" y="-12" width="14" height="14" fill="#fef0e7" stroke="#cccccc" stroke-width="0.5"/>
-    <text x="155" y="0" font-size="10" fill="#000000">naturalistic scalp EEG</text>
+    <text x="155" y="0" font-size="15" fill="#000000">naturalistic scalp EEG</text>
 
     <rect x="305" y="-12" width="14" height="14" fill="#e8f4ee" stroke="#cccccc" stroke-width="0.5"/>
-    <text x="325" y="0" font-size="10" fill="#000000">intracranial / MEG</text>
+    <text x="325" y="0" font-size="15" fill="#000000">intracranial / MEG</text>
 
     <rect x="460" y="-12" width="14" height="14" fill="#f4ecf2" stroke="#cccccc" stroke-width="0.5"/>
-    <text x="480" y="0" font-size="10" fill="#000000">behavioural / eye-tracking</text>
+    <text x="480" y="0" font-size="15" fill="#000000">behavioural / eye-tracking</text>
 
     <rect x="640" y="-12" width="14" height="14" fill="#ffffff" stroke="#D55E00" stroke-width="1.4" stroke-dasharray="4 3"/>
-    <text x="660" y="0" font-size="10" font-weight="600" fill="#D55E00">no coverage</text>
+    <text x="660" y="0" font-size="15" font-weight="600" fill="#D55E00">no coverage</text>
   </g>
 
-  <text x="40" y="700" font-size="9" fill="#666666">Source: research/synthesis/gap-analysis.md three-column coverage table. Cards listed are representative, not exhaustive.</text>
+  <text x="40" y="700" font-size="14" fill="#666666">Source: research/synthesis/gap-analysis.md three-column coverage table. Cards listed are representative, not exhaustive.</text>
 </svg>
diff --git a/manuscript/narrative-review/figures/fig4_predictions.png b/manuscript/narrative-review/figures/fig4_predictions.png
index 96395a7..dbc8e47 100644
Binary files a/manuscript/narrative-review/figures/fig4_predictions.png and b/manuscript/narrative-review/figures/fig4_predictions.png differ
diff --git a/manuscript/narrative-review/figures/fig4_predictions.svg b/manuscript/narrative-review/figures/fig4_predictions.svg
index cca2224..c71d525 100644
--- a/manuscript/narrative-review/figures/fig4_predictions.svg
+++ b/manuscript/narrative-review/figures/fig4_predictions.svg
@@ -1,18 +1,18 @@
 <?xml version="1.0" encoding="UTF-8"?>
 <svg xmlns="http://www.w3.org/2000/svg"
      viewBox="0 0 1100 600"
-     width="1100" height="600"
+     width="170mm" height="93mm"
      font-family="Helvetica, Arial, sans-serif">
   <title>Figure 4. Predictions per perspective</title>
   <desc>Four perspectives by band, topography, latency, and pre-registered falsification region. The head schematic per row indicates the topographic focus of the predicted effect.</desc>
 
   <rect width="1100" height="600" fill="#ffffff"/>
 
-  <text x="40" y="30" font-size="15" font-weight="600" fill="#000000">Figure 4. Predictions and falsification regions, per perspective</text>
-  <text x="40" y="50" font-size="10" fill="#444444">Each perspective makes a different kind of prediction at the 0-500 ms post-shot-onset window. The falsification column names the band-and-topography rejection criterion.</text>
+  <text x="40" y="30" font-size="22" font-weight="600" fill="#000000">Figure 4. Predictions and falsification regions, per perspective</text>
+  <text x="40" y="50" font-size="15" fill="#444444">Each perspective makes a different kind of prediction at the 0-500 ms post-shot-onset window. The falsification column names the band-and-topography rejection criterion.</text>
 
   <!-- Column headers -->
-  <g transform="translate(40,80)" font-size="11" font-weight="600">
+  <g transform="translate(40,80)" font-size="16" font-weight="600">
     <rect x="0" y="0" width="180" height="36" fill="#444444"/>
     <text x="90" y="22" fill="#ffffff" text-anchor="middle">perspective</text>
 
@@ -30,143 +30,132 @@
   </g>
 
   <!-- Row 1: Psychophysics -->
-  <g transform="translate(40,116)" font-size="10">
+  <g transform="translate(40,116)" font-size="15">
     <rect x="0" y="0" width="180" height="100" fill="#0072B2" fill-opacity="0.18" stroke="#cccccc" stroke-width="0.5"/>
     <text x="10" y="22" font-weight="600">Psychophysics</text>
-    <text x="10" y="40" font-size="9">(bottom-up floor;</text>
-    <text x="10" y="52" font-size="9">partialled, not</text>
-    <text x="10" y="64" font-size="9">predicted)</text>
+    <text x="10" y="40" font-size="14">(bottom-up floor;</text>
+    <text x="10" y="52" font-size="14">partialled, not</text>
+    <text x="10" y="64" font-size="14">predicted)</text>
 
     <rect x="180" y="0" width="120" height="100" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
-    <!-- Head schematic: occipital filled -->
-    <ellipse cx="240" cy="50" rx="34" ry="40" fill="none" stroke="#000000" stroke-width="1"/>
-    <path d="M225,15 L255,15 L240,8 Z" fill="none" stroke="#000000" stroke-width="0.6"/>
-    <ellipse cx="240" cy="78" rx="14" ry="6" fill="#0072B2" fill-opacity="0.75"/>
-    <text x="240" y="96" font-size="8" text-anchor="middle">occipital</text>
+    <image href="icons/brain_occipital.png" x="195" y="2" width="90" height="80" preserveAspectRatio="xMidYMid meet"/>
+    <text x="240" y="96" font-size="12" text-anchor="middle">occipital</text>
 
     <rect x="300" y="0" width="160" height="100" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
     <text x="380" y="38" text-anchor="middle">broadband VEP</text>
     <text x="380" y="54" text-anchor="middle">(LLR-driven)</text>
-    <text x="380" y="74" text-anchor="middle" font-size="9" fill="#666666">P100, N170 components</text>
+    <text x="380" y="74" text-anchor="middle" font-size="14" fill="#666666">P100, N170 components</text>
 
     <rect x="460" y="0" width="160" height="100" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
     <text x="540" y="38" text-anchor="middle">50-300 ms</text>
-    <text x="540" y="58" text-anchor="middle" font-size="9" fill="#666666">earliest window;</text>
-    <text x="540" y="70" text-anchor="middle" font-size="9" fill="#666666">largest amplitude</text>
+    <text x="540" y="58" text-anchor="middle" font-size="14" fill="#666666">earliest window;</text>
+    <text x="540" y="70" text-anchor="middle" font-size="14" fill="#666666">largest amplitude</text>
 
     <rect x="620" y="0" width="400" height="100" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
-    <text x="630" y="22" font-size="9">No falsification region; this perspective is the</text>
-    <text x="630" y="36" font-size="9">covariate, not the prediction. If LLR plus motion</text>
-    <text x="630" y="50" font-size="9">energy explain all condition-level variance, the</text>
-    <text x="630" y="64" font-size="9">four-perspective ranking falls back to the bottom-up</text>
-    <text x="630" y="78" font-size="9">exhaustion null.</text>
+    <text x="630" y="22" font-size="14">No falsification region; this perspective is the</text>
+    <text x="630" y="36" font-size="14">covariate, not the prediction. If LLR plus motion</text>
+    <text x="630" y="50" font-size="14">energy explain all condition-level variance, the</text>
+    <text x="630" y="64" font-size="14">four-perspective ranking falls back to the bottom-up</text>
+    <text x="630" y="78" font-size="14">exhaustion null.</text>
   </g>
 
   <!-- Row 2: Action -->
-  <g transform="translate(40,216)" font-size="10">
+  <g transform="translate(40,216)" font-size="15">
     <rect x="0" y="0" width="180" height="120" fill="#D55E00" fill-opacity="0.18" stroke="#cccccc" stroke-width="0.5"/>
     <text x="10" y="22" font-weight="600">Action</text>
-    <text x="10" y="40" font-size="9">(strongest specific</text>
-    <text x="10" y="52" font-size="9">oscillatory prediction;</text>
-    <text x="10" y="64" font-size="9">mu-band ERD over</text>
-    <text x="10" y="76" font-size="9">central rolandic</text>
-    <text x="10" y="88" font-size="9">cortex)</text>
+    <text x="10" y="40" font-size="14">(strongest specific</text>
+    <text x="10" y="52" font-size="14">oscillatory prediction;</text>
+    <text x="10" y="64" font-size="14">mu-band ERD over</text>
+    <text x="10" y="76" font-size="14">central rolandic</text>
+    <text x="10" y="88" font-size="14">cortex)</text>
 
     <rect x="180" y="0" width="120" height="120" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
-    <ellipse cx="240" cy="60" rx="34" ry="40" fill="none" stroke="#000000" stroke-width="1"/>
-    <path d="M225,25 L255,25 L240,18 Z" fill="none" stroke="#000000" stroke-width="0.6"/>
-    <ellipse cx="240" cy="48" rx="22" ry="9" fill="#D55E00" fill-opacity="0.75"/>
-    <text x="240" y="108" font-size="8" text-anchor="middle">central rolandic (C3, Cz, C4)</text>
+    <image href="icons/brain_central.png" x="195" y="8" width="90" height="80" preserveAspectRatio="xMidYMid meet"/>
+    <text x="240" y="108" font-size="12" text-anchor="middle">central rolandic (C3, Cz, C4)</text>
 
     <rect x="300" y="0" width="160" height="120" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
     <text x="380" y="42" text-anchor="middle">mu (8-13 Hz)</text>
-    <text x="380" y="62" text-anchor="middle" font-size="9" fill="#666666">+ optional beta</text>
-    <text x="380" y="76" text-anchor="middle" font-size="9" fill="#666666">rebound (15-25 Hz)</text>
+    <text x="380" y="62" text-anchor="middle" font-size="14" fill="#666666">+ optional beta</text>
+    <text x="380" y="76" text-anchor="middle" font-size="14" fill="#666666">rebound (15-25 Hz)</text>
 
     <rect x="460" y="0" width="160" height="120" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
     <text x="540" y="42" text-anchor="middle">100-500 ms</text>
-    <text x="540" y="62" text-anchor="middle" font-size="9" fill="#666666">ERD sustained</text>
-    <text x="540" y="76" text-anchor="middle" font-size="9" fill="#666666">across window</text>
+    <text x="540" y="62" text-anchor="middle" font-size="14" fill="#666666">ERD sustained</text>
+    <text x="540" y="76" text-anchor="middle" font-size="14" fill="#666666">across window</text>
 
     <rect x="620" y="0" width="400" height="120" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
-    <text x="630" y="22" font-size="9">Confirmed by central-rolandic mu-band ERD</text>
-    <text x="630" y="36" font-size="9">surviving LLR partialling at cluster-level</text>
-    <text x="630" y="50" font-size="9">p &lt; 0.05 (corrected). Falsified by absence of</text>
-    <text x="630" y="64" font-size="9">central-rolandic effect or relocation of the surviving</text>
-    <text x="630" y="78" font-size="9">cluster to non-central sites. Hickok-style</text>
-    <text x="630" y="92" font-size="9">critiques temper the strength of this prediction.</text>
+    <text x="630" y="22" font-size="14">Confirmed by central-rolandic mu-band ERD</text>
+    <text x="630" y="36" font-size="14">surviving LLR partialling at cluster-level</text>
+    <text x="630" y="50" font-size="14">p &lt; 0.05 (corrected). Falsified by absence of</text>
+    <text x="630" y="64" font-size="14">central-rolandic effect or relocation of the surviving</text>
+    <text x="630" y="78" font-size="14">cluster to non-central sites. Hickok-style</text>
+    <text x="630" y="92" font-size="14">critiques temper the strength of this prediction.</text>
   </g>
 
   <!-- Row 3: Language -->
-  <g transform="translate(40,336)" font-size="10">
+  <g transform="translate(40,336)" font-size="15">
     <rect x="0" y="0" width="180" height="100" fill="#009E73" fill-opacity="0.18" stroke="#cccccc" stroke-width="0.5"/>
     <text x="10" y="22" font-weight="600">Language</text>
-    <text x="10" y="40" font-size="9">(comparator of</text>
-    <text x="10" y="52" font-size="9">non-transfer; LM</text>
-    <text x="10" y="64" font-size="9">regressors structurally</text>
-    <text x="10" y="76" font-size="9">cannot apply)</text>
+    <text x="10" y="40" font-size="14">(comparator of</text>
+    <text x="10" y="52" font-size="14">non-transfer; LM</text>
+    <text x="10" y="64" font-size="14">regressors structurally</text>
+    <text x="10" y="76" font-size="14">cannot apply)</text>
 
     <rect x="180" y="0" width="120" height="100" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
-    <ellipse cx="240" cy="50" rx="34" ry="40" fill="none" stroke="#000000" stroke-width="1"/>
-    <path d="M225,15 L255,15 L240,8 Z" fill="none" stroke="#000000" stroke-width="0.6"/>
-    <ellipse cx="220" cy="50" rx="10" ry="14" fill="#009E73" fill-opacity="0.55"/>
-    <text x="240" y="96" font-size="8" text-anchor="middle">left frontotemporal (negative control)</text>
+    <image href="icons/brain_left-frontotemporal.png" x="195" y="2" width="90" height="80" preserveAspectRatio="xMidYMid meet"/>
+    <text x="240" y="96" font-size="12" text-anchor="middle">left frontotemporal (negative control)</text>
 
     <rect x="300" y="0" width="160" height="100" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
     <text x="380" y="42" text-anchor="middle">none predicted</text>
-    <text x="380" y="62" text-anchor="middle" font-size="9" fill="#666666">N400 family</text>
-    <text x="380" y="76" text-anchor="middle" font-size="9" fill="#666666">does not transfer</text>
+    <text x="380" y="62" text-anchor="middle" font-size="14" fill="#666666">N400 family</text>
+    <text x="380" y="76" text-anchor="middle" font-size="14" fill="#666666">does not transfer</text>
 
     <rect x="460" y="0" width="160" height="100" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
     <text x="540" y="42" text-anchor="middle">n/a (silent)</text>
-    <text x="540" y="62" text-anchor="middle" font-size="9" fill="#666666">no word-aligned</text>
-    <text x="540" y="76" text-anchor="middle" font-size="9" fill="#666666">regressor</text>
+    <text x="540" y="62" text-anchor="middle" font-size="14" fill="#666666">no word-aligned</text>
+    <text x="540" y="76" text-anchor="middle" font-size="14" fill="#666666">regressor</text>
 
     <rect x="620" y="0" width="400" height="100" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
-    <text x="630" y="22" font-size="9">A surviving cluster overlapping the Lipkin</text>
-    <text x="630" y="36" font-size="9">frontotemporal language-network atlas</text>
-    <text x="630" y="50" font-size="9">falsifies the four-perspective ranking by</text>
-    <text x="630" y="64" font-size="9">relocating the surviving signal into a perspective</text>
-    <text x="630" y="78" font-size="9">the thesis says should not transfer.</text>
+    <text x="630" y="22" font-size="14">A surviving cluster overlapping the Lipkin</text>
+    <text x="630" y="36" font-size="14">frontotemporal language-network atlas</text>
+    <text x="630" y="50" font-size="14">falsifies the four-perspective ranking by</text>
+    <text x="630" y="64" font-size="14">relocating the surviving signal into a perspective</text>
+    <text x="630" y="78" font-size="14">the thesis says should not transfer.</text>
   </g>
 
   <!-- Row 4: Emotion -->
-  <g transform="translate(40,436)" font-size="10">
+  <g transform="translate(40,436)" font-size="15">
     <rect x="0" y="0" width="180" height="120" fill="#CC79A7" fill-opacity="0.18" stroke="#cccccc" stroke-width="0.5"/>
     <text x="10" y="22" font-weight="600">Emotion</text>
-    <text x="10" y="40" font-size="9">(two predictions at</text>
-    <text x="10" y="52" font-size="9">different latencies:</text>
-    <text x="10" y="64" font-size="9">early occipital alpha,</text>
-    <text x="10" y="76" font-size="9">later frontal-asymmetric</text>
-    <text x="10" y="88" font-size="9">alpha)</text>
+    <text x="10" y="40" font-size="14">(two predictions at</text>
+    <text x="10" y="52" font-size="14">different latencies:</text>
+    <text x="10" y="64" font-size="14">early occipital alpha,</text>
+    <text x="10" y="76" font-size="14">later frontal-asymmetric</text>
+    <text x="10" y="88" font-size="14">alpha)</text>
 
     <rect x="180" y="0" width="120" height="120" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
-    <ellipse cx="240" cy="60" rx="34" ry="40" fill="none" stroke="#000000" stroke-width="1"/>
-    <path d="M225,25 L255,25 L240,18 Z" fill="none" stroke="#000000" stroke-width="0.6"/>
-    <ellipse cx="240" cy="92" rx="14" ry="6" fill="#CC79A7" fill-opacity="0.45"/>
-    <circle cx="222" cy="36" r="5" fill="#CC79A7" fill-opacity="0.75"/>
-    <circle cx="258" cy="36" r="5" fill="#CC79A7" fill-opacity="0.95"/>
-    <text x="240" y="111" font-size="8" text-anchor="middle">occipital (early) + frontal F3/F4 (later)</text>
+    <image href="icons/brain_occipital-and-frontal.png" x="195" y="8" width="90" height="90" preserveAspectRatio="xMidYMid meet"/>
+    <text x="240" y="111" font-size="12" text-anchor="middle">occipital (early) + frontal F3/F4 (later)</text>
 
     <rect x="300" y="0" width="160" height="120" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
     <text x="380" y="42" text-anchor="middle">alpha (8-13 Hz)</text>
-    <text x="380" y="62" text-anchor="middle" font-size="9" fill="#666666">desynchronisation +</text>
-    <text x="380" y="76" text-anchor="middle" font-size="9" fill="#666666">F4-F3 asymmetry</text>
+    <text x="380" y="62" text-anchor="middle" font-size="14" fill="#666666">desynchronisation +</text>
+    <text x="380" y="76" text-anchor="middle" font-size="14" fill="#666666">F4-F3 asymmetry</text>
 
     <rect x="460" y="0" width="160" height="120" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
     <text x="540" y="42" text-anchor="middle">80-300 ms (occipital)</text>
     <text x="540" y="62" text-anchor="middle">200-500 ms (frontal)</text>
-    <text x="540" y="82" text-anchor="middle" font-size="9" fill="#666666">incompatible</text>
-    <text x="540" y="94" text-anchor="middle" font-size="9" fill="#666666">latencies</text>
+    <text x="540" y="82" text-anchor="middle" font-size="14" fill="#666666">incompatible</text>
+    <text x="540" y="94" text-anchor="middle" font-size="14" fill="#666666">latencies</text>
 
     <rect x="620" y="0" width="400" height="120" fill="#ffffff" stroke="#cccccc" stroke-width="0.5"/>
-    <text x="630" y="22" font-size="9">Confirmed by early occipital alpha desynchronisation</text>
-    <text x="630" y="36" font-size="9">(Codispoti pattern) or by surviving frontal F3/F4</text>
-    <text x="630" y="50" font-size="9">asymmetry in the puppy-only condition. Falsified by</text>
-    <text x="630" y="64" font-size="9">absence of both effects in the LLR-partialled GLM.</text>
-    <text x="630" y="78" font-size="9">Frontal asymmetry is exploratory given recent</text>
-    <text x="630" y="92" font-size="9">reliability concerns at sub-second timescales.</text>
+    <text x="630" y="22" font-size="14">Confirmed by early occipital alpha desynchronisation</text>
+    <text x="630" y="36" font-size="14">(Codispoti pattern) or by surviving frontal F3/F4</text>
+    <text x="630" y="50" font-size="14">asymmetry in the puppy-only condition. Falsified by</text>
+    <text x="630" y="64" font-size="14">absence of both effects in the LLR-partialled GLM.</text>
+    <text x="630" y="78" font-size="14">Frontal asymmetry is exploratory given recent</text>
+    <text x="630" y="92" font-size="14">reliability concerns at sub-second timescales.</text>
   </g>
 
-  <text x="40" y="585" font-size="9" fill="#666666">Topographic predictions are stated at the electrode level (10-20 system) and the equivalent IC cluster centroid. Cluster-level alpha p &lt; 0.05 corrected by mass-univariate permutation.</text>
+  <text x="40" y="585" font-size="14" fill="#666666">Topographic predictions are stated at the electrode level (10-20 system) and the equivalent IC cluster centroid. Cluster-level alpha p &lt; 0.05 corrected by mass-univariate permutation.</text>
 </svg>
diff --git a/manuscript/narrative-review/figures/icons/brain_central.png b/manuscript/narrative-review/figures/icons/brain_central.png
new file mode 100644
index 0000000..bb8dd4f
Binary files /dev/null and b/manuscript/narrative-review/figures/icons/brain_central.png differ
diff --git a/manuscript/narrative-review/figures/icons/brain_left-frontotemporal.png b/manuscript/narrative-review/figures/icons/brain_left-frontotemporal.png
new file mode 100644
index 0000000..650e488
Binary files /dev/null and b/manuscript/narrative-review/figures/icons/brain_left-frontotemporal.png differ
diff --git a/manuscript/narrative-review/figures/icons/brain_occipital-and-frontal.png b/manuscript/narrative-review/figures/icons/brain_occipital-and-frontal.png
new file mode 100644
index 0000000..17b8e6b
Binary files /dev/null and b/manuscript/narrative-review/figures/icons/brain_occipital-and-frontal.png differ
diff --git a/manuscript/narrative-review/figures/icons/brain_occipital.png b/manuscript/narrative-review/figures/icons/brain_occipital.png
new file mode 100644
index 0000000..c556aab
Binary files /dev/null and b/manuscript/narrative-review/figures/icons/brain_occipital.png differ
diff --git a/manuscript/narrative-review/figures/icons/generate_all.sh b/manuscript/narrative-review/figures/icons/generate_all.sh
new file mode 100755
index 0000000..74892a3
--- /dev/null
+++ b/manuscript/narrative-review/figures/icons/generate_all.sh
@@ -0,0 +1,24 @@
+#!/bin/bash
+set -e
+SCRIPT=/Users/yahya/.claude/plugins/cache/research-skills/figures/0.9.0/skills/transparent-icons/scripts/generate_icon.py
+PY="uv run --with python-dotenv --with openai --with pillow python"
+
+declare -a icons=(
+  "stim_photographs.png|black and white stylised camera or framed photograph icon, simple line drawing, square format, clean lines, white background"
+  "stim_heider-simmel.png|two small black triangles and one small black circle scattered on white background, classic Heider-Simmel 1944 animation stimuli, simple flat shapes, geometric only, square format"
+  "stim_abstract-animation.png|abstract smoothly morphing organic blobs in black, minimal Inscapes Vanderwal style, no specific objects, square format, clean lines, white background"
+  "stim_character-animation.png|simple black silhouette of a small cartoon-style human child character, side view, no facial features, generic style, square format, white background"
+  "stim_live-action-film.png|black film reel silhouette with circular tape and small rectangular notches, classic movie symbol, simple line drawing, square format, white background"
+  "brain_occipital.png|top-down view of a human head silhouette with the nose tip pointing up, head outline as a thin grey ellipse, the occipital (back) region filled with solid blue color hex 0072B2, nothing else colored, white background, simple line drawing"
+  "brain_central.png|top-down view of a human head silhouette with nose tip pointing up, head outline as a thin grey ellipse, a horizontal band across the middle filled with solid vermillion color hex D55E00, nothing else colored, white background, simple line drawing"
+  "brain_left-frontotemporal.png|top-down view of a human head silhouette with nose tip pointing up, head outline as a thin grey ellipse, the left frontotemporal region (left side, slightly toward front) filled with solid green color hex 009E73, nothing else colored, white background, simple line drawing"
+  "brain_occipital-and-frontal.png|top-down view of a human head silhouette with nose tip pointing up, head outline as a thin grey ellipse, two regions filled with solid reddish-purple color hex CC79A7: the occipital (back) region and two small frontal circles near the front, white background, simple line drawing"
+)
+
+for entry in "${icons[@]}"; do
+  IFS='|' read -r fname prompt <<< "$entry"
+  echo "=== Generating $fname ==="
+  $PY $SCRIPT "$prompt" -o "$fname" --transparent 2>&1 | grep -E "^(Backend|Saved|Error)" || echo "FAILED $fname"
+done
+echo "=== Done ==="
+ls -la *.png
diff --git a/manuscript/narrative-review/figures/icons/stim_abstract-animation.png b/manuscript/narrative-review/figures/icons/stim_abstract-animation.png
new file mode 100644
index 0000000..718b7fa
Binary files /dev/null and b/manuscript/narrative-review/figures/icons/stim_abstract-animation.png differ
diff --git a/manuscript/narrative-review/figures/icons/stim_character-animation.png b/manuscript/narrative-review/figures/icons/stim_character-animation.png
new file mode 100644
index 0000000..d8a68ba
Binary files /dev/null and b/manuscript/narrative-review/figures/icons/stim_character-animation.png differ
diff --git a/manuscript/narrative-review/figures/icons/stim_gratings.png b/manuscript/narrative-review/figures/icons/stim_gratings.png
new file mode 100644
index 0000000..0b388b5
Binary files /dev/null and b/manuscript/narrative-review/figures/icons/stim_gratings.png differ
diff --git a/manuscript/narrative-review/figures/icons/stim_heider-simmel.png b/manuscript/narrative-review/figures/icons/stim_heider-simmel.png
new file mode 100644
index 0000000..1be0525
Binary files /dev/null and b/manuscript/narrative-review/figures/icons/stim_heider-simmel.png differ
diff --git a/manuscript/narrative-review/figures/icons/stim_live-action-film.png b/manuscript/narrative-review/figures/icons/stim_live-action-film.png
new file mode 100644
index 0000000..c0434fb
Binary files /dev/null and b/manuscript/narrative-review/figures/icons/stim_live-action-film.png differ
diff --git a/manuscript/narrative-review/figures/icons/stim_photographs.png b/manuscript/narrative-review/figures/icons/stim_photographs.png
new file mode 100644
index 0000000..65c03c4
Binary files /dev/null and b/manuscript/narrative-review/figures/icons/stim_photographs.png differ
diff --git a/manuscript/narrative-review/figures/icons/theme.json b/manuscript/narrative-review/figures/icons/theme.json
new file mode 100644
index 0000000..149f6f7
--- /dev/null
+++ b/manuscript/narrative-review/figures/icons/theme.json
@@ -0,0 +1,21 @@
+{
+  "theme_id": "tics-narrative-review-2026",
+  "palette": {
+    "primary": "#000000",
+    "accent": "#0072B2",
+    "neutral": "#666666",
+    "bg": "transparent"
+  },
+  "stroke": {"weight_px": 6, "linejoin": "round"},
+  "style_tokens": [
+    "flat 2D",
+    "clean line drawing",
+    "monochromatic black on transparent",
+    "minimal detail",
+    "centered composition",
+    "scientific journal style",
+    "Trends in Cognitive Sciences aesthetic"
+  ],
+  "negative_tokens": ["text", "labels", "watermark", "gradient", "3D", "shadow", "color (unless specified)", "photorealism"],
+  "composition": {"aspect": "1:1", "padding_pct": 15, "perspective": "orthographic"}
+}
diff --git a/manuscript/narrative-review/manuscript.md b/manuscript/narrative-review/manuscript.md
index 805c54c..5e4d19a 100644
--- a/manuscript/narrative-review/manuscript.md
+++ b/manuscript/narrative-review/manuscript.md
@@ -11,7 +11,7 @@ authors:
 affiliations:
   - id: 1
     name: "Open Science Collective"
-status: "draft (Phase 3)"
+status: "final assembly (Phase 5)"
 date: "2026-05-20"
 word_budget:
   main_text: 4000
@@ -36,94 +36,94 @@ Naturalistic-stimulus neuroscience has moved from whole-clip inter-subject corre
 
 ## 1. Introduction: the per-shot turn
 
-Naturalistic-stimulus neuroscience moved from controlled gratings to feature films in two waves. The first wave was functional. Hasson and colleagues showed that voxel-level cortical activity synchronises across viewers of the same audiovisual movie in up to 45 percent of cortex during free fMRI viewing [Hasson2004IntersubjectSO]. The second wave was electrophysiological. Correlated-component analysis on scalp EEG demonstrated that engagement, attention, memory, and audience preference all scale with the reliability of stimulus-locked variance [dmochowski2012correlated; Ki2016AttentionSM; Cohen2016MemorableAN; Dmochowski2014AudiencePA; Madsen2022CognitivePO]. A third wave is now emerging that interrogates individual events within the continuous stream. Nentwich and colleagues recorded 6328 contacts in 23 patients across 43.6 minutes of film clips and regressed responses against optical-flow magnitude, saccade onsets, and film-cut onsets simultaneously, finding whole-brain shot-cut transients with semantic novelty modulation [Nentwich2023SemanticNM]. The hippocampus distinguishes within-event camera cuts from across-event narrative boundaries [Ben-Yakov2018TheHF], and event segmentation theory frames boundaries as moments of high prediction error, with hierarchical timescales mapped from sensory cortex to default-mode regions [zacks2007event; speer2007narrative; baldassano2017event].
+Naturalistic-stimulus neuroscience moved from controlled gratings to feature films in two waves. The first wave was functional. Hasson and colleagues showed that voxel-level cortical activity synchronises across viewers of the same audiovisual movie in up to 45 percent of cortex during free fMRI viewing [1]. The second wave was electrophysiological. Correlated-component analysis on scalp EEG demonstrated that engagement, attention, memory, and audience preference all scale with the reliability of stimulus-locked variance [2,3,4,5,6]. A third wave is now emerging that interrogates individual events within the continuous stream. Nentwich and colleagues recorded 6328 contacts in 23 patients across 43.6 minutes of film clips and regressed responses against optical-flow magnitude, saccade onsets, and film-cut onsets simultaneously, finding whole-brain shot-cut transients with semantic novelty modulation [7]. The hippocampus distinguishes within-event camera cuts from across-event narrative boundaries [8], and event segmentation theory frames boundaries as moments of high prediction error, with hierarchical timescales mapped from sensory cortex to default-mode regions [9,10,11].
 
-A separate developmental tradition has used Pixar shorts in fMRI to map theory of mind (ToM) and pain networks in children as young as three [Richardson2018DevelopmentOT] and silent abstract animation to improve magnetic resonance imaging (MRI) compliance and reveal reliable network-level activity [Vanderwal2015InscapesAM]. Cross-sectional EEG-ISC across ages 6 to 44 is the closest electrophysiological developmental anchor; ISC is highest in children and declines into adulthood [Petroni2018TheVO]. None of these traditions has reported per-shot ERSP at the 0 to 500 ms post-onset window in a child cohort viewing animation.
+A separate developmental tradition has used Pixar shorts in fMRI to map theory of mind (ToM) and pain networks in children as young as three [12] and silent abstract animation to improve magnetic resonance imaging (MRI) compliance and reveal reliable network-level activity [13]. Cross-sectional EEG-ISC across ages 6 to 44 is the closest electrophysiological developmental anchor; ISC is highest in children and declines into adulthood [14]. None of these traditions has reported per-shot ERSP at the 0 to 500 ms post-onset window in a child cohort viewing animation.
 
 This review argues that four research perspectives, psychophysics, action, language, and emotion, make divergent and partly-falsifiable predictions about this empty cell. Sections 2 to 6 develop the perspectives in order. Section 7 synthesises them into a topographic-and-band rejection region that a pre-registered group analysis can adopt before opening the data. Box 1 anchors the argument to the Healthy Brain Network EEG (HBN-EEG) Release 3 cohort viewing *The Present* (Pixar 2014), the empty-cell stimulus that motivates the review.
 
 ## 2. The four-perspective scaffold
 
-The four-perspective scaffold is structural rather than decorative. Each perspective makes a different *kind* of prediction. Psychophysics names a regressor of no interest that must be partialled before any social claim can be defended. Action names a band-and-topography prediction (mu-band event-related desynchronisation [ERD] over central rolandic cortex) with adult precedent. Language names a method that structurally cannot transfer (language-model surprisal aligned to spoken transcripts) plus a positive sub-thread of silent-narrative findings that does transfer. Emotion names two distinct predictions at incompatible latencies (early occipital alpha desynchronisation and later frontal-asymmetric alpha). Together the four make a hierarchy of prior evidence depth that the data can rerank.
+The four-perspective scaffold is structural rather than decorative. Each perspective makes a different *kind* of prediction. Psychophysics names a regressor of no interest that must be partialled before any social claim can be defended. Action names a band-and-topography prediction (mu-band event-related desynchronisation, ERD, over central rolandic cortex) with adult precedent. Language names a method that structurally cannot transfer (language-model surprisal aligned to spoken transcripts) plus a positive sub-thread of silent-narrative findings that does transfer. Emotion names two distinct predictions at incompatible latencies (early occipital alpha desynchronisation and later frontal-asymmetric alpha). Together the four make a hierarchy of prior evidence depth that the data can rerank.
 
-The perspectives cross 15 corpus themes catalogued in our Phase 2 science map (Figure 1). Two themes anchor the analytic backbone independent of perspective: ISC as a reliability metric (Theme 1), originating in fMRI [Hasson2004IntersubjectSO] and migrating to EEG [dmochowski2012correlated], MEG [Lankinen2014IntersubjectCO], peripheral physiology [Madsen2022CognitivePO], and audience prediction [Dmochowski2014AudiencePA]; and event segmentation (Theme 2), anchored in event-segmentation theory and hidden-Markov-model event-state recovery [zacks2007event; baldassano2017event; speer2007narrative; Ben-Yakov2018TheHF]. Theme 3 (naturalness gradient; Figure 2) places the stimulus on a continuum from controlled gratings to live-action film, with character animation as the intermediate point that motivates the empty-cell framing.
+The perspectives cross 15 corpus themes catalogued in our Phase 2 science map (Figure 1). Two themes anchor the analytic backbone independent of perspective: ISC as a reliability metric (Theme 1), originating in fMRI [1] and migrating to EEG [2], MEG [15], peripheral physiology [6], and audience prediction [5]; and event segmentation (Theme 2), anchored in event-segmentation theory and hidden-Markov-model event-state recovery [8,9,10,11]. Theme 3 (naturalness gradient; Figure 2) places the stimulus on a continuum from controlled gratings to live-action film, with character animation as the intermediate point that motivates the empty-cell framing.
 
-The four perspectives then sit in specific corners of this theme space. Psychophysics owns Themes 4 (low-level feature regressors) [Adelson1985SpatiotemporalEM; Carandini2011NormalizationAA; Nishimoto2011ReconstructingVE], 5 (time-resolved EEG and MEG), and 11 (free-viewing EEG with eye coregistration). Action owns Themes 6 (mu rhythm and action observation) [hari1998action; pineda2005mu] and 8 (social cognition through biological motion) and contributes to Themes 2 and 14 (distributed multivariate signatures). Language owns Theme 9 (LMs as regressors) [Goldstein2022SharedCP; Caucheteux2022BrainsAA] as a structural comparator and Theme 10 (audiovisual integration), but its silent-narrative sub-thread cuts across Themes 8 (social cognition; default-mode network as narrative integrator) and 13 (developmental neuroimaging in cinematic paradigms). Emotion owns Themes 7 (affective dynamics), 12 (pet, animal, and baby-schema affective response), and 13. Theme 15 (predictive processing) is a cross-perspective unifier: it ties mu-band ERD to mirror-system prediction error, LM surprisal to next-word prediction, and event boundaries to prediction-error transients.
+The four perspectives then sit in specific corners of this theme space. Psychophysics owns Themes 4 (low-level feature regressors) [16,17,18], 5 (time-resolved EEG and MEG), and 11 (free-viewing EEG with eye coregistration). Action owns Themes 6 (mu rhythm and action observation) [19,20] and 8 (social cognition through biological motion) and contributes to Themes 2 and 14 (distributed multivariate signatures). Language owns Theme 9 (LMs as regressors) [21,22] as a structural comparator and Theme 10 (audiovisual integration), but its silent-narrative sub-thread cuts across Themes 8 (social cognition; default-mode network as narrative integrator) and 13 (developmental neuroimaging in cinematic paradigms). Emotion owns Themes 7 (affective dynamics), 12 (pet, animal, and baby-schema affective response), and 13. Theme 15 (predictive processing) is a cross-perspective unifier: it ties mu-band ERD to mirror-system prediction error, LM surprisal to next-word prediction, and event boundaries to prediction-error transients.
 
 Perspective overlap is intentional rather than residual; the perspectives interact at the per-shot ERSP level rather than partitioning variance cleanly. Sections 3 to 6 develop them in order, naming the band-by-topography signature each makes and the falsification region attached to each (Figure 4). Section 7 closes by combining the four rejection regions into a single pre-registerable test before group analysis.
 
 ## 3. Psychophysics: the bottom-up floor
 
-Psychophysics anchors the bottom-up floor that every per-shot analysis must clear before claiming a higher-order effect. The lineage runs from primary visual cortex receptive fields [Hubel1962ReceptiveFB] and divisive normalisation [Carandini2011NormalizationAA] through natural-image statistics and spatiotemporal energy [Bell1997TheC; Simoncelli2001NaturalIS; Adelson1985SpatiotemporalEM] to middle-temporal motion machinery [Born2005StructureAF; Bartels2008NaturalVR]. Nishimoto and colleagues reconstructed natural movies from blood-oxygen-level-dependent activity in occipitotemporal cortex using a motion-energy front end derived from Adelson and Bergen, an existence proof that an Adelson-Bergen feature bank suffices to recover the stimulus from neural activity [Nishimoto2011ReconstructingVE]. Clinical visual evoked potential work supports a reliable scalp signature for luminance and contrast steps with magnocellular and parvocellular pathway assignment [Tobimatsu2006StudiesOH].
+Psychophysics anchors the bottom-up floor that every per-shot analysis must clear before claiming a higher-order effect. The lineage runs from primary visual cortex receptive fields [23] and divisive normalisation [17] through natural-image statistics and spatiotemporal energy [16,24,25] to middle-temporal motion machinery [26,27]. Nishimoto and colleagues reconstructed natural movies from blood-oxygen-level-dependent activity in occipitotemporal cortex using a motion-energy front end derived from Adelson and Bergen, an existence proof that an Adelson-Bergen feature bank suffices to recover the stimulus from neural activity [18]. Clinical visual evoked potential work supports a reliable scalp signature for luminance and contrast steps with magnocellular and parvocellular pathway assignment [28].
 
-The closest electrophysiological analogue to per-shot ERSP during naturalistic film is the intracranial study of Nentwich and colleagues, who showed that motion outranks luminance for occipitoparietal cortex when triple-regressed against optical-flow magnitude, saccade onsets, and film-cut onsets [Nentwich2023SemanticNM]. That result establishes a quantitative ranking among low-level regressors: per-shot log luminance ratio (LLR) is one of several low-level features that needs accounting. EEG ISC at the whole-clip scale tracks low-level features at occipital electrodes more strongly than higher-order content [dmochowski2012correlated; Madsen2022CognitivePO; Cohen2016MemorableAN], although attention strongly modulates this baseline [Ki2016AttentionSM]. An envelope-only auditory control isolating low-level acoustic structure from higher-level musical structure [Kaneshiro2021InterSubjectEC] is the methodological template the LLR-as-covariate plan inherits.
+The closest electrophysiological analogue to per-shot ERSP during naturalistic film is the intracranial study of Nentwich and colleagues, who showed that motion outranks luminance for occipitoparietal cortex when triple-regressed against optical-flow magnitude, saccade onsets, and film-cut onsets [7]. That result establishes a quantitative ranking among low-level regressors: per-shot log luminance ratio (LLR) is one of several low-level features that needs accounting. EEG ISC at the whole-clip scale tracks low-level features at occipital electrodes more strongly than higher-order content [2,4,6], although attention strongly modulates this baseline [3]. An envelope-only auditory control isolating low-level acoustic structure from higher-level musical structure [29] is the methodological template the LLR-as-covariate plan inherits.
 
-A second class of bottom-up drivers operates through the eye. Free-viewing EEG depends on eye-movement coregistration to separate stimulus-onset responses from saccade-locked and fixation-related potentials [Dimigen2011CoregistrationOE; Plöchl2012CombiningEA], and regression deconvolution of overlapping events is the methodological state of the art [Dimigen2021RegressionbasedAO]. Gaze coherence varies with stimulus class, highest on Hollywood trailers and lowest on natural movie clips and static images [Dorr2010VariabilityOE]; a Pixar short sits between these extremes. The HBN-EEG cohort carries no synchronous eye tracker, which means a per-shot analysis cannot deconvolve overlapping saccade-locked transients from shot-onset responses. Independent component analysis (ICA)-based artifact rejection through adaptive mixture ICA (AMICA) and IC classification (ICLabel) is the operating compromise [Bell1997TheC]. The implication for per-shot ERSP is asymmetric: per-shot LLR is the minimum partialling for any social-content claim. Motion energy computed offline from the stimulus video is the named first follow-up regressor [Nishimoto2011ReconstructingVE; Nentwich2023SemanticNM]. The multivariate temporal response function (mTRF) toolbox supplies the production regression framework [Crosse2016TheMT]. Figure 2 places the empty cell on the naturalness gradient.
+A second class of bottom-up drivers operates through the eye. Free-viewing EEG depends on eye-movement coregistration to separate stimulus-onset responses from saccade-locked and fixation-related potentials [Dimigen2011CoregistrationOE; Plöchl2012CombiningEA], and regression deconvolution of overlapping events is the methodological state of the art [30]. Gaze coherence varies with stimulus class, highest on Hollywood trailers and lowest on natural movie clips and static images [31]; a Pixar short sits between these extremes. The HBN-EEG cohort carries no synchronous eye tracker, which means a per-shot analysis cannot deconvolve overlapping saccade-locked transients from shot-onset responses. Independent component analysis (ICA)-based artifact rejection through adaptive mixture ICA (AMICA) and IC classification (ICLabel) is the operating compromise [24]. The implication for per-shot ERSP is asymmetric: per-shot LLR is the minimum partialling for any social-content claim. Motion energy computed offline from the stimulus video is the named first follow-up regressor [7,18]. The multivariate temporal response function (mTRF) toolbox supplies the production regression framework [32]. Figure 2 places the empty cell on the naturalness gradient.
 
 ## 4. Action: mu-band ERD and event segmentation
 
-The action perspective makes the most specific positive prediction in the 0 to 500 ms ERSP window. Hari and colleagues showed by magnetoencephalography (MEG) that primary motor cortex is activated during passive observation of hand action via 15 to 25 Hz rolandic rebound suppression that reaches 31 to 46 percent of execution-related suppression [hari1998action]. Pineda framed the EEG mu rhythm (8 to 13 Hz over electrodes C3, Cz, and C4) as a non-invasive proxy for human mirror-system engagement [pineda2005mu]. Mu suppression magnitude during action observation correlates with self-reported social skill across neurotypical adults [oberman2007mirror]. Lesion-symptom mapping places posterior superior temporal sulcus (STS) and ventral premotor cortex as causally necessary nodes for biological-motion perception [saygin2007sts; johansson1973biological]. Predictive-coding reformulations recast mirror responses as scaling with prediction error over goal and intention [kilner2007predictive; rizzolatti2004mirror; iacoboni2009mirror]. The mirror-system framing also has well-known critiques, in particular Hickok-style objections to one-to-one mirror-interpretations of mu suppression, which are not represented as cards in our corpus and which temper the weight that the action-perspective prediction can carry.
+The action perspective makes the most specific positive prediction in the 0 to 500 ms ERSP window. Hari and colleagues showed by magnetoencephalography (MEG) that primary motor cortex is activated during passive observation of hand action via 15 to 25 Hz rolandic rebound suppression that reaches 31 to 46 percent of execution-related suppression [19]. Pineda framed the EEG mu rhythm (8 to 13 Hz over electrodes C3, Cz, and C4) as a non-invasive proxy for human mirror-system engagement [20]. Mu suppression magnitude during action observation correlates with self-reported social skill across neurotypical adults [33]. Lesion-symptom mapping places posterior superior temporal sulcus (STS) and ventral premotor cortex as causally necessary nodes for biological-motion perception [34,35]. Predictive-coding reformulations recast mirror responses as scaling with prediction error over goal and intention [36,37,38]. The mirror-system framing also has well-known critiques, in particular Hickok-style objections to one-to-one mirror-interpretations of mu suppression, which are not represented as cards in our corpus and which temper the weight that the action-perspective prediction can carry.
 
-Even with that tempering, the prediction is specific. Shots dominated by character action should produce ERD in the mu band over central electrodes, with possible beta-band rebound suppression. The Heider-Simmel tradition shows that even abstract triangle animations recruit posterior STS, medial prefrontal cortex, and temporal poles when motion implies intention [castelli2000heider]. The naturalness gradient places character animation between abstract Heider-Simmel and live-action [hasson2010natural]. The inferential bridge from triangle-animation fMRI activation to character-animation mu-band EEG ERD is plausible and untested at scalp-EEG resolution.
+Even with that tempering, the prediction is specific. Shots dominated by character action should produce ERD in the mu band over central electrodes, with possible beta-band rebound suppression. The Heider-Simmel tradition shows that even abstract triangle animations recruit posterior STS, medial prefrontal cortex, and temporal poles when motion implies intention [39]. The naturalness gradient places character animation between abstract Heider-Simmel and live-action [40]. The inferential bridge from triangle-animation fMRI activation to character-animation mu-band EEG ERD is plausible and untested at scalp-EEG resolution.
 
-The second action beat is event segmentation. Speer and colleagues found posterior cingulate, middle-temporal, and posterior STS boundary-locked transients in fMRI during narrative listening [speer2007narrative]. Baldassano and colleagues recovered a hierarchy of event boundaries from Sherlock-movie fMRI using HMM, with hippocampal boundary signals predicting subsequent free recall [baldassano2017event]. Lerner and colleagues mapped temporal receptive windows from sensory cortex (milliseconds) to default-mode regions (tens of seconds) [lerner2011temporal]. Chen and colleagues showed event-specific patterns in the default-mode network are shared across viewers and reactivated at recall [chen2017shared]. Ben-Yakov and Henson distinguished within-event camera cuts, which produce minimal hippocampal responses, from across-event narrative boundaries, which produce robust ones [Ben-Yakov2018TheHF]. Magliano and Zacks supplied the behavioural foundation that viewers segment edited films along cuts independent of dialogue [Magliano2011TheIO].
+The second action beat is event segmentation. Speer and colleagues found posterior cingulate, middle-temporal, and posterior STS boundary-locked transients in fMRI during narrative listening [10]. Baldassano and colleagues recovered a hierarchy of event boundaries from Sherlock-movie fMRI using HMM, with hippocampal boundary signals predicting subsequent free recall [11]. Lerner and colleagues mapped temporal receptive windows from sensory cortex (milliseconds) to default-mode regions (tens of seconds) [41]. Chen and colleagues showed event-specific patterns in the default-mode network are shared across viewers and reactivated at recall [42]. Ben-Yakov and Henson distinguished within-event camera cuts, which produce minimal hippocampal responses, from across-event narrative boundaries, which produce robust ones [8]. Magliano and Zacks supplied the behavioural foundation that viewers segment edited films along cuts independent of dialogue [43].
 
-A third action beat concerns single-agent versus two-agent shots. Sliwa and Freiwald documented a dedicated cortical network in macaque for processing two-agent social interaction, separable from single-agent action perception [sliwa2017macaque]. This motivates excluding two-agent shots from a clean single-agent contrast, since the social-interaction network may dominate two-agent variance.
+A third action beat concerns single-agent versus two-agent shots. Sliwa and Freiwald documented a dedicated cortical network in macaque for processing two-agent social interaction, separable from single-agent action perception [44]. This motivates excluding two-agent shots from a clean single-agent contrast, since the social-interaction network may dominate two-agent variance.
 
 ## 5. Language: comparator of non-transfer plus silent-narrative sub-thread
 
 ### 5a. Language-model regressors are structurally non-transferable
 
-The contemporary methodological mainstream in naturalistic neuroimaging is built around transformer-based language-model (LM) regressors aligned to spoken or read transcripts. Goldstein and colleagues showed pre-onset prediction, post-onset surprise, and contextual-embedding signatures shared between word-by-word electrocorticography (ECoG) and autoregressive LMs [Goldstein2022SharedCP]. Each signature depends on speech-onset alignment. Heilbron and colleagues separated lexical, syntactic, and semantic surprisal regressors during MEG audiobook listening, all derived from LMs with word-onset alignment [Heilbron2020AHO]. Caucheteux and colleagues mapped transformer intermediate layers to fMRI and MEG responses to natural narrative [Caucheteux2022BrainsAA] and a cortical hierarchy of prediction timescales [Caucheteux2023EvidenceOA]. Antonello and colleagues documented log-linear scaling of brain prediction with LM parameter count up to 30B [Antonello2023ScalingLF]. Schrimpf and colleagues showed that next-word-prediction quality drives brain score on fMRI, ECoG, and reading-time benchmarks [schrimpf2021the]. Toneva and Wehbe used BERT to predict reading fMRI and MEG, with attention-head ablations linking brain prediction to natural-language processing performance [toneva2019interpreting]. Huth and colleagues built the canonical voxelwise word-embedding encoding atlas tiling cortex with semantic clusters; this method requires spoken transcripts [Huth2016NaturalSR]. Nelson and colleagues tracked open-node count during syntactic merge using intracranial high-gamma dynamics, explicitly reading-based [Nelson2017NeurophysiologicalDO]. The N400 family bridges to picture-context paradigms at the cost of dynamic stimulus [Kutas2011ThirtyYA; DeLong2005ProbabilisticWP].
+The contemporary methodological mainstream in naturalistic neuroimaging is built around transformer-based language-model (LM) regressors aligned to spoken or read transcripts. Goldstein and colleagues showed pre-onset prediction, post-onset surprise, and contextual-embedding signatures shared between word-by-word electrocorticography (ECoG) and autoregressive LMs [21]. Each signature depends on speech-onset alignment. Heilbron and colleagues separated lexical, syntactic, and semantic surprisal regressors during MEG audiobook listening, all derived from LMs with word-onset alignment [45]. Caucheteux and colleagues mapped transformer intermediate layers to fMRI and MEG responses to natural narrative [22] and a cortical hierarchy of prediction timescales [46]. Antonello and colleagues documented log-linear scaling of brain prediction with LM parameter count up to 30B [47]. Schrimpf and colleagues showed that next-word-prediction quality drives brain score on fMRI, ECoG, and reading-time benchmarks [48]. Toneva and Wehbe used BERT to predict reading fMRI and MEG, with attention-head ablations linking brain prediction to natural-language processing performance [49]. Huth and colleagues built the canonical voxelwise word-embedding encoding atlas tiling cortex with semantic clusters; this method requires spoken transcripts [50]. Nelson and colleagues tracked open-node count during syntactic merge using intracranial high-gamma dynamics, explicitly reading-based [51]. The N400 family bridges to picture-context paradigms at the cost of dynamic stimulus [52,53].
 
-Each method depends on word-level alignment to spoken or read stimuli. *The Present* is wordless. All seven Category G cards in our language ontology (and 12 cards corpus-wide) carry `transfer-to-silent: no`. A vision-side analogue, multimodal vision-language model embeddings or scene-difference deep-network features as continuous regressors, does not yet exist in the corpus for scalp-EEG ERSP. The Lipkin frontotemporal language network atlas [Lipkin2022ProbabilisticAF] is included as the negative-control region of interest in the falsification region of Section 7.
+Each method depends on word-level alignment to spoken or read stimuli. *The Present* is wordless. All seven Category G cards in our language ontology (and 12 cards corpus-wide) carry `transfer-to-silent: no`. A vision-side analogue, multimodal vision-language model embeddings or scene-difference deep-network features as continuous regressors, does not yet exist in the corpus for scalp-EEG ERSP. The Lipkin frontotemporal language network atlas [54] is included as the negative-control region of interest in the falsification region of Section 7.
 
 ### 5b. Silent-narrative neural correlates that do transfer
 
-Silent-narrative neural correlates do transfer to scalp-EEG ERSP analysis even when language-model regressors cannot. Castelli and colleagues showed that silent geometric-shape animations engage medial prefrontal cortex, the temporo-parietal junction, and STS when motion implies social interaction, with no speech required [Castelli2000MovementAM; castelli2000heider]; the same paradigm in autism shows reduced engagement [Castelli2002AutismAS]. Vanderwal and colleagues built Inscapes, a purpose-built silent abstract animation that improves MRI compliance and produces reliable network-level activity, used by the HBN cohort itself [Vanderwal2015InscapesAM]. Naci and colleagues used a Hitchcock excerpt as a covert assessment, showing that high-order cortex can be probed from a near-silent narrative [Naci2014ACN]. Lankinen and colleagues report source-space MEG reliable across viewers in occipital and temporal cortex during silent-visual and audiovisual movie conditions, the closest electrophysiological analogue with a deliberate silent-visual condition [Lankinen2014IntersubjectCO]. The Studyforrest infrastructure provides an audio-only foundation that has been extended to silent-cohort contrasts [Hanke2014AH7]. Schroeder and colleagues described modality-general delta- and theta-band phase alignment to attended event onsets, providing the mechanistic frame for shot-onset ERSP independent of speech [Schroeder2009LowfrequencyNO]. Senkowski and colleagues described transient gamma synchronisation and low-frequency phase coupling for cross-modal binding [Senkowski2008CrossmodalBT]. Buckner, Simony, Yeshurun, Mar, and Tamir developed the default-mode network as narrative integrator, with framing context driving within-stimulus divergence [Buckner2008TheBD; Simony2016DynamicRO; Yeshurun2017SameSD; Mar2011TheNB; Tamir2016ReadingFA].
+Silent-narrative neural correlates do transfer to scalp-EEG ERSP analysis even when language-model regressors cannot. Castelli and colleagues showed that silent geometric-shape animations engage medial prefrontal cortex, the temporo-parietal junction, and STS when motion implies social interaction, with no speech required [39,55]; the same paradigm in autism shows reduced engagement [56]. Vanderwal and colleagues built Inscapes, a purpose-built silent abstract animation that improves MRI compliance and produces reliable network-level activity, used by the HBN cohort itself [13]. Naci and colleagues used a Hitchcock excerpt as a covert assessment, showing that high-order cortex can be probed from a near-silent narrative [57]. Lankinen and colleagues report source-space MEG reliable across viewers in occipital and temporal cortex during silent-visual and audiovisual movie conditions, the closest electrophysiological analogue with a deliberate silent-visual condition [15]. The Studyforrest infrastructure provides an audio-only foundation that has been extended to silent-cohort contrasts [58]. Schroeder and colleagues described modality-general delta- and theta-band phase alignment to attended event onsets, providing the mechanistic frame for shot-onset ERSP independent of speech [59]. Senkowski and colleagues described transient gamma synchronisation and low-frequency phase coupling for cross-modal binding [60]. Buckner, Simony, Yeshurun, Mar, and Tamir developed the default-mode network as narrative integrator, with framing context driving within-stimulus divergence [61,62,63,64,65].
 
 The language perspective therefore plays two roles. The 5a sub-thread isolates the silent-stimulus design from the dominant LM-as-regressor framework. The 5b sub-thread supplies the cortical substrates that silent narrative engages: medial prefrontal cortex, the temporo-parietal junction, the STS, and the default-mode network. Their independent-component-cluster analogues in EEG are the search regions for the per-shot ERSP analysis. Figure 3 makes the gap structure explicit.
 
 ## 6. Emotion: two predictions at different latencies
 
-The emotion perspective makes two predictions with different latencies and different implicated structures. The first is an early visual-cortex emotion-schema response. Kragel and colleagues built EmoNet, a deep-learning model showing that emotion schemas are encoded in early visual cortex, predicting that emotion-tuned visual representations should appear in early-latency occipital ERSP [Kragel2018EmotionSA]. Saarimaki and colleagues decoded six basic emotions during emotional movie viewing using fMRI multi-voxel pattern analysis [Saarimäki2016DiscreteNS]; Cowen and Keltner extended the taxonomy to 27 distinguishable categories from short videos [Cowen2017SelfreportC2]. Distributed-network meta-analysis argues for distributed signatures over strict regional localisation [Lindquist2012TheBB], with the neurologic pain signature as a methodological exemplar of multivariate signatures of affect [Wager2013AnFN]. The closest EEG correlate at the 0 to 500 ms scale is early occipital alpha desynchronisation (80 to 300 ms post-shot-onset, extrapolated from static-picture latencies). Codispoti and colleagues (2023) review the EEG alpha-band literature on emotional picture perception and conclude that alpha desynchronisation is a robust correlate of attentional engagement by emotional stimuli, with parametric arousal modulation [Codispoti2023AlphabandOA]. Whether this transfers to dynamic naturalistic stimuli at sub-second timescales in a child cohort is untested.
+The emotion perspective makes two predictions with different latencies and different implicated structures. The first is an early visual-cortex emotion-schema response. Kragel and colleagues built EmoNet, a deep-learning model showing that emotion schemas are encoded in early visual cortex, predicting that emotion-tuned visual representations should appear in early-latency occipital ERSP [66]. Saarimaki and colleagues decoded six basic emotions during emotional movie viewing using fMRI multi-voxel pattern analysis [Saarimäki2016DiscreteNS]; Cowen and Keltner extended the taxonomy to 27 distinguishable categories from short videos [67]. Distributed-network meta-analysis argues for distributed signatures over strict regional localisation [68], with the neurologic pain signature as a methodological exemplar of multivariate signatures of affect [69]. The closest EEG correlate at the 0 to 500 ms scale is early occipital alpha desynchronisation (80 to 300 ms post-shot-onset, extrapolated from static-picture latencies). Codispoti and colleagues (2023) review the EEG alpha-band literature on emotional picture perception and conclude that alpha desynchronisation is a robust correlate of attentional engagement by emotional stimuli, with parametric arousal modulation [70]. Whether this transfers to dynamic naturalistic stimuli at sub-second timescales in a child cohort is untested.
 
-The second prediction is a longer-latency cuteness or affiliative response. Stoeckel and colleagues reported common activation across child and dog spanning emotion, reward, affiliation, visual processing, and social cognition regions in adult mothers viewing photographs of own child versus own dog [Stoeckel2014PatternsOB]. Glocker and colleagues showed that baby schema parametrically modulates nucleus accumbens reward in adults [Glocker2009BabySM]. Borgi and colleagues demonstrated that children aged 3 to 6 already show parametric cuteness ratings and gaze bias for human infant, puppy, and kitten faces [Borgi2014BabySI]; this is the behavioural anchor that the cuteness response is established well before adolescence. The interpretation implication is that Stoeckel measures identity-level pair-bonding and Borgi measures generic baby schema. HBN viewers have no identity-level bond with an animated puppy, so the relevant inference is from generic baby schema rather than pair-bonding circuitry.
+The second prediction is a longer-latency cuteness or affiliative response. Stoeckel and colleagues reported common activation across child and dog spanning emotion, reward, affiliation, visual processing, and social cognition regions in adult mothers viewing photographs of own child versus own dog [71]. Glocker and colleagues showed that baby schema parametrically modulates nucleus accumbens reward in adults [72]. Borgi and colleagues demonstrated that children aged 3 to 6 already show parametric cuteness ratings and gaze bias for human infant, puppy, and kitten faces [73]; this is the behavioural anchor that the cuteness response is established well before adolescence. The interpretation implication is that Stoeckel measures identity-level pair-bonding and Borgi measures generic baby schema. HBN viewers have no identity-level bond with an animated puppy, so the relevant inference is from generic baby schema rather than pair-bonding circuitry.
 
-Two EEG routes connect these predictions to observables. The first is early occipital alpha-band desynchronisation (80 to 300 ms) as an arousal-modulated correlate of attentional engagement [Codispoti2023AlphabandOA]. The second is later frontal alpha asymmetry (200 to 500 ms; extrapolated downward from the seconds-to-minutes Davidson tradition) as an approach-withdrawal index [Davidson2000AffectiveSP; Coan2004FrontalEA]. An updated meta-analytic critique documents smaller effect sizes and substantial reliability concerns [Reznik2018FrontalAA]. The corpus contains no card applying asymmetry analysis to per-event sub-second windows during a continuous naturalistic stimulus, and none in a developmental cohort viewing film. Frontal asymmetry at shot-onset latency is therefore exploratory rather than confirmatory. The third emotion beat is social cognition. Richardson and colleagues documented ToM and pain networks present from age three and refining with age, using Pixar shorts in 122 children [Richardson2018DevelopmentOT]; this is the load-bearing developmental anchor. Mar synthesised narrative comprehension as a social-cognitive activity [Mar2011TheNB]; Singer and colleagues documented affective pain-region engagement during observed pain [Singer2004EmpathyFP]; Zaki and Ochsner formalised the tripartite empathy model bridging experience sharing and mental-state attribution [Zaki2012TheNO]. Nummenmaa and colleagues showed emotion intensity modulates ISC in midline cortex during film viewing [Nummenmaa2012EmotionsPS]; Schmaelzle and Grall theorised ISC as audience captivation [Schmälzle2020TheCB]. Two predictions sit at incompatible latencies and topographies; an LLR-partialled per-shot generalised linear model (GLM) adjudicates between them.
+Two EEG routes connect these predictions to observables. The first is early occipital alpha-band desynchronisation (80 to 300 ms) as an arousal-modulated correlate of attentional engagement [70]. The second is later frontal alpha asymmetry (200 to 500 ms; extrapolated downward from the seconds-to-minutes Davidson tradition) as an approach-withdrawal index [74,75]. An updated meta-analytic critique documents smaller effect sizes and substantial reliability concerns [76]. The corpus contains no card applying asymmetry analysis to per-event sub-second windows during a continuous naturalistic stimulus, and none in a developmental cohort viewing film. Frontal asymmetry at shot-onset latency is therefore exploratory rather than confirmatory. The third emotion beat is social cognition. Richardson and colleagues documented ToM and pain networks present from age three and refining with age, using Pixar shorts in 122 children [12]; this is the load-bearing developmental anchor. Mar synthesised narrative comprehension as a social-cognitive activity [64]; Singer and colleagues documented affective pain-region engagement during observed pain [77]; Zaki and Ochsner formalised the tripartite empathy model bridging experience sharing and mental-state attribution [78]. Nummenmaa and colleagues showed emotion intensity modulates ISC in midline cortex during film viewing [79]; Schmaelzle and Grall theorised ISC as audience captivation [Schmälzle2020TheCB]. Two predictions sit at incompatible latencies and topographies; an LLR-partialled per-shot generalised linear model (GLM) adjudicates between them.
 
 ## 7. Synthesis: integration, falsifiability, and open questions
 
 ### 7.1 Integration
 
-The four perspectives rank by depth of prior evidence. Psychophysics has the deepest precedent and the simplest operationalisation: partial LLR, optionally motion energy, before any condition claim. Action has the deepest specific oscillatory prediction (mu-band ERD over central rolandic clusters) but no animated-agent precedent in EEG. Language is structurally non-transferable for LM regressors but supplies cortical priors for silent narrative through its 5b sub-thread (medial prefrontal cortex, the temporo-parietal junction, the STS, the default-mode network). Emotion supplies two predictions: early occipital alpha desynchronisation [Codispoti2023AlphabandOA; Kragel2018EmotionSA] and later frontal-asymmetric alpha [Davidson2000AffectiveSP], with the cuteness response anchored developmentally by Borgi [Borgi2014BabySI]. Distributed-multivariate-signature framing supports IC-cluster-level analyses over single-IC decoding [Lindquist2012TheBB; chen2017shared]. Figure 4 displays the four predictions in tabular form.
+The four perspectives rank by depth of prior evidence. Psychophysics has the deepest precedent and the simplest operationalisation: partial LLR, optionally motion energy, before any condition claim. Action has the deepest specific oscillatory prediction (mu-band ERD over central rolandic clusters) but no animated-agent precedent in EEG. Language is structurally non-transferable for LM regressors but supplies cortical priors for silent narrative through its 5b sub-thread (medial prefrontal cortex, the temporo-parietal junction, the STS, the default-mode network). Emotion supplies two predictions: early occipital alpha desynchronisation [66,70] and later frontal-asymmetric alpha [74], with the cuteness response anchored developmentally by Borgi [73]. Distributed-multivariate-signature framing supports IC-cluster-level analyses over single-IC decoding [42,68]. Figure 4 displays the four predictions in tabular form.
 
 ### 7.2 Anchor case
 
-External precedent: Petroni and colleagues recorded 64-channel EEG at 500 Hz from 114 viewers across ages 6 to 44 during passive viewing of six naturalistic videos including animated and live-action shorts [Petroni2018TheVO]. They did not analyse shot-onset ERSP and did not factor stimulus-side regressors, but they demonstrated that scalp-EEG signal exists during developmental naturalistic viewing of short videos. They are the closest external existence proof that the measurement class is feasible in adjacent territory. Internal feasibility: a partly-validated developmental EEG pipeline on HBN-EEG R3 brings 184 subjects through Brain Imaging Data Structure (BIDS) import, 1 Hz high-pass filtering, conditional cleanline gated by Nyquist, `clean_rawdata` channel rejection, AMICA decomposition, ICLabel classification, dipole fitting, and `std_precomp` ERSP precomputation; the operating constraint is that the local working set is 100 Hz, with a 500 Hz validation pass on the full Amazon S3 R3 release scheduled after pipeline validation. The two anchor assertions are independent and not interchangeable.
+External precedent: Petroni and colleagues recorded 64-channel EEG at 500 Hz from 114 viewers across ages 6 to 44 during passive viewing of six naturalistic videos including animated and live-action shorts [14]. They did not analyse shot-onset ERSP and did not factor stimulus-side regressors, but they demonstrated that scalp-EEG signal exists during developmental naturalistic viewing of short videos. They are the closest external existence proof that the measurement class is feasible in adjacent territory. Internal feasibility: a partly-validated developmental EEG pipeline on HBN-EEG R3 brings 184 subjects through Brain Imaging Data Structure (BIDS) import, 1 Hz high-pass filtering, conditional cleanline gated by Nyquist, `clean_rawdata` channel rejection, AMICA decomposition, ICLabel classification, dipole fitting, and `std_precomp` ERSP precomputation; the operating constraint is that the local working set is 100 Hz, with a 500 Hz validation pass on the full Amazon S3 R3 release scheduled after pipeline validation. The two anchor assertions are independent and not interchangeable.
 
 ### 7.3 Falsifiability
 
-A topographic-and-band rejection region for the four-perspective ranking can be pre-registered before group analysis. A surviving central-rolandic mu-band cluster (electrodes C3, Cz, and C4; 8 to 13 Hz) confirms the action prediction. A surviving frontal-asymmetric alpha cluster (electrodes F3 and F4; 8 to 13 Hz) confirms the emotion prediction. A surviving cluster in left frontotemporal IC space, overlapping the Lipkin language-network atlas [Lipkin2022ProbabilisticAF] used as a negative-control mask, falsifies the four-perspective ranking by relocating the surviving signal into a perspective the thesis says should not transfer. A null result on the LLR-partialled GLM at a pre-registered cluster-level alpha (p < 0.05 corrected by mass-univariate cluster-based permutation, with the mTRF toolbox precedent [Crosse2016TheMT]) also falsifies the four-perspective ranking, by localising per-shot ERSP variance entirely to bottom-up features in this cohort. Pinning the rejection region before data analysis is the publication discipline that constrains analyst degrees of freedom.
+A topographic-and-band rejection region for the four-perspective ranking can be pre-registered before group analysis. A surviving central-rolandic mu-band cluster (electrodes C3, Cz, and C4; 8 to 13 Hz) confirms the action prediction. A surviving frontal-asymmetric alpha cluster (electrodes F3 and F4; 8 to 13 Hz) confirms the emotion prediction. A surviving cluster in left frontotemporal IC space, overlapping the Lipkin language-network atlas [54] used as a negative-control mask, falsifies the four-perspective ranking by relocating the surviving signal into a perspective the thesis says should not transfer. A null result on the LLR-partialled GLM at a pre-registered cluster-level alpha (p < 0.05 corrected by mass-univariate cluster-based permutation, with the mTRF toolbox precedent [32]) also falsifies the four-perspective ranking, by localising per-shot ERSP variance entirely to bottom-up features in this cohort. Pinning the rejection region before data analysis is the publication discipline that constrains analyst degrees of freedom.
 
 ### 7.4 Open questions and limitations
 
-Narrative position is a within-stimulus confound. Boy-only and puppy-only shots in *The Present* differ on three-act position: boy-only clusters in the early-act setup, puppy-only in the late-act resolution. Any boy-vs-puppy ERSP difference may therefore be confounded with prediction-error or arousal trajectories. The response is to add shot-index-in-narrative as a continuous covariate in the group GLM and to fit a within-act stratified analysis as a named follow-up [Magliano2011TheIO; baldassano2017event; chen2017shared]; both are tractable from the existing shot-event annotation. Beyond narrative position, several gaps in the corpus limit what this Review can claim. The Hickok-style mu-system critique is not represented in our cards, which weakens the action prediction. Klin and colleagues showed that toddlers with autism orient to audiovisual contingency rather than upright biological motion [klin2009biological] and that adolescents with autism fixate eyes 50 percent as often during emotionally evocative viewing [klin2002visual]; the HBN cohort includes a substantial autism-spectrum subsample, so autism status is a candidate moderator, but stratified analyses (autism-spectrum, attention, social skill) are exploratory follow-ups rather than primary tests. The emotion literature is predominantly adult; the three pet-evoked affective cards are fMRI or behavioural, not EEG. Frontal asymmetry at sub-second timescales is unprecedented and reliability-limited. The single-stimulus design forbids generalisation beyond *The Present*. The 100 Hz local working set caps beta-band and gamma-band claims until the 500 Hz validation pass. The Outstanding Questions Box collects the forward-looking adjudication targets.
+Narrative position is a within-stimulus confound. Boy-only and puppy-only shots in *The Present* differ on three-act position: boy-only clusters in the early-act setup, puppy-only in the late-act resolution. Any boy-vs-puppy ERSP difference may therefore be confounded with prediction-error or arousal trajectories. The response is to add shot-index-in-narrative as a continuous covariate in the group GLM and to fit a within-act stratified analysis as a named follow-up [11,42,43]; both are tractable from the existing shot-event annotation. Beyond narrative position, several gaps in the corpus limit what this Review can claim. The Hickok-style mu-system critique is not represented in our cards, which weakens the action prediction. Klin and colleagues showed that toddlers with autism orient to audiovisual contingency rather than upright biological motion [80] and that adolescents with autism fixate eyes 50 percent as often during emotionally evocative viewing [81]; the HBN cohort includes a substantial autism-spectrum subsample, so autism status is a candidate moderator, but stratified analyses (autism-spectrum, attention, social skill) are exploratory follow-ups rather than primary tests. The emotion literature is predominantly adult; the three pet-evoked affective cards are fMRI or behavioural, not EEG. Frontal asymmetry at sub-second timescales is unprecedented and reliability-limited. The single-stimulus design forbids generalisation beyond *The Present*. The 100 Hz local working set caps beta-band and gamma-band claims until the 500 Hz validation pass. The Outstanding Questions Box collects the forward-looking adjudication targets.
 
 ## Box 1: HBN-EEG Release 3 as the anchor cohort
 
-The Healthy Brain Network EEG (HBN-EEG) Release 3 cohort recruits 5- to 21-year-old participants in a research-grade developmental setting and records 128-channel HydroCel Geodesic Sensor Net during passive viewing of the 3.5-minute Pixar short *The Present* (2014). The local working set used in our pipeline development is 184 subjects at 100 Hz biosignal data format (BDF), a Nyquist-aware downsample of the original 500 Hz. The 56 stimulus-side shots carry per-shot `onset`, `duration`, `LLR`, `has_boy`, and `has_puppy` annotations; after invalidating 3 high-drift rows (`match_diff_s > 1.0 s`), 49 rows are trusted, yielding 20 boy-only and 15 puppy-only shots for the mutually exclusive single-agent contrast. The pipeline runs BIDS import, 1 Hz high-pass filter, conditional cleanline (gated by Nyquist), `clean_rawdata` channel rejection, AMICA, ICLabel (brain threshold 0.69), dipfit5, and `std_precomp` ERSP. The anchor case rests on Petroni and colleagues 2018 [Petroni2018TheVO] as the external precedent and this partly-validated pipeline as the internal feasibility proof.
+The Healthy Brain Network EEG (HBN-EEG) Release 3 cohort recruits 5- to 21-year-old participants in a research-grade developmental setting and records 128-channel HydroCel Geodesic Sensor Net during passive viewing of the 3.5-minute Pixar short *The Present* (2014). The local working set used in our pipeline development is 184 subjects at 100 Hz biosignal data format (BDF), a Nyquist-aware downsample of the original 500 Hz. The 56 stimulus-side shots carry per-shot `onset`, `duration`, `LLR`, `has_boy`, and `has_puppy` annotations; after invalidating 3 high-drift rows (`match_diff_s > 1.0 s`), 49 rows are trusted, yielding 20 boy-only and 15 puppy-only shots for the mutually exclusive single-agent contrast. The pipeline runs BIDS import, 1 Hz high-pass filter, conditional cleanline (gated by Nyquist), `clean_rawdata` channel rejection, AMICA, ICLabel (brain threshold 0.69), dipfit5, and `std_precomp` ERSP. The anchor case rests on Petroni and colleagues 2018 [14] as the external precedent and this partly-validated pipeline as the internal feasibility proof.
 
 ## Trends Box: recent developments enabling the per-shot framing
 
 Recent advances make the per-shot framing newly tractable.
 
-- **Whole-brain shot-cut response in adult intracranial EEG.** Nentwich and colleagues 2023 recorded 6328 contacts in 23 patients across 43.6 minutes of film clips and regressed responses against optical-flow magnitude, saccade onsets, and film-cut onsets simultaneously, finding whole-brain saccade- and cut-locked responses with motion concentrated in occipitoparietal cortex [Nentwich2023SemanticNM].
-- **Hidden Markov model recovery of event states from fMRI.** Baldassano and colleagues 2017 recovered a hierarchy of event boundaries from Sherlock-movie fMRI, with hippocampal boundary signals predicting subsequent free recall [baldassano2017event].
-- **Cross-sectional developmental EEG-ISC.** Petroni and colleagues 2018 reported whole-clip EEG-ISC reliability across ages 6 to 44 during passive viewing of six naturalistic videos, peaking in childhood [Petroni2018TheVO].
-- **Silent abstract animation for MRI compliance.** Vanderwal and colleagues 2015 built Inscapes, used by HBN itself, with reliable network-level activity [Vanderwal2015InscapesAM].
-- **Multi-level cinematic-feature regression.** Kauttonen and colleagues 2015 regressed multi-level cinematic features against fMRI ISC, supplying a methodological template for shot-level feature annotation [Kauttonen2015OptimizingMF].
-- **Open developmental EEG releases.** HBN-EEG and Studyforrest [Hanke2014AH7] make large-N developmental datasets available for naturalistic-stimulus analysis at unprecedented scale.
+- **Whole-brain shot-cut response in adult intracranial EEG.** Nentwich and colleagues 2023 recorded 6328 contacts in 23 patients across 43.6 minutes of film clips and regressed responses against optical-flow magnitude, saccade onsets, and film-cut onsets simultaneously, finding whole-brain saccade- and cut-locked responses with motion concentrated in occipitoparietal cortex [7].
+- **Hidden Markov model recovery of event states from fMRI.** Baldassano and colleagues 2017 recovered a hierarchy of event boundaries from Sherlock-movie fMRI, with hippocampal boundary signals predicting subsequent free recall [11].
+- **Cross-sectional developmental EEG-ISC.** Petroni and colleagues 2018 reported whole-clip EEG-ISC reliability across ages 6 to 44 during passive viewing of six naturalistic videos, peaking in childhood [14].
+- **Silent abstract animation for MRI compliance.** Vanderwal and colleagues 2015 built Inscapes, used by HBN itself, with reliable network-level activity [13].
+- **Multi-level cinematic-feature regression.** Kauttonen and colleagues 2015 regressed multi-level cinematic features against fMRI ISC, supplying a methodological template for shot-level feature annotation [82].
+- **Open developmental EEG releases.** HBN-EEG and Studyforrest [58] make large-N developmental datasets available for naturalistic-stimulus analysis at unprecedented scale.
 
 ## Outstanding Questions Box
 
@@ -179,6 +179,6 @@ Recent advances make the per-shot framing newly tractable.
 
 ## References
 
-References are managed in `refs.bib` (94 entries after F3 stray-key removal). Cell Press numbered style is applied at Phase 5 assembly. Body cite-card slugs are bracketed in this draft (e.g., `[Petroni2018TheVO]`) and resolved against `refs.bib` at compile time.
+The numbered reference list is in `references.md` (82 cited entries, ordered by first appearance in the body). Underlying BibTeX is in `refs.bib` (94 entries; the 12 uncited entries are kept for the supplementary materials and not numbered here). Cell Press house style applied at compile time.
 
-Note on the alpha-band and emotion citation: the body text cites Codispoti and colleagues (2023), Psychophysiology, DOI 10.1111/psyp.14438; the internal corpus slug `schubring-schupp-2023-alpha-emotion` is retained inside the research collection for cross-reference stability and does not appear in published prose.
+Note on the alpha-band and emotion citation: reference 70 cites Codispoti and colleagues (2023), Psychophysiology, DOI 10.1111/psyp.14438. The internal corpus slug `schubring-schupp-2023-alpha-emotion` is retained inside the research collection for cross-reference stability and does not appear in published prose.
diff --git a/manuscript/narrative-review/references.md b/manuscript/narrative-review/references.md
new file mode 100644
index 0000000..b502111
--- /dev/null
+++ b/manuscript/narrative-review/references.md
@@ -0,0 +1,86 @@
+# References
+
+Numbered reference list for `manuscript.md`. 82 entries, ordered by first appearance in the body. Cell Press house style approximation.
+
+1. Hasson et al. (2004). Intersubject Synchronization of Cortical Activity During Natural Vision. *Science* 303, 1634 - 1640.
+2. Dmochowski et al. (2012). Correlated Components of Ongoing EEG Point to Emotionally Laden Attention – A Possible Marker of Engagement?. *Frontiers in Human Neuroscience*. https://doi.org/10.3389/fnhum.2012.00112
+3. Ki et al. (2016). Attention Strongly Modulates Reliability of Neural Responses to Naturalistic Narrative Stimuli. *The Journal of Neuroscience* 36, 3092 - 3101.
+4. Cohen and Parra (2016). Memorable Audiovisual Narratives Synchronize Sensory and Supramodal Neural Responses. *eNeuro* 3.
+5. Dmochowski et al. (2014). Audience preferences are predicted by temporal reliability of neural processing. *Nature Communications* 5.
+6. Madsen and Parra (2022). Cognitive processing of a common stimulus synchronizes brains, hearts, and eyes. *PNAS Nexus* 1.
+7. Nentwich et al. (2023). Semantic novelty modulates neural responses to visual change across the human brain. *Nature Communications* 14.
+8. Ben-Yakov and Henson (2018). The Hippocampal Film Editor: Sensitivity and Specificity to Event Boundaries in Continuous Experience. *The Journal of Neuroscience* 38, 10057 - 10068.
+9. Zacks et al. (2007). Event perception: a mind-brain perspective. *Psychological Bulletin* 133, 273--293. https://doi.org/10.1037/0033-2909.133.2.273
+10. Speer et al. (2007). Human brain activity time-locked to narrative event boundaries. *Psychological Science* 18, 449--455. https://doi.org/10.1111/j.1467-9280.2007.01920.x
+11. Baldassano et al. (2017). Discovering event structure in continuous narrative perception and memory. *Neuron* 95, 709--721. https://doi.org/10.1016/j.neuron.2017.06.041
+12. Richardson et al. (2018). Development of the social brain from age three to twelve years. *Nature Communications* 9.
+13. Vanderwal et al. (2015). Inscapes: A movie paradigm to improve compliance in functional magnetic resonance imaging. *NeuroImage* 122, 222-32.
+14. Petroni et al. (2018). The Variability of Neural Responses to Naturalistic Videos Change with Age and Sex. *eNeuro* 5.
+15. Lankinen et al. (2014). Intersubject consistency of cortical MEG signals during movie viewing. *NeuroImage* 92, 217-24.
+16. Adelson and Bergen (1985). Spatiotemporal energy models for the perception of motion.. *Journal of the Optical Society of America. A, Optics and image science* 2 2, 284-99.
+17. Carandini and Heeger (2011). Normalization as a canonical neural computation. *Nature Reviews Neuroscience* 13, 51-62.
+18. Nishimoto et al. (2011). Reconstructing visual experiences from brain activity evoked by natural movies.. *Current biology : CB* 21 19, 1641-6.
+19. Hari et al. (1998). Activation of human primary motor cortex during action observation: a neuromagnetic study. *Proceedings of the National Academy of Sciences* 95, 15061--15065. https://doi.org/10.1073/pnas.95.25.15061
+20. Pineda (2005). The functional significance of mu rhythms: translating "seeing" and "hearing" into "doing". *Brain Research Reviews* 50, 57--68. https://doi.org/10.1016/j.brainresrev.2005.04.005
+21. Goldstein et al. (2022). Shared computational principles for language processing in humans and deep language models. *Nature Neuroscience* 25, 369 - 380.
+22. Caucheteux and King (2022). Brains and algorithms partially converge in natural language processing. *Communications Biology* 5.
+23. Hubel and Wiesel (1962). Receptive fields, binocular interaction and functional architecture in the cat's visual cortex. *The Journal of Physiology* 160.
+24. Bell and Sejnowski (1997). The `independent components''of natural scenes are edge filters. *Neural Information Processing Systems*.
+25. Simoncelli and Olshausen (2001). Natural image statistics and neural representation.. *Annual review of neuroscience* 24, 1193-216.
+26. Born and Bradley (2005). Structure and function of visual area MT.. *Annual review of neuroscience* 28, 157-89.
+27. Bartels et al. (2008). Natural vision reveals regional specialization to local motion and to contrast-invariant, global flow in the human brain.. *Cerebral cortex* 18 3, 705-17.
+28. Tobimatsu and Celesia (2006). Studies of human visual pathophysiology with visual evoked potentials.. *Clinical neurophysiology : official journal of the International Federation of Clinical Neurophysiology* 117 7, 1414-33.
+29. Kaneshiro et al. (2021). Inter-Subject EEG Correlation Reflects Time-Varying Engagement with Natural Music. *bioRxiv*.
+30. Dimigen and Ehinger (2021). Regression-based analysis of combined EEG and eye-tracking data: Theory and applications. *Journal of Vision* 21.
+31. Dorr et al. (2010). Variability of eye movements when viewing dynamic natural scenes.. *Journal of vision* 10 10, 28.
+32. Crosse et al. (2016). The Multivariate Temporal Response Function (mTRF) Toolbox: A MATLAB Toolbox for Relating Neural Signals to Continuous Stimuli. *Frontiers in Human Neuroscience* 10.
+33. Oberman et al. (2007). The human mirror neuron system: a link between action observation and social skills. *Social Cognitive and Affective Neuroscience* 2, 62--66. https://doi.org/10.1093/scan/nsl022
+34. Saygin (2007). Superior temporal and premotor brain areas necessary for biological motion perception. *Brain* 130, 2452--2461. https://doi.org/10.1093/brain/awm162
+35. Johansson (1973). Visual perception of biological motion and a model for its analysis. *Perception \& Psychophysics* 14, 201--211. https://doi.org/10.3758/BF03212378
+36. Kilner et al. (2007). Predictive coding: an account of the mirror neuron system. *Cognitive Processing* 8, 159--166. https://doi.org/10.1007/s10339-007-0170-2
+37. Rizzolatti and Craighero (2004). The mirror-neuron system. *Annual Review of Neuroscience* 27, 169--192. https://doi.org/10.1146/annurev.neuro.27.070203.144230
+38. Iacoboni (2009). Imitation, empathy, and mirror neurons. *Annual Review of Psychology* 60, 653--670. https://doi.org/10.1146/annurev.psych.60.110707.163604
+39. Castelli et al. (2000). Movement and mind: a functional imaging study of perception and interpretation of complex intentional movement patterns. *NeuroImage* 12, 314--325. https://doi.org/10.1006/nimg.2000.0612
+40. Hasson et al. (2010). Reliability of cortical activity during natural stimulation. *Trends in Cognitive Sciences* 14, 40--48. https://doi.org/10.1016/j.tics.2009.10.011
+41. Lerner et al. (2011). Topographic mapping of a hierarchy of temporal receptive windows using a narrated story. *Journal of Neuroscience* 31, 2906--2915. https://doi.org/10.1523/JNEUROSCI.3684-10.2011
+42. Chen et al. (2017). Shared memories reveal shared structure in neural activity across individuals. *Nature Neuroscience* 20, 115--125. https://doi.org/10.1038/nn.4450
+43. Magliano and Zacks (2011). The Impact of Continuity Editing in Narrative Film on Event Segmentation. *Cognitive science* 35 8, 1489-517.
+44. Sliwa and Freiwald (2017). A dedicated network for social interaction processing in the primate brain. *Science* 356, 745--749. https://doi.org/10.1126/science.aam6383
+45. Heilbron et al. (2020). A hierarchy of linguistic predictions during natural language comprehension. *Proceedings of the National Academy of Sciences of the United States of America* 119.
+46. Caucheteux et al. (2023). Evidence of a predictive coding hierarchy in the human brain listening to speech. *Nature Human Behaviour* 7, 430 - 441.
+47. Antonello et al. (2023). Scaling laws for language encoding models in fMRI. *Advances in neural information processing systems* 36, 21895-21907.
+48. Schrimpf et al. (2021). The neural architecture of language: Integrative modeling converges on predictive processing. *Proceedings of the National Academy of Sciences*. https://doi.org/10.1073/pnas.2105646118
+49. Toneva and Wehbe (2019). Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain). *arXiv (Cornell University)*. https://doi.org/10.48550/arxiv.1905.11833
+50. Huth et al. (2016). Natural speech reveals the semantic maps that tile human cerebral cortex. *Nature* 532, 453 - 458.
+51. Nelson et al. (2017). Neurophysiological dynamics of phrase-structure building during sentence processing. *Proceedings of the National Academy of Sciences* 114, E3669 - E3678.
+52. Kutas and Federmeier (2011). Thirty years and counting: finding meaning in the N400 component of the event-related brain potential (ERP).. *Annual review of psychology* 62, 621-47.
+53. DeLong et al. (2005). Probabilistic word pre-activation during language comprehension inferred from electrical brain activity. *Nature Neuroscience* 8, 1117-1121.
+54. Lipkin et al. (2022). Probabilistic atlas for the language network based on precision fMRI data from >800 individuals. *Scientific Data* 9.
+55. Castelli et al. (2000). Movement and mind: a functional imaging study of perception and interpretation of complex intentional movement patterns.. *NeuroImage* 12 3, 314-25.
+56. Castelli et al. (2002). Autism, Asperger syndrome and brain mechanisms for the attribution of mental states to animated shapes.. *Brain : a journal of neurology* 125 Pt 8, 1839-49.
+57. Naci et al. (2014). A common neural code for similar conscious experiences in different individuals. *Proceedings of the National Academy of Sciences* 111, 14277 - 14282.
+58. Hanke et al. (2014). A high-resolution 7-Tesla fMRI dataset from complex natural stimulation with an audio movie. *Scientific Data* 1.
+59. Schroeder and Lakatos (2009). Low-frequency neuronal oscillations as instruments of sensory selection.. *Trends in neurosciences* 32 1, 9-18.
+60. Senkowski et al. (2008). Crossmodal binding through neural coherence: implications for multisensory processing.. *Trends in neurosciences* 31 8, 401-9.
+61. Buckner et al. (2008). The Brain's Default Network. *Annals of the New York Academy of Sciences* 1124.
+62. Simony et al. (2016). Dynamic reconfiguration of the default mode network during narrative comprehension. *Nature Communications* 7.
+63. Yeshurun et al. (2017). Same Story, Different Story. *Psychological Science* 28, 307 - 319.
+64. Mar (2011). The neural bases of social cognition and story comprehension.. *Annual review of psychology* 62, 103-34.
+65. Tamir et al. (2016). Reading fiction and reading minds: the role of simulation in the default network.. *Social cognitive and affective neuroscience* 11 2, 215-24.
+66. Kragel et al. (2018). Emotion schemas are embedded in the human visual system. *Science Advances* 5.
+67. Cowen and Keltner (2017). Self-report captures 27 distinct categories of emotion bridged by continuous gradients. *Proceedings of the National Academy of Sciences* 114, E7900 - E7909.
+68. Lindquist et al. (2012). The brain basis of emotion: A meta-analytic review. *Behavioral and Brain Sciences* 35, 121 - 143.
+69. Wager et al. (2013). An fMRI-based neurologic signature of physical pain.. *The New England journal of medicine* 368 15, 1388-97.
+70. Codispoti et al. (2023). Alpha-band oscillations and emotion: A review of studies on picture perception.. *Psychophysiology*, e14438.
+71. Stoeckel et al. (2014). Patterns of Brain Activation when Mothers View Their Own Child and Dog: An fMRI Study. *PLoS ONE* 9.
+72. Glocker et al. (2009). Baby schema modulates the brain reward system in nulliparous women. *Proceedings of the National Academy of Sciences* 106, 9115 - 9119.
+73. Borgi et al. (2014). Baby schema in human and animal faces induces cuteness perception and gaze allocation in children. *Frontiers in Psychology* 5.
+74. Davidson (2000). Affective style, psychopathology, and resilience: brain mechanisms and plasticity.. *The American psychologist* 55 11, 1196-214.
+75. Coan and Allen (2004). Frontal EEG asymmetry as a moderator and mediator of emotion.. *Biological psychology* 67 1-2, 7-49.
+76. Reznik and Allen (2018). Frontal asymmetry as a mediator and moderator of emotion: An updated review.. *Psychophysiology* 55 1.
+77. Singer et al. (2004). Empathy for Pain Involves the Affective but not Sensory Components of Pain. *Science* 303, 1157 - 1162.
+78. Zaki and Ochsner (2012). The neuroscience of empathy: progress, pitfalls and promise. *Nature Neuroscience* 15, 675-680.
+79. Nummenmaa et al. (2012). Emotions promote social interaction by synchronizing brain activity across individuals. *Proceedings of the National Academy of Sciences* 109, 9599 - 9604.
+80. Klin et al. (2009). Two-year-olds with autism orient to nonsocial contingencies rather than biological motion. *Nature* 459, 257--261. https://doi.org/10.1038/nature07868
+81. Klin et al. (2002). Visual fixation patterns during viewing of naturalistic social situations as predictors of social competence in individuals with autism. *Archives of General Psychiatry* 59, 809--816. https://doi.org/10.1001/archpsyc.59.9.809
+82. Kauttonen et al. (2015). Optimizing methods for linking cinematic features to fMRI data. *NeuroImage* 110, 136-48.