Phase 2 (issue #17) collection completed; 25 of 98 cards have an archived PDF in the repo (CC-BY, public-domain, or open archive). The remaining 73 cards have full card.md, source.md (abstract / canonical metadata), and meta.json but no source.pdf because the publisher requires authenticated access and no publisher token was configured.
This tracking issue lists those papers with DOIs so we can retrieve them via institutional access in a single batch and add them under their slug folders.
Workflow per paper:
- Open the DOI; download the PDF via Caltech / institutional proxy.
- Save as
research/collection/<strand>/<slug>/source.pdf.
- Update
meta.json: pdf_path: "source.pdf", pdf_status: "archived", fill pdf_sha256 (use shasum -a 256 source.pdf), set redistribution_ok: false if license still forbids redistribution.
- Update
card.md frontmatter pdf_status and pdf_path to match.
- Run Mistral conversion or pdftotext on the PDF and overwrite
source.md with the cleaner extraction; bump md_quality.
Do not commit source.pdf for papers with redistribution_ok: false. Keep the PDF locally in the slug folder, but add the slug pattern to a local .gitignore if needed.
Part of epic #15 (lit-review). Closes nothing automatically; close when all rows handled or when the user decides further attempts are not worth it.
High-relevance shortlist (start here)
These are the most load-bearing for Phase 3 synthesis and Phase 4 direction paper:
| # |
Strand |
Slug |
DOI |
| 1 |
psychophysics |
bartels-zeki-2008-natural-vision-mt |
10.1093/cercor/bhm107 |
| 2 |
psychophysics |
nishimoto-2011-movie-reconstruction |
10.1016/j.cub.2011.08.031 |
| 3 |
psychophysics |
hasson-2004-isc-natural-vision |
10.1126/science.1089506 |
| 4 |
psychophysics |
adelson-bergen-1985-spatiotemporal-energy |
10.1364/JOSAA.2.000284 |
| 5 |
action |
saygin-2007-sts-lesions |
10.1093/brain/awm162 |
| 6 |
action |
rizzolatti-2004-mirror-neuron-system |
10.1146/annurev.neuro.27.070203.144230 |
| 7 |
action |
kilner-2007-predictive-coding-mirror |
10.1007/s10339-007-0170-2 |
| 8 |
action |
zacks-2007-event-segmentation |
10.1037/0033-2909.133.2.273 |
| 9 |
action |
speer-2007-narrative-event-boundaries |
10.1111/j.1467-9280.2007.01920.x |
| 10 |
action |
baldassano-2017-event-boundaries |
10.1016/j.neuron.2017.06.041 |
| 11 |
action |
lerner-2011-temporal-receptive-windows |
10.1523/JNEUROSCI.3684-10.2011 |
| 12 |
action |
chen-2017-shared-memories |
10.1038/nn.4450 |
| 13 |
language |
hasson-2004-intersubject-synchronization |
10.1126/science.1089506 |
| 14 |
language |
lankinen-2014-meg-movie-consistency |
10.1016/j.neuroimage.2014.02.004 |
| 15 |
language |
magliano-2011-continuity-editing |
10.1111/j.1551-6709.2011.01202.x |
| 16 |
language |
castelli-2000-animations-mentalising |
10.1006/nimg.2000.0612 |
| 17 |
language |
naci-2014-suspenseful-movie |
10.1073/pnas.1407007111 |
| 18 |
language |
vanderwal-2015-inscapes |
10.1016/j.neuroimage.2015.07.069 |
| 19 |
language |
benyakov-2018-hippocampal-film-editor |
10.1523/JNEUROSCI.0524-18.2018 |
| 20 |
emotion |
saarimaki-2016-discrete-emotions |
10.1093/cercor/bhv086 |
| 21 |
emotion |
nummenmaa-2012-emotion-synchrony |
10.1073/pnas.1206095109 |
| 22 |
emotion |
lindquist-2012-emotion-meta-analysis |
10.1017/S0140525X11000446 |
| 23 |
emotion |
glocker-2009-baby-schema |
10.1073/pnas.0811620106 |
| 24 |
emotion |
schubring-schupp-2023-alpha-emotion |
10.1111/psyp.14438 |
Note: rows 3 and 13 are the same paper (Hasson et al. 2004 Science) cited under two strands; one PDF download covers both. Place under research/collection/psychophysics/hasson-2004-isc-natural-vision/source.pdf and add a relative symlink (or duplicate) for the language slug.
Full retrieval queue (medium-relevance and others)
Psychophysics (10 additional)
| # |
Slug |
DOI |
| 25 |
dimigen-2011-frp-natural-reading |
10.1037/a0023885 |
| 26 |
carandini-heeger-2011-normalization |
10.1038/nrn3136 |
| 27 |
simoncelli-olshausen-2001-natural-image-stats |
10.1146/annurev.neuro.24.1.1193 |
| 28 |
born-bradley-2005-mt-mst |
10.1146/annurev.neuro.26.041002.131052 |
| 29 |
dorr-2010-eye-movements-natural-movies |
10.1167/10.10.28 |
| 30 |
tobimatsu-celesia-2006-vep-pathophysiology |
10.1016/j.clinph.2006.01.004 |
| 31 |
riesenhuber-poggio-1999-hmax |
10.1038/14819 |
| 32 |
hubel-wiesel-1962-receptive-fields-cat-v1 |
10.1113/jphysiol.1962.sp006837 |
| 33 |
kaneshiro-2021-music-eeg-isc |
10.1101/2021.04.14.439913 |
| 34 |
lalor-foxe-2010-natural-speech-trf |
10.1111/j.1460-9568.2009.07055.x |
Action (9 additional)
| # |
Slug |
DOI |
| 35 |
johansson-1973-biological-motion |
10.3758/BF03212378 |
| 36 |
klin-2009-biological-motion-autism |
10.1038/nature07868 |
| 37 |
iacoboni-2009-mirror-empathy |
10.1146/annurev.psych.60.110707.163604 |
| 38 |
pineda-2005-mu-rhythm-mirror |
10.1016/j.brainresrev.2005.04.005 |
| 39 |
oberman-2006-mu-mirror-autism |
10.1093/scan/nsl022 |
| 40 |
sliwa-2017-macaque-social-network |
10.1126/science.aam6383 |
| 41 |
castelli-2000-heider-simmel |
10.1006/nimg.2000.0612 |
| 42 |
klin-2002-visual-fixation-autism |
10.1001/archpsyc.59.9.809 |
| 43 |
hasson-2010-natural-stimulation |
10.1016/j.tics.2009.10.011 |
Emotion (16 additional)
| # |
Slug |
DOI |
| 44 |
kragel-2019-emonet-visual |
10.1126/sciadv.aaw4358 |
| 45 |
cowen-keltner-2017-27-emotions |
10.1073/pnas.1702247114 |
| 46 |
saxe-kanwisher-2003-tpj-tom |
10.1016/S1053-8119(03)00230-1 |
| 47 |
ledoux-2010-amygdala-many-roads |
10.1038/nrn2920 |
| 48 |
sergerie-2008-amygdala-meta-analysis |
10.1016/j.neubiorev.2007.12.002 |
| 49 |
etkin-2011-acc-mpfc-emotion |
10.1016/j.tics.2010.11.004 |
| 50 |
wager-2013-neurologic-pain-signature |
10.1056/NEJMoa1204471 |
| 51 |
schmalzle-2020-coupled-brains |
10.1027/1864-1105/a000271 |
| 52 |
kauttonen-2015-cinematic-fmri |
10.1016/j.neuroimage.2015.01.063 |
| 53 |
davidson-2000-affective-style |
10.1037/0003-066X.55.11.1196 |
| 54 |
coan-allen-2004-frontal-asymmetry |
10.1016/j.biopsycho.2004.03.002 |
| 55 |
reznik-allen-2018-frontal-asymmetry |
10.1111/psyp.12965 |
| 56 |
mar-2011-narrative-social-cognition |
10.1146/annurev-psych-120709-145406 |
| 57 |
tamir-2015-fiction-default-network |
10.1093/scan/nsv114 |
| 58 |
singer-2004-empathy-pain |
10.1126/science.1093535 |
| 59 |
zaki-ochsner-2012-empathy-neural |
10.1038/nn.3085 |
Notes for batch retrieval
- Most rows resolve via Caltech library proxy; those that don't (Annual Reviews, OUP older issues) may need a direct request.
- For PNAS pre-2017 (rows 21, 22, 45) the paper is freely readable on pnas.org but redistribution is restricted; download for local Phase 3 reading and keep
redistribution_ok: false.
kragel-2019-emonet-visual (row 44) is CC-BY-NC, available open access; not committed because of the project's no-NC policy. If we revise the policy, this one can be archived directly.
kaneshiro-2021-music-eeg-isc (row 33) is bioRxiv with default CC-BY-NC-ND; same NC policy applies.
hubel-wiesel-1962 (row 32) is past 60 years old; copyright status varies by jurisdiction. Treat as restricted unless verified.
Recap by strand
| Strand |
Cards total |
PDFs archived |
Manual queue |
| psychophysics |
26 |
12 |
14 |
| action |
18 |
1 |
17 |
| language |
29 |
8 |
21 |
| emotion |
25 |
4 |
21 |
| Total |
98 |
25 |
73 (rows 3 and 13 are the same paper, true unique = 72) |
Phase 2 (issue #17) collection completed; 25 of 98 cards have an archived PDF in the repo (CC-BY, public-domain, or open archive). The remaining 73 cards have full
card.md,source.md(abstract / canonical metadata), andmeta.jsonbut nosource.pdfbecause the publisher requires authenticated access and no publisher token was configured.This tracking issue lists those papers with DOIs so we can retrieve them via institutional access in a single batch and add them under their slug folders.
Workflow per paper:
research/collection/<strand>/<slug>/source.pdf.meta.json:pdf_path: "source.pdf",pdf_status: "archived", fillpdf_sha256(useshasum -a 256 source.pdf), setredistribution_ok: falseif license still forbids redistribution.card.mdfrontmatterpdf_statusandpdf_pathto match.source.mdwith the cleaner extraction; bumpmd_quality.Do not commit
source.pdffor papers withredistribution_ok: false. Keep the PDF locally in the slug folder, but add the slug pattern to a local.gitignoreif needed.Part of epic #15 (lit-review). Closes nothing automatically; close when all rows handled or when the user decides further attempts are not worth it.
High-relevance shortlist (start here)
These are the most load-bearing for Phase 3 synthesis and Phase 4 direction paper:
Note: rows 3 and 13 are the same paper (Hasson et al. 2004 Science) cited under two strands; one PDF download covers both. Place under
research/collection/psychophysics/hasson-2004-isc-natural-vision/source.pdfand add a relative symlink (or duplicate) for the language slug.Full retrieval queue (medium-relevance and others)
Psychophysics (10 additional)
Action (9 additional)
Emotion (16 additional)
Notes for batch retrieval
redistribution_ok: false.kragel-2019-emonet-visual(row 44) is CC-BY-NC, available open access; not committed because of the project's no-NC policy. If we revise the policy, this one can be archived directly.kaneshiro-2021-music-eeg-isc(row 33) is bioRxiv with default CC-BY-NC-ND; same NC policy applies.hubel-wiesel-1962(row 32) is past 60 years old; copyright status varies by jurisdiction. Treat as restricted unless verified.Recap by strand