Skip to content

Conversation

@lehendo
Copy link
Contributor

@lehendo lehendo commented Jan 15, 2026

Author: Arjun Chatterjee (arjunc4@illinois.edu)
Contribution Type: Bug Fix

Fix TUEV dataset metadata permissions for shared cluster storage.

Modified prepare_metadata() to write metadata CSVs to ~/.cache/pyhealth/tuev/
as fallback when data directory is not writable. Updated init() to check
both shared and cache locations for CSVs. This resolves PermissionError when
users lack write access to shared data directories on clusters.

Tested on SunLab cluster with both tuev_conventional_conformal.py and
tuev_covariate_shift_conformal.py - processed 370 patients (113,353 samples)
successfully without permission errors.

@lehendo
Copy link
Contributor Author

lehendo commented Jan 15, 2026

oops, I accidentally used my other unresolved pr branch (#781) for this pr as well. hence, all the extraneous files and commits on this pr. to see the exact code for this pr only just look at this commit: "Fix TUEV metadata to use cache directory for permissions"

@lehendo lehendo closed this Jan 15, 2026
@lehendo
Copy link
Contributor Author

lehendo commented Jan 15, 2026

actually to make it easier I'll just create a new one so you can judge the two pr's separately with no conflict.

@lehendo lehendo deleted the fix/tuev-metadata-cache branch January 15, 2026 04:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant