Refactor Classification Model_I and Model_II dataset loading for portability and deterministic class mapping by Panchadip-128 · Pull Request #139 · ML4SCI/DeepLense

Panchadip-128 · 2026-02-13T21:42:54Z

Summary

This PR refactors the dataset loading logic in the Classification pipeline (Model_I and Model_II) to improve portability, reproducibility, and contributor usability.

Motivation

Previously:

Dataset paths depended on user-specific directory structures.
glob usage was loosely defined.
Class index mapping relied on dictionary iteration order.
No validation existed for empty dataset directories.

These issues reduced portability and made onboarding difficult for new contributors.

Changes

Standardized dataset structure to:
root_dir/class_name/*.npy
Replaced ambiguous glob usage with:
os.path.join(root_dir, "", ".npy")
Added validation to raise a clear error if no .npy files are found.
Made class mapping deterministic using sorted class names to ensure consistent label indices across runs.
Improved path handling using os.path utilities for cross-platform compatibility.

Impact

No changes to model architecture
No changes to training logic
No changes to transform behavior
No changes to output format

This is purely a structural and usability improvement.

Expected Dataset Structure

Example:

data/
Model_I/
axion/
cdm/
no_sub/
Model_I_test/
axion/
cdm/
no_sub/

…ce portable directory structure

…ndling

Panchadip-128 added 3 commits February 14, 2026 03:10

Refactor Model_I dataset handling to remove hardcoded paths and enfor…

de4960e

…ce portable directory structure

Update dataloader to use portable dataset root handling

7758c2b

Update dataloader for Model_I and Model_II to improve dataset path ha…

35d7a87

…ndling

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Refactor Classification Model_I and Model_II dataset loading for portability and deterministic class mapping#139

Refactor Classification Model_I and Model_II dataset loading for portability and deterministic class mapping#139
Panchadip-128 wants to merge 3 commits intoML4SCI:mainfrom
Panchadip-128:refactor-classification-dataloader

Panchadip-128 commented Feb 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

Panchadip-128 commented Feb 13, 2026

Summary

Motivation

Changes

Impact

Expected Dataset Structure

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant