docs: add dataset instructions and generation guide by Shrishagk · Pull Request #140 · ML4SCI/DeepLense

Shrishagk · 2026-02-17T14:37:18Z

Summary

This PR improves the repository documentation by explaining how to obtain the datasets required to run the notebooks in
DeepLense_Gravitational_Lens_Classification_Transformers_Dhruv_Srivastava project

Currently the notebooks fail because the datasets and folder structure are not described, and new users cannot determine whether the issue is code-related or missing data.

Changes

Added dataset format explanations
Added expected directory structures for each experiment
Added instructions to generate datasets using DeepLenseSim
Clarified dataloader generation workflow

Why this is needed

Without these instructions:

Notebooks throw FileNotFoundError
Users cannot reproduce results
It is unclear that datasets must be generated externally

This PR makes the project reproducible for new contributors.

Type of Change

Documentation improvement (no code changes)

Testing

Verified that a new user can now understand:

Where the data comes from
How to generate it
Where to place it
Why dataloader files are missing

docs: add dataset instructions and generation guide

0e4deb2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

docs: add dataset instructions and generation guide#140

docs: add dataset instructions and generation guide#140
Shrishagk wants to merge 1 commit intoML4SCI:mainfrom
Shrishagk:docs/dataset-readme

Shrishagk commented Feb 17, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

Shrishagk commented Feb 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Why this is needed

Type of Change

Testing

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Shrishagk commented Feb 17, 2026 •

edited

Loading