Update the benchmark to load datasets from multiple S3 buckets when None is specified by R-Palazzo · Pull Request #607 · sdv-dev/SDGym

R-Palazzo · 2026-05-25T19:49:55Z

Resolve #604
CU-86b9zwpcu

This PR includes:

Relying on SDV download_demo() to get sdv_dataset. This works for public datasets or for users with SDV-Enterprise installed. Otherwise, because on SDV there is some validation to allow access to the public bucket only, I reused the SDV logic here, so it works if the S3 client is created with valid credentials (from input keys or env variables).
Because we rely on download_demo(), we no longer save the dataset locally.
Improvement in the _generate_job_arg_list: Now we're only passing dataset_info inside it. The dataset_info includes everything necessary to then download the data and metadata during execution.
Added an integration test with a private dataset. Also tested it on AWS here and GCP here.

sdv-team · 2026-05-25T19:50:00Z

Task linked: CU-86b9zwpcu SDGym - Update the benchmark to load datasets from multiple S3 buckets when None is specified #604

codecov · 2026-05-25T19:59:07Z

Codecov Report

❌ Patch coverage is 94.89796% with 5 lines in your changes missing coverage. Please review.
✅ Project coverage is 85.89%. Comparing base (d6dc458) to head (b97fa0d).

Files with missing lines	Patch %	Lines
sdgym/benchmark.py	92.45%	4 Missing ⚠️
sdgym/datasets.py	97.77%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #607      +/-   ##
==========================================
+ Coverage   85.67%   85.89%   +0.21%     
==========================================
  Files          40       40              
  Lines        3679     3750      +71     
==========================================
+ Hits         3152     3221      +69     
- Misses        527      529       +2

Flag	Coverage Δ
integration	`43.86% <76.53%> (+0.26%)`	⬆️
unit	`81.70% <94.89%> (+0.21%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

amontanez24 · 2026-05-26T16:51:49Z


+def _get_dataset_bucket_mapping(modality, buckets, s3_client, skip_inaccessible=False):
+    """Map SDV demo dataset names to the bucket they should be loaded from."""
+    dataset_buckets = {}


nitpick: can we call dataset_to_bucket

Yes, done in c349d70

R-Palazzo · 2026-05-28T09:54:00Z

 """Main SDGym benchmarking module."""

 import functools
-import gzip


Since the dataset is no longer included in the job_arg_list, we don't need to compress it. Also, it now contains one job by default, and from my tests, it's roughly 1KB.

R-Palazzo requested review from amontanez24 and frances-h May 25, 2026 19:49

R-Palazzo self-assigned this May 25, 2026

R-Palazzo requested a review from a team as a code owner May 25, 2026 19:49

R-Palazzo removed the request for review from a team May 25, 2026 19:50

R-Palazzo commented May 26, 2026

View reviewed changes

Comment thread pyproject.toml

R-Palazzo force-pushed the issue-604-2-private-bucket branch 2 times, most recently from 95be93a to 9a91033 Compare May 27, 2026 14:53

R-Palazzo mentioned this pull request May 27, 2026

Update the benchmark to launch one instance per (dataset, synthesizer) pair #611

Open

amontanez24 reviewed May 27, 2026

View reviewed changes

R-Palazzo added 11 commits May 28, 2026 08:58

load dataset only once

828570f

use downlad_demo from sdv

cba80aa

update _resolve_dataset

3803c19

fix lint

d040382

cleaning

2118c9a

add validation

b3481aa

move dataset loading to execution

a62376b

fix tests

f0eedea

rename _get_dataset_bucket_mapping -> dataset_to_bucket

c349d70

add tests

2884e39

stop compressing job_arg_list

5555441

R-Palazzo force-pushed the issue-604-2-private-bucket branch from 725c22c to 5555441 Compare May 28, 2026 09:27

undo pip install command

e3c2e90

R-Palazzo commented May 28, 2026

View reviewed changes

R-Palazzo requested a review from amontanez24 May 28, 2026 09:54

frances-h approved these changes May 28, 2026

View reviewed changes

Comment thread sdgym/benchmark.py Outdated

remove dataset_name from JobArgs

b97fa0d

R-Palazzo requested a review from sarahmish May 29, 2026 09:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update the benchmark to load datasets from multiple S3 buckets when None is specified#607

Update the benchmark to load datasets from multiple S3 buckets when None is specified#607
R-Palazzo wants to merge 13 commits into
mainfrom
issue-604-2-private-bucket

R-Palazzo commented May 25, 2026 •

edited

Loading

Uh oh!

sdv-team commented May 25, 2026

Uh oh!

codecov Bot commented May 25, 2026 •

edited

Loading

Uh oh!

Uh oh!

amontanez24 May 26, 2026

Uh oh!

R-Palazzo May 28, 2026

Uh oh!

R-Palazzo May 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

R-Palazzo commented May 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sdv-team commented May 25, 2026

Uh oh!

codecov Bot commented May 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

amontanez24 May 26, 2026

Choose a reason for hiding this comment

Uh oh!

R-Palazzo May 28, 2026

Choose a reason for hiding this comment

Uh oh!

R-Palazzo May 28, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

R-Palazzo commented May 25, 2026 •

edited

Loading

codecov Bot commented May 25, 2026 •

edited

Loading