
ENH NeuralNet metadata routing#1139

Merged
BenjaminBossan merged 8 commits into skorch-dev:master from adrinjalali:nn-routing-router
Apr 27, 2026
Conversation

@adrinjalali
Contributor

Implement sklearn metadata routing (router + consumer)

Closes #1095

Summary

Implements sklearn's metadata routing protocol for NeuralNet. NeuralNet is both a consumer (its module's forward method accepts arbitrary metadata) and a router (it routes metadata like groups to its internal CV splitter).

This follows the same pattern as sklearn's CalibratedClassifierCV: get_metadata_routing() returns a MetadataRouter with add_self_request(self) for self-consumed params plus add(splitter=..., method_mapping=fit→split) for the CV splitter child.

All behavior is gated on sklearn.set_config(enable_metadata_routing=True). When disabled, behavior is identical to before.

What this enables

When enable_metadata_routing=True, metadata flows correctly through sklearn meta-estimators (Pipeline, GridSearchCV, cross_validate) to NeuralNet and its internal components.

Routing groups through a Pipeline to GroupKFold:

import numpy as np
import sklearn
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.datasets import make_regression
from sklearn.model_selection import GroupKFold
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from torch import nn

from skorch.dataset import ValidSplit
from skorch.regressor import NeuralNetRegressor

sklearn.set_config(enable_metadata_routing=True)


class RegressionModule(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(10, 1)

    def forward(self, X, Z, **kwargs):
        return self.fc(X + Z)


class DictScaler(TransformerMixin, BaseEstimator):
    """Scale only the 'X' key of a dict input, pass 'Z' through."""

    def __init__(self):
        self.scaler_ = StandardScaler()

    def fit(self, X, y=None):
        self.scaler_.fit(X["X"])
        return self

    def transform(self, X):
        result = dict(X)
        result["X"] = self.scaler_.transform(X["X"]).astype("float32")
        return result


X, y = make_regression(n_samples=1000, n_features=20, random_state=0)
X, Z = X[:, :10].astype("float32"), X[:, 10:].astype("float32")
y = y.astype("float32").reshape(-1, 1)
groups = np.array([0] * 500 + [1] * 500)

# Per-sample auxiliary data (Z) is packed into a dict so the
# DataLoader batches it alongside X. Metadata routing handles
# getting `groups` through the Pipeline to the GroupKFold splitter.
X_dict = {"X": X, "Z": Z}

net = NeuralNetRegressor(
    RegressionModule,
    max_epochs=3,
    lr=0.01,
    train_split=ValidSplit(GroupKFold(2)),
)

pipe = make_pipeline(DictScaler(), net)
pipe.fit(X_dict, y, groups=groups)

Changes

skorch/net.py:

  • get_metadata_routing() — returns a MetadataRouter (not MetadataRequest). Registers self as consumer via add_self_request and the inner CV splitter (from ValidSplit.cv) as a child. This is what makes NeuralNet a router.
  • set_fit_request() / set_partial_fit_request() — custom implementations, since fit(**fit_params) accepts **kwargs and sklearn cannot auto-generate these methods. Follows the _BaseScorer.set_score_request pattern. Uses the __metadata_request__partial_fit class attribute to suppress conflicting auto-generation (the TODO references scikit-learn/scikit-learn#32111, "FIX (SLEP6) descriptor shouldn't override method").
  • _get_metadata_request() — overrides parent to use class name string as owner instead of the instance. Needed because NeuralNet.__getstate__ uses pickle.dump for CUDA-dependent attributes, which causes deepcopy to block when the module isn't picklable. An alternative would be implementing a full __deepcopy__.
  • __deepcopy__() — bypasses __getstate__/__setstate__ (which use pickle) by deepcopying each attribute individually with a fallback to shallow copy. Needed because sklearn's routing infrastructure calls deepcopy on MetadataRouter objects that hold references to the NeuralNet instance (e.g. via Pipeline's get_metadata_routing).
  • _get_splitter_for_routing() — extracts the CV splitter from ValidSplit.cv when it supports routing (e.g. GroupKFold). Returns None for int/float cv or custom callables.
  • fit_loop() — when routing is enabled, calls process_routing() to validate and route metadata. Extracts split_params for get_split_datasets from routed params. Following sklearn's router pattern, forward_params are the full fit_params (the router uses its own params directly).
  • _get_param_names() — excludes _metadata_request (set by set_fit_request, not __init__; sklearn's clone handles it separately via deepcopy).
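
The attribute-wise deepcopy with fallback described in the __deepcopy__() bullet above can be sketched like this. It is a minimal illustration of the idea, not the actual skorch implementation:

```python
# Sketch: deepcopy each attribute individually, falling back to a shallow
# copy (or the original reference) for objects that cannot be deep-copied,
# e.g. torch modules built from locally defined classes.
import copy


class SafeDeepcopy:
    def __deepcopy__(self, memo):
        cls = self.__class__
        new = cls.__new__(cls)
        memo[id(self)] = new  # register early to handle reference cycles
        for name, value in self.__dict__.items():
            try:
                setattr(new, name, copy.deepcopy(value, memo))
            except Exception:
                try:
                    setattr(new, name, copy.copy(value))
                except Exception:
                    setattr(new, name, value)  # keep the original reference
        return new
```

This bypasses the pickle-based __getstate__/__setstate__ path entirely, which is the point: sklearn's routing infrastructure deepcopies MetadataRouter objects that hold references to the net.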

What this does NOT cover (future PRs)

  • Predict-path metadata (set_predict_request / set_predict_proba_request)
  • Routing metadata to scoring callbacks (e.g. sample_weight to EpochScoring)

Test plan

  • set_fit_request / set_partial_fit_request guards and behavior
  • get_metadata_routing returns MetadataRouter with correct children
  • Splitter child present for GroupKFold, absent for int cv and None
  • Extra fit_params reach module forward with routing enabled
  • Extra fit_params don't break ValidSplit with routing enabled
  • groups routed to ValidSplit(GroupKFold) automatically (no set_fit_request needed)
  • clone() preserves metadata requests
  • Pipeline integration with routing enabled
  • GridSearchCV integration with routing enabled
  • Backward compat: legacy fit_params behavior when routing disabled
  • Full existing test suite passes (283 tests, 0 regressions)

Disclaimer: the code is Claude-generated, but I've reviewed it. This is the second iteration of the solution, which I ended up liking, and it matches what we do in sklearn.

cc @BenjaminBossan @tsbinns @DCoupry

@adrinjalali adrinjalali changed the title Nn routing router ENH NeuralNet metadata routing Apr 20, 2026
Collaborator

@BenjaminBossan BenjaminBossan left a comment


Thanks for adding support for metadata routing to skorch. Overall, this looks good, but I have a few comments; please check.

We probably can add a Mixin in scikit-learn so that you could avoid using the private type of API call.

So I assume this hasn't happened (yet)?

Comment thread .gitignore Outdated
Comment thread skorch/net.py Outdated
break
setattr(self, 'callbacks_', callbacks_new)

def __deepcopy__(self, memo):
Collaborator


So I checked and prior to sklearn 1.8.0, the tests would all pass without this custom __deepcopy__ method. With 1.8.0, we get:

AttributeError: Can't pickle local object 'TestMetadataRouting.test_fit_with_extra_params_and_routing.<locals>.RecordingModule'

This seems to stem from RecordingModule being defined locally in the test. When I move it to the global scope, the test passes even with sklearn 1.8.0 (other tests with local classes will fail but they all pass when making them global). So it appears that the error comes from a recent change in sklearn that makes something in metadata routing incompatible with locally defined classes.

To me, it would be okay to have a __deepcopy__ method here but it seems there is something bigger going on here.
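
For context, the failure mode above is easy to reproduce outside skorch: instances of classes defined inside a function cannot be pickled, so any copy path that falls back to pickle breaks on them. A minimal illustration (`make_module` and `RecordingModule` are stand-in names):

```python
# Pickle stores classes by qualified name; a class defined inside a
# function ('make_module.<locals>.RecordingModule') cannot be looked up
# at load time, so pickling its instances fails.
import pickle


def make_module():
    class RecordingModule:  # defined inside a function
        pass
    return RecordingModule()


obj = make_module()
try:
    pickle.dumps(obj)
    failed = False
except Exception:
    failed = True
print(failed)  # True: the instance cannot be pickled
```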

Contributor Author


fix on sklearn side: scikit-learn/scikit-learn#33827

Comment thread skorch/net.py
Comment thread skorch/net.py
MethodMapping,
_routing_enabled,
process_routing,
)
Collaborator


I wasn't sure how far back these imports reach, but I checked with sklearn 1.4.0 and the tests passed.

Comment thread skorch/net.py Outdated
# each attribute individually, falling back to shallow copy
# for non-copyable objects (e.g. torch modules with locally
# defined classes).
import copy
Collaborator


Can be global import

Comment thread skorch/net.py
router = MetadataRouter(owner=self.__class__.__name__)
router.add_self_request(self)

ts = self.train_split
Collaborator


nice

Comment thread skorch/tests/test_net.py
@adrinjalali
Contributor Author

We probably can add a Mixin in scikit-learn so that you could avoid using the private type of API call.

So I assume this hasn't happened (yet)?

Yeah, I haven't managed to get it in. The metadata routing side of sklearn hasn't been very active, since we've been focusing on other projects, but that also means the issues haven't been too urgent, which is a good thing.

The __deepcopy__ issue, however, is real and I'd like to fix it in sklearn, we shouldn't be deepcopy-ing estimators there.

Collaborator

@BenjaminBossan BenjaminBossan left a comment


Thanks for your updates, Adrin, and for working on the fix in sklearn too. PR LGTM.

As it's not an urgent change, I'll leave it open for a week or so in case Thomas or someone else wants to review as well.

Comment thread skorch/net.py
if not self.initialized_:
self.initialize()

# When called from fit(), _routing_method is threaded in via
Collaborator


Ah, too bad that this is needed, but it is what it is.

@tsbinns

tsbinns commented Apr 22, 2026

As it's not an urgent change, I'll leave it open for a week or so in case Thomas or someone else wants to review as well.

Thanks both for the work!
Sure, could try in the next days to see if the workarounds I'm currently using are redundant now.

@tsbinns

tsbinns commented Apr 24, 2026

Sure, could try in the next days to see if the workarounds I'm currently using are redundant now.

Just had a go and no more workarounds needed. TYSM @adrinjalali & @BenjaminBossan!

@tsbinns

tsbinns commented Apr 24, 2026

Just so I can sort any env changes: is there a release planned soon, or will this live in main for a while?

@BenjaminBossan BenjaminBossan merged commit f5a7928 into skorch-dev:master Apr 27, 2026
16 checks passed
@BenjaminBossan
Collaborator

Just had a go and no more workarounds needed.

Thanks for testing.

Just so I can sort any env changes: is there a release planned soon, or will this live in main for a while?

I think we can do a release soon. It depends a bit on co-maintainer availability, so I can't promise.

