perf: Disable USD notice handlers during usd_replicate for 25-35% faster startup by aloysbaillet · Pull Request #5070 · isaac-sim/IsaacLab

aloysbaillet · 2026-03-20T11:27:00Z

Description

Summary

Disable IFabricUsd and SimulationManager USD notice handlers during bulk environment cloning in usd_replicate(), reducing startup time by 25-35% for manager-based tasks.

Problem

When usd_replicate() calls Sdf.CopySpec to clone thousands of environment prims, every single copy triggers IFabricUsd UsdNoticeListener::Handle(UsdNotice::ObjectsChanged) — an expensive Fabric↔USD sync operation. For Reach-Franka (4096 envs), this added 17 seconds (438 notice handler calls) during scene construction.

IsaacSim's own Cloner.disable_change_listener() already handles this by disabling the notice handler before cloning, but IsaacLab v3's replacement usd_replicate() function didn't replicate this behavior.

Additionally, the import path from isaacsim.core.simulation_manager import SimulationManager resolves to PhysxManager under IsaacLab's lazy_export module system, which lacks the enable_fabric_usd_notice_handler method. The correct import is from isaacsim.core.simulation_manager.impl.simulation_manager import SimulationManager.

Changes

source/isaaclab/isaaclab/cloner/cloner_utils.py:

• Add _disable_usd_notice_handlers() / _enable_usd_notice_handlers() helpers that mirror isaacsim.core.cloner.Cloner.disable_change_listener()
• Wrap usd_replicate() body in try/finally to disable handlers during bulk USD authoring
• Use correct full import path for SimulationManager

source/isaaclab/isaaclab/utils/string.py:

• Cache string_to_callable() with @functools.cache to avoid repeated importlib.import_module calls
• Fix triple ast.parse() call in is_lambda_expression()

Benchmark Results

Tested on DGXC (2x NVIDIA L40, 4096 envs, 1 GPU, warm run):

Task	Before	After	Improvement
Isaac-Reach-Franka-v0	29.0s	20.0s	-31%
Isaac-Velocity-Rough-H1-v0	37.9s	24.7s	-35%
Isaac-Velocity-Flat-Anymal-D-v0	29.8s	22.1s	-26%
Isaac-Cartpole-Direct-v0	8.9s	9.1s	~0% (minimal cloning)
Isaac-Ant-Direct-v0	11.9s	12.0s	~0% (minimal cloning)
No FPS regressions. All tasks pass.

Root Cause Analysis

Profiling with Nsight Systems showed IFabricUsd UsdNoticeListener::Handle consuming:

• 17.0s for Reach-Franka (438 calls)
• 9.9s for H1-Rough
• 22.9s for Dexsuite-Kuka

These notice handler calls happen during usd_replicate when Sdf.CopySpec modifies the USD stage. By disabling the handlers (as IsaacSim's own Cloner does), the Fabric sync is deferred to a single batch at the end.

Type of change

Bug fix (non-breaking change which fixes an issue)

Checklist

I have read and understood the contribution guidelines
I have run the pre-commit checks with ./isaaclab.sh --format
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
I have updated the changelog and the corresponding version in the extension's config/extension.toml file
I have added my name to the CONTRIBUTORS.md or my name already exists there

greptile-apps · 2026-03-20T11:31:04Z

Greptile Summary

This PR delivers a meaningful startup-time optimization (25–35% reduction for large environments) by mirroring what IsaacSim's own Cloner.disable_change_listener() does: suppressing IFabricUsd and SimulationManager USD notice handlers during bulk Sdf.CopySpec calls in usd_replicate(), so that Fabric↔USD synchronization is deferred to a single batch instead of firing on every spec copy. A secondary optimization caches string_to_callable() results and eliminates a triple ast.parse() call in is_lambda_expression().

Key changes:

cloner_utils.py: New _disable_usd_notice_handlers / _enable_usd_notice_handlers helpers wrap usd_replicate() in a try/finally; all per-depth Sdf.ChangeBlock calls are consolidated into one.
string.py: @functools.cache added to string_to_callable(); is_lambda_expression() fixed to parse the AST only once.

Issues found:

The USD notice handler (enable_usd_notice_handler) is always restored to True in the finally block without first saving its original state. The fabric handler correctly saves and conditionally restores its state, but the USD handler does not — this asymmetry means a caller that had previously disabled the handler will have it silently re-enabled after usd_replicate() returns.
UsdUtils.StageCache.Get().Insert(stage) permanently registers the stage in the global cache if it was not already there, with no corresponding cleanup. In normal Isaac Sim operation the stage is pre-registered, but in unit tests or lightweight scenarios this can prevent the stage from being garbage-collected.
The consolidation of per-depth Sdf.ChangeBlock calls into a single outer block is a behavioral change from the original design (which committed after each depth to "stabilize composition"); this is likely safe for the tested assets but deserves an explanatory comment.
@functools.cache on string_to_callable is unbounded and never cleared, which could accumulate memory in long-running training processes.

Confidence Score: 3/5

The optimization is architecturally sound and well-benchmarked, but two correctness issues in handler state management should be resolved before merging.
The core approach (mirroring IsaacSim's own Cloner pattern) is correct and the benchmarks show no regressions. However, the unconditional enable_usd_notice_handler(True) in the finally block without saving prior state is an asymmetric bug that could cause subtle issues if usd_replicate() is called within an outer disable context. The stage cache insertion side effect is a second correctness concern. These two P1 issues prevent a higher score.
Pay close attention to source/isaaclab/isaaclab/cloner/cloner_utils.py, specifically the _disable_usd_notice_handlers / _enable_usd_notice_handlers pair and the stage cache insertion logic.

Important Files Changed

Filename	Overview
source/isaaclab/isaaclab/cloner/cloner_utils.py	Adds `_disable_usd_notice_handlers`/`_enable_usd_notice_handlers` helpers and wraps `usd_replicate()` in try/finally to suppress Fabric sync overhead during bulk cloning. Two correctness concerns: (1) the USD notice handler state is not saved before disabling and is always unconditionally re-enabled (unlike the fabric handler which correctly saves/restores state), and (2) stage insertion into `UsdUtils.StageCache` without tracking prior membership risks a permanent cache side-effect. The ChangeBlock consolidation from per-depth to a single block is also a behavioral change that warrants a comment explaining why it is safe.
source/isaaclab/isaaclab/utils/string.py	Two clean optimizations: fixes triple `ast.parse()` redundancy in `is_lambda_expression()` by caching the tree, and adds `@functools.cache` to `string_to_callable()` to avoid repeated `importlib.import_module` costs. The cache is unbounded and process-lifetime, which may cause memory growth in long-running sessions — `lru_cache(maxsize=N)` would be safer.