Releases: NVIDIA-NeMo/DataDesigner

v0.6.0 2026-05-13

13 May 21:15
3c8394e

🎨 NeMo Data Designer v0.6.0

This release comes with two major updates:

  • Async by default: The async execution engine is now the default path in Data Designer. Pipelines schedule work at the cell level, overlapping independent columns and adapting model concurrency per provider/model.

  • 🔌 Plugins are stable: Plugins have moved out of experimental mode. Seed readers, column generators, and processors can now live in normal Python packages with typed configs and entry-point discovery.
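
To make the cell-level scheduling idea concrete, here is a minimal asyncio sketch. It is not Data Designer's engine or API: the column names, model aliases, `fake_generate`, and the fixed per-model limits are all hypothetical stand-ins, and the adaptive concurrency described above is replaced by a constant cap per model. It only illustrates how one future per cell lets independent columns overlap while dependent cells wait for exactly the inputs they need.

```python
import asyncio

# Hypothetical column specs: name -> (model alias, upstream columns it depends on).
COLUMNS = {
    "topic": ("model-a", []),
    "question": ("model-a", ["topic"]),
    "persona": ("model-b", []),
    "answer": ("model-b", ["question", "persona"]),
}

# Fixed per-model concurrency caps (assumption); a real engine would adapt these.
LIMITS = {"model-a": 2, "model-b": 2}


async def fake_generate(column: str, row: int, inputs: dict) -> str:
    """Stand-in for a model call; sleeps briefly instead of calling an API."""
    await asyncio.sleep(0.01)
    return f"{column}[{row}]({','.join(sorted(inputs))})"


async def run(num_rows: int) -> list:
    semaphores = {model: asyncio.Semaphore(n) for model, n in LIMITS.items()}
    loop = asyncio.get_running_loop()
    # One future per (row, column) cell, so scheduling happens per cell.
    cells = {
        (row, col): loop.create_future()
        for row in range(num_rows)
        for col in COLUMNS
    }

    async def fill(row: int, col: str) -> None:
        model, deps = COLUMNS[col]
        # Wait only for this cell's upstream cells in the same row.
        inputs = {dep: await cells[(row, dep)] for dep in deps}
        # Cap in-flight requests per model, not globally.
        async with semaphores[model]:
            value = await fake_generate(col, row, inputs)
        cells[(row, col)].set_result(value)

    await asyncio.gather(*(fill(row, col) for row, col in cells))
    return [
        {col: cells[(row, col)].result() for col in COLUMNS}
        for row in range(num_rows)
    ]


if __name__ == "__main__":
    for record in asyncio.run(run(3)):
        print(record)
```

In this sketch, `topic` and `persona` have no dependencies, so their cells start immediately and run side by side, while `answer` for a given row starts as soon as that row's `question` and `persona` are done, without waiting for other rows.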

What's Changed

New Contributors

Full Changelog: v0.5.9...v0.6.0

v0.5.9 2026-04-28

29 Apr 00:13
93ae875

What's Changed

  • fix: normalize image_url blocks to OpenAI-compliant dict format by @nabinchha in #577
  • docs: add VLM long-document understanding dev note and recipes by @nabinchha in #579
  • docs: update resource links in VLM long-document understanding dev note by @nabinchha in #580
  • ci: raise agent audit turn limit and preserve logs by @andreatgretel in #571
  • feat: update default nvidia-vision model to nemotron-3-nano-omni by @nabinchha in #583

Full Changelog: v0.5.8...v0.5.9

v0.5.8 2026-04-27

27 Apr 20:01
4662288

What's Changed

  • chore: bump pillow and python-multipart for CVEs, add SECURITY.md by @johnnygreco in #564
  • fix(ci): grant permissions to reusable workflow calls by @andreatgretel in #561
  • ci: bump the all-actions group across 1 directory with 5 updates by @dependabot[bot] in #558
  • refactor: unify duplicate DAG construction (dag.py + ExecutionGraph) by @przemekboruta in #511
  • fix: prevent sticky progress bar ghost lines from terminal wrapping by @andreatgretel in #565
  • fix: Updated default reasoning model for nvidia by @kirit93 in #568
  • chore: async engine readiness - blockers and polish before default by @andreatgretel in #553
  • chore: remove obsolete Cerebro ignore entries by @johnnygreco in #570
  • chore: add ko_KR locale to nemotron personas datasets by @johnnygreco in #572
  • chore: bump lxml and nbconvert to address security advisories by @johnnygreco in #574

Full Changelog: v0.5.7...v0.5.8

v0.5.7 2026-04-17

17 Apr 22:06
8be4ff7

What's Changed

New Contributors

Full Changelog: v0.5.6...v0.5.7

v0.5.6 2026-04-09

09 Apr 19:36
6505ce4

What's Changed

Full Changelog: v0.5.5...v0.5.6

v0.5.5 2026-04-02

02 Apr 16:31
d43ac1c

What's Changed

  • fix: Claude Code marketplace plugin structure and install docs by @johnnygreco in #458
  • docs: update dev note with TL;DR tips and install instructions by @johnnygreco in #461
  • chore: remove unused .claude-plugin directory by @johnnygreco in #463
  • chore: async engine follow-up - rename, preview, lifecycle, progress by @andreatgretel in #456
  • docs: restructure agent and contributor documentation (plan 427, PR 1) by @nabinchha in #454
  • fix: address nspect vulnerability report for requests and cryptography by @johnnygreco in #475
  • fix: update health checks to use new ModelFacade client API by @andreatgretel in #470
  • ci: upgrade GitHub Actions for Node.js 24 compatibility by @ko3n1g in #450
  • chore: reduce Greptile review noise from defensive coding suggestions by @andreatgretel in #423
  • docs: consolidated seed reader documentation by @eric-tramel in #481
  • fix: bump pymdown-extensions for pygments 2.20.0 compat by @eric-tramel in #482
  • feat: add fr_FR locale to nemotron personas datasets by @johnnygreco in #468
  • docs: add native model client dev note by @nabinchha in #465
  • fix: respect max_parallel_requests in HTTP connection pool size by @przemekboruta in #460
  • docs: center diagram images in native model client dev note by @nabinchha in #483
  • docs: update architecture-and-performance.md to reflect AIMD changes by @nabinchha in #467
  • chore: update review code skill output and tone by @nabinchha in #477
  • ci: add agentic CI plan, health probe workflow, and recipe scaffold by @andreatgretel in #473
  • test: add transport-wiring regression tests for #459 by @nabinchha in #485

New Contributors

Full Changelog: v0.5.4...v0.5.5

v0.5.4 2026-03-24

25 Mar 02:02
0a7b9e0

🔒 Note on the LiteLLM Supply Chain Incident (2026-03-24)

Earlier today, malicious versions of litellm (1.82.7 and 1.82.8) were published to PyPI containing a credential stealer that exfiltrates cloud credentials, SSH keys, and cryptocurrency wallets on any Python process startup.

Data Designer v0.5.4 removes litellm as a dependency entirely. We recommend all users upgrade.

For users on prior versions, here is our assessment:

  • v0.3.0 – v0.5.3: litellm has been pinned at >=1.73.6,<1.80.12 since January, so these releases cannot resolve to the compromised versions.
  • v0.2.2 and v0.2.3: These releases briefly carried an upper bound of <2, which in theory permitted resolution to the malicious 1.82.x versions. Both have been yanked from PyPI as a precaution.
  • Realistic risk from Data Designer is very low. Exposure would require a user pinned to one of the two yanked versions and running a fresh install or dependency update during the few hours the compromised package was live this morning. That said, please verify your installed versions and upgrade Data Designer to v0.5.4 at your earliest convenience.
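
One way to verify the installed litellm version is a short check against the two versions named above. This is a sketch using only the Python standard library (`importlib.metadata`); the helper names `installed_version` and `litellm_status` are illustrative and not part of Data Designer. It reports only what is installed in the current environment and does not detect whether a compromised version was installed and later replaced.

```python
from importlib import metadata
from typing import Optional

# The two litellm versions identified as malicious in the note above.
COMPROMISED = {"1.82.7", "1.82.8"}


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a package, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def litellm_status(installed: Optional[str]) -> str:
    """Classify a litellm version string against the compromised set."""
    if installed is None:
        return "not installed"
    if installed in COMPROMISED:
        return "compromised"
    return "ok"


if __name__ == "__main__":
    version = installed_version("litellm")
    print(f"litellm {version}: {litellm_status(version)}")
```

Run it with the same interpreter that runs your pipelines, since each virtual environment has its own set of installed packages.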

What's Changed

New Contributors

Full Changelog: v0.5.3...v0.5.4

v0.5.3 2026-03-12

12 Mar 23:12
447ed59

What's Changed

  • fix: cache notebook builds to avoid flaky upstream model failures by @andreatgretel in #370
  • feat: canonical model client types, protocols, and LiteLLM bridge adapter by @nabinchha in #359
  • fix: processor artifacts type, discovery, and loading by @andreatgretel in #366
  • feat: add ExecutionGraph, CompletionTracker, and Task model for async scheduler by @andreatgretel in #356
  • fix: handle discriminated unions in oneOf pruning validator by @andreatgretel in #376
  • docs: account for vLLM reasoning field migration in plan 343 by @nabinchha in #377
  • chore: add Claude Code skill for code review by @nabinchha in #372
  • fix: replace removed DuckDB record_batch() with to_arrow_reader() by @andreatgretel in #380
  • fix: patch litellm ImageURLListItem to make index field optional (#384) by @nabinchha in #385
  • chore: improve test guidelines in AGENTS.md by @nabinchha in #387
  • fix: raise clear error when all records are dropped during generation by @nabinchha in #383
  • feat: add async generator migration with symmetric bridging and statefulness by @andreatgretel in #378
  • docs: add Enterprise Text-to-SQL and Search Agent recipes by @dhruvnathawani in #395
  • refactor: Decouple ModelFacade from LiteLLM via ModelClient adapter by @nabinchha in #373
  • docs: search agent dev note by @dhruvnathawani in #350
  • feat(cli): bootstrap default configs on CLI startup by @johnnygreco in #401
  • fix: pin chardet<6 to suppress RequestsDependencyWarning by @andreatgretel in #405
  • fix: add chardet<6 constraint to published engine package by @johnnygreco in #406

Full Changelog: v0.5.2...v0.5.3

v0.5.2 2026-03-04

05 Mar 04:49
e2c94da

What's Changed

New Contributors

  • @3mei made their first contribution in #365

Full Changelog: v0.5.1...v0.5.2

v0.5.1 2026-02-20

20 Feb 21:06
8f7a720

Data Designer now supports image generation!

What's Changed

  • docs: Updated url by @kirit93 in #325
  • docs: deep research trajectories with NDD and MCP tool use by @eric-tramel in #326
  • refactor: callback-based processor design by @andreatgretel in #294
  • feat: add image generation support with multi-modal context by @nabinchha in #317
  • docs: add image generation documentation and image-to-image editing tutorial by @nabinchha in #319
  • chore: move ArtifactStorage to engine/storage/ module by @nabinchha in #321
  • chore: gitignore Cerebro knowledge base files by @johnnygreco in #328
  • feat(engine): env-var switch for async-first models experiment by @eric-tramel in #280
  • docs: Moved nav to left hand side by @kirit93 in #331
  • feat: add --save-results option to preview command by @johnnygreco in #333
  • chore: Improve CLI startup with lazy heavy import cleanup by @johnnygreco in #330
  • feat: add allow_resize for 1:N and N:1 generation patterns by @andreatgretel in #286
  • chore: address Andre's feedback on --save-results and CLI preview by @johnnygreco in #335
  • chore: remove example_allow_resize.py from repo root by @andreatgretel in #337
  • fix: make DropColumnsProcessorConfig idempotent and support reasoning columns by @andreatgretel in #334
  • feat: add push_to_hub_from_folder classmethod for uploading saved datasets by @nabinchha in #340
  • fix: handle bool, int, float in convert_to_row_element by @dhruvnathawani in #336
  • feat: auto-detect ImageContext format for image-to-image generation by @nabinchha in #342

New Contributors

Full Changelog: v0.5.0...v0.5.1