Skip to content

fix(ingest): skip empty columns for CLL#17615

Merged
kevinkarchacryl merged 4 commits into
masterfrom
fix-dbt-empty-column-cll
May 28, 2026
Merged

fix(ingest): skip empty columns for CLL#17615
kevinkarchacryl merged 4 commits into
masterfrom
fix-dbt-empty-column-cll

Conversation

@kevinkarchacryl
Copy link
Copy Markdown
Contributor

@kevinkarchacryl kevinkarchacryl commented May 28, 2026

The sqlglot SQL parser upgrade from v29 to v30 (shipped in 1.5.0.19) changed how certain SQL patterns are analyzed during CLL extraction, causing some upstream column names to be empty strings. These empty strings were being passed directly to make_schema_field_urn(), producing invalid URNs of the form urn:li:schemaField:(urn:li:dataset:(...),) that GMS rejects, causing the entire ingestion to fail. The fix adds guards in two places in dbt_common.py.

@github-actions github-actions Bot added the ingestion PR or Issue related to the ingestion of metadata label May 28, 2026
@github-actions
Copy link
Copy Markdown
Contributor

Linear: ING-2776

@codecov
Copy link
Copy Markdown

codecov Bot commented May 28, 2026

Codecov Report

❌ Patch coverage is 66.66667% with 1 line in your changes missing coverage. Please review.
✅ All tests successful. No failed tests found.

Files with missing lines Patch % Lines
...ion/src/datahub/ingestion/source/dbt/dbt_common.py 66.66% 1 Missing ⚠️

📢 Thoughts on this report? Let us know!

@kevinkarchacryl kevinkarchacryl merged commit 36d002c into master May 28, 2026
48 of 49 checks passed
@kevinkarchacryl kevinkarchacryl deleted the fix-dbt-empty-column-cll branch May 28, 2026 17:48
@bmaquet
Copy link
Copy Markdown
Contributor

bmaquet commented May 29, 2026

Thank you @kevinkarchacryl for this fix!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ingestion PR or Issue related to the ingestion of metadata

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants