feat(DENG-11298): Drop parse_campaign_name parsing logic by phil-lee70 · Pull Request #9644 · mozilla/bigquery-etl

phil-lee70 · 2026-06-26T17:23:43Z

feat(DENG-11298): Drop parse_campaign_name parsing logic

github-actions

This PR replaces the in-repo parsing logic of mozfun.marketing.parse_campaign_name with a delegation to a new moz-fx-data-shared-prod.udf.parse_campaign_name (a stub returning typed NULL), drops the function's test suite down to a single NULL-input assertion, and adds the new UDF file to routine.publish.skip.

The mechanical rewrite of the mozfun function and its delegation are straightforward. My main question is around the publish.skip entry: it behaves differently from the existing main_summary_scalars skip (which is required because that function uses ANY TYPE), and it determines whether the new shared-prod UDF actually gets deployed for the runtime reference to resolve. I've left details inline. Note I couldn't observe the production deployment path from this repo alone, so if shared-prod routine deployment uses a path that ignores routine.publish.skip, that inline concern may not apply.

github-actions · 2026-06-26T17:30:13Z

    gcs_path: ""
    skip:
    - sql/moz-fx-data-shared-prod/udf/main_summary_scalars/udf.sql
+    - sql/moz-fx-data-shared-prod/udf/parse_campaign_name/udf.sql


issue: This skip entry is categorically different from the existing main_summary_scalars one. main_summary_scalars takes processes ANY TYPE, so it cannot be created as a persistent UDF and is only ever inlined via SQL generation — skipping it from publishing is required. parse_campaign_name, by contrast, is a concrete, deployable UDF that mozfun.marketing.parse_campaign_name references at runtime.

publish_routines.publish() honors routine.publish.skip for every target project (the skip set is global, not mozfun-only — see skipped_routines() / lines 168-172, 256 in publish_routines.py). So adding this file here means bqetl routine publish --project-id=moz-fx-data-shared-prod will not deploy it. Since this is a brand-new function with no prior deployed version, the marketing.parse_campaign_name reference would then resolve to a non-existent routine — which contradicts the PR's stated goal of having the function return NULL so dependent queries keep resolving.

If the intent is for the stub to be the deployed definition (returning NULL), remove it from the skip list so it deploys. If moz-fx-data-shared-prod.udf.parse_campaign_name is instead owned/deployed by a system outside this repo and the skip exists to avoid clobbering it, add a comment here documenting that, since it's not inferable from the change.

github-actions · 2026-06-26T17:30:16Z

@@ -0,0 +1,5 @@
+-- Parses a campaign name into known segments.


nitpick: The comment here and the description in metadata.yaml ("Parses a campaign name into known segments.") describe behavior this body doesn't have — it returns a typed NULL and parses nothing. Since this is published as routine documentation, consider noting that it's currently a placeholder/no-op so readers and the generated docs aren't misled.

…ublish ordering get_routines() only scanned sql/mozfun when parsing a mozfun routine, so a mozfun UDF that delegates to a moz-fx-data-shared-prod UDF had no dependency edge recorded. The topological sorter then published the wrapper before its dependency, failing stage deploy with 'Function not found'. Include moz-fx-data-shared-prod in the candidate routine list (deduped via a set).

… order The --target routine publish groups routines into one _publish_to_target batch per source project and published them in arbitrary (insertion) order. A mozfun UDF that delegates to a moz-fx-data-shared-prod UDF spans two batches, so the dependent batch could be published before its dependency, failing stage deploy with 'Function not found'. Topologically order the source-project batches so a project is published after the projects providing routines it depends on.

parse_routines only loaded mozfun into the dependency pool, so a mozfun UDF that delegates to a moz-fx-data-shared-prod UDF could not have that dependency inlined as a temp function in its test SQL. The test then invoked the real persistent routine, failing in CI with a 403 when the test service account can't invoke the (private) routine. Always include moz-fx-data-shared-prod in the pool so the local (stubbed) definition is inlined instead.

BenWu · 2026-06-29T19:57:13Z

    gcs_path: ""
    skip:
    - sql/moz-fx-data-shared-prod/udf/main_summary_scalars/udf.sql
+    - sql/moz-fx-data-shared-prod/udf/marketing_parse_campaign_name/udf.sql


I don't think this is needed

BenWu · 2026-06-29T20:07:02Z

+        if state.get(project) == "done":
+            return
+        if state.get(project) == "visiting":
+            # cyclic dependency between projects; fall back to best-effort order


This is fine as a best-effort attempt but there are a lot of existing cases of shared-prod udf referencing mozfun so it's not impossible that udfs that work in both directions are in the same PR. I recommend at least adding logging here to point this out in the rare event that this happens

Suggested change

# cyclic dependency between projects; fall back to best-effort order

log.warning("Routine ordering: cyclic dependency between projects; fall back to best-effort order")

Alternatively you could do the ordering per udf but that's probably more complex

will add the logging 👍

scholtzan · 2026-06-30T12:38:35Z

Integration report

View full diff, if available (SQL + DAGs) (requires private-bigquery-etl access)
Private CI run
Bigeye monitoring dry-run: ✅ plan clean — view plan (requires private-bigquery-etl access)

phil-lee70 requested a review from a team as a code owner June 26, 2026 17:23

github-actions Bot reviewed Jun 26, 2026

View reviewed changes

This comment has been minimized.

Sign in to view

feat(DENG-11298): Drop parse_campaign_name parsing logic

03da916

phil-lee70 force-pushed the deng-11298-drop-parse-campaign-name branch from 5cfd431 to 03da916 Compare June 26, 2026 18:45

This comment has been minimized.

Sign in to view

Merge branch 'main' into deng-11298-drop-parse-campaign-name

a13f3cc

This comment has been minimized.

Sign in to view

Merge branch 'main' into deng-11298-drop-parse-campaign-name

24baa04

This comment has been minimized.

Sign in to view

BenWu reviewed Jun 29, 2026

View reviewed changes

phil-lee70 added 2 commits June 30, 2026 08:26

remove skip entry and add logging for deploy script

592016c

Merge branch 'main' into deng-11298-drop-parse-campaign-name

53edb7d

phil-lee70 requested a review from BenWu June 30, 2026 12:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(DENG-11298): Drop parse_campaign_name parsing logic#9644

feat(DENG-11298): Drop parse_campaign_name parsing logic#9644
phil-lee70 wants to merge 8 commits into
mainfrom
deng-11298-drop-parse-campaign-name

phil-lee70 commented Jun 26, 2026 •

edited

Loading

Uh oh!

github-actions Bot left a comment

Uh oh!

github-actions Bot Jun 26, 2026

Uh oh!

github-actions Bot Jun 26, 2026

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

BenWu Jun 29, 2026

Uh oh!

BenWu Jun 29, 2026

Uh oh!

phil-lee70 Jun 30, 2026

Uh oh!

scholtzan commented Jun 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		@@ -0,0 +1,5 @@
		-- Parses a campaign name into known segments.

	# cyclic dependency between projects; fall back to best-effort order
	log.warning("Routine ordering: cyclic dependency between projects; fall back to best-effort order")

Uh oh!

Conversation

phil-lee70 commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

feat(DENG-11298): Drop parse_campaign_name parsing logic

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Jun 26, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Jun 26, 2026

Choose a reason for hiding this comment

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

BenWu Jun 29, 2026

Choose a reason for hiding this comment

Uh oh!

BenWu Jun 29, 2026

Choose a reason for hiding this comment

Uh oh!

phil-lee70 Jun 30, 2026

Choose a reason for hiding this comment

Uh oh!

scholtzan commented Jun 30, 2026

Integration report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

phil-lee70 commented Jun 26, 2026 •

edited

Loading