feat(go): Enable flag evaluation metrics E2E tests for Go; fix reason=static by leoromanovsky · Pull Request #6410 · DataDog/system-tests

leoromanovsky · 2026-03-02T20:45:35Z

Motivation

Per the RFC "Flag evaluations tracking for APM tracers": collect a feature_flag.evaluations OTel counter metric on each flag evaluation to track SDK usage. These system tests validate the end-to-end pipeline: evaluation in the weblog → OTel SDK aggregation → OTLP export to agent → agent forwards to proxy → system tests assert on interfaces.agent.get_metrics().

Changes

tests/ffe/test_flag_eval_metrics.py: 5 E2E test classes for Go:
- Test_FFE_Eval_Metric_Basic: metric exists with correct feature_flag.key, variant, reason=static, allocation_key tags
- Test_FFE_Eval_Metric_Count: same flag evaluated 5× → metric count ≥ 5
- Test_FFE_Eval_Metric_Different_Flags: two flags → two separate metric series
- Test_FFE_Eval_Metric_Error: non-existent flag → reason=error, error.type=flag_not_found
- Test_FFE_Eval_Metric_Type_Mismatch: STRING flag evaluated as BOOLEAN → reason=error, error.type=type_mismatch
manifests/golang.yml: Enable metrics tests for Go at v2.7.0-dev.
All other language manifests: missing_feature (pending tracer implementation).
utils/_context/_scenarios/__init__.py: Add DD_METRICS_OTEL_ENABLED=true, OTEL_EXPORTER_OTLP_METRICS_ENDPOINT, and agent_interface_timeout=30 to the FFE scenario.
utils/build/docker/golang/app/_shared/common/ffe.go: Dispatch /ffe by variationType instead of always calling ofClient.Object(), enabling type mismatch errors.

Decisions

reason=static not targeting_match. The UFC engine returns AssignmentReason::Static for a 100% catch-all allocation (empty rules array, single split with shards:[]). The test fixtures use this shape, so the assertion reflects the actual engine output.

agent_interface_timeout=30 at scenario level. The OTLP pipeline (OTel SDK export + agent flush + proxy buffer) needs up to 30s to propagate. Moving the wait to the scenario level via agent_interface_timeout replaces per-test sleep() calls.

OTLP metrics endpoint direct to agent. The FFE scenario's DD_TRACE_AGENT_URL points to the proxy, which has no OTLP receiver. OTEL_EXPORTER_OTLP_METRICS_ENDPOINT is set directly to agent:4318/v1/metrics to bypass the proxy for metric export. The agent then forwards processed metrics to /api/v2/series, where system tests capture them.

github-actions · 2026-03-02T20:46:23Z

CODEOWNERS have been resolved as:

tests/ffe/test_flag_eval_metrics.py                                     @DataDog/feature-flagging-and-experimentation-sdk @DataDog/system-tests-core
manifests/cpp_httpd.yml                                                 @DataDog/dd-trace-cpp
manifests/cpp_kong.yml                                                  @DataDog/system-tests-core
manifests/cpp_nginx.yml                                                 @DataDog/dd-trace-cpp
manifests/dotnet.yml                                                    @DataDog/apm-dotnet @DataDog/asm-dotnet
manifests/golang.yml                                                    @DataDog/dd-trace-go-guild
manifests/java.yml                                                      @DataDog/asm-java @DataDog/apm-java
manifests/nodejs.yml                                                    @DataDog/dd-trace-js
manifests/php.yml                                                       @DataDog/apm-php @DataDog/asm-php
manifests/python.yml                                                    @DataDog/apm-python @DataDog/asm-python
manifests/ruby.yml                                                      @DataDog/ruby-guild @DataDog/asm-ruby
manifests/rust.yml                                                      @DataDog/apm-rust
utils/_context/_scenarios/__init__.py                                   @DataDog/system-tests-core
utils/build/docker/golang/app/_shared/common/ffe.go                     @DataDog/dd-trace-go-guild @DataDog/system-tests-core

datadog-datadog-prod-us1 · 2026-03-03T21:25:27Z

✅ Tests

🎉 All green!

❄️ No new flaky tests detected
🧪 All tests passed

_{This comment will be updated automatically if new data arrives.

🔗 Commit SHA: 0951079 | Docs | Datadog PR Page | Was this helpful? React with 👍/👎 or give us feedback!}

…test The Go weblog was calling ofClient.Object() for all evaluations, ignoring the variationType field. This meant type conversion errors could never occur, unlike Python/Node.js which dispatch to the type-specific methods (BooleanValue, StringValue, etc.). Fix the Go weblog to dispatch based on variationType, matching the behavior of other language weblogs. Add Test_FFE_Eval_Metric_Type_Mismatch: configures a STRING flag but evaluates it as BOOLEAN, triggering a type conversion error that happens after the core evaluate() returns. This test would fail with the old evaluate()-level metric recording (which would see targeting_match / no error) and only passes when metrics are recorded via a Finally hook (which sees error / type_mismatch).

Only Go supports flag evaluation metrics via OTel so far. Without this, the test file runs for all FFE-enabled languages and fails.

Replace hardcoded time.sleep(25) in each test setup with agent_interface_timeout=30 on the FFE scenario. The container shutdown flushes metrics; the timeout gives the agent time to receive and process them.

Assert that feature_flag.result.allocation_key tag is present with value "default-allocation" on successful flag evaluations.

- Enable tests/ffe/test_flag_eval_metrics.py for PHP (>=1.16.0) and Node.js (express4 v6.0.0-pre) - Fix reason assertion: UFC engine returns AssignmentReason::Static for a 100% catch-all allocation (rules:[], splits:[{shards:[]}]), not TargetingMatch - Add type annotations to test helpers (mypy compliance)

…=static - Enable tests/ffe/test_flag_eval_metrics.py for Go only (PHP and Node.js remain missing_feature) - Fix reason assertion: UFC engine returns AssignmentReason::Static for a 100% catch-all allocation (rules:[], splits:[{shards:[]}]), not TargetingMatch - Add type annotations to test helpers (mypy compliance)

… entries

…eval_metrics entry

leoromanovsky · 2026-03-12T22:40:25Z

/merge

gh-worker-devflow-routing-ef8351 · 2026-03-12T22:40:28Z

View all feedbacks in Devflow UI.

2026-03-12 22:40:28 UTC ℹ️ Start processing command /merge

2026-03-12 22:40:58 UTC ℹ️ MergeQueue: waiting for PR to be ready

This pull request is not mergeable according to GitHub. Common reasons include pending required checks, missing approvals, or merge conflicts — but it could also be blocked by other repository rules or settings.
It will be added to the queue as soon as checks pass and/or get approvals. View in MergeQueue UI.
Note: if you pushed new commits since the last approval, you may need additional approval.
You can remove it from the waiting list with /remove command.

2026-03-12 23:09:07 UTC ℹ️ MergeQueue: merge request added to the queue

The expected merge time in main is approximately 8m (p90).

2026-03-12 23:09:51 UTC ℹ️ MergeQueue: This merge request was merged

leoromanovsky mentioned this pull request Mar 2, 2026

feat(openfeature): add flag evaluation tracking via OTel Metrics DataDog/dd-trace-go#4489

Merged

leoromanovsky marked this pull request as ready for review March 3, 2026 22:35

leoromanovsky requested review from a team as code owners March 3, 2026 22:35

leoromanovsky requested review from brettlangdon, claponcet, manuel-alvarez-alvarez, r1viollet, sameerank, typotter and xlamorlette-datadog and removed request for a team March 3, 2026 22:35

leoromanovsky added 6 commits March 11, 2026 15:10

Remove obvious comment in ffe.go

c0624e3

Mark test_flag_eval_metrics.py as missing_feature for non-Go languages

c7586ba

Only Go supports flag evaluation metrics via OTel so far. Without this, the test file runs for all FFE-enabled languages and fails.

Remove per-test sleeps, use scenario-level agent_interface_timeout

acc0754

Replace hardcoded time.sleep(25) in each test setup with agent_interface_timeout=30 on the FFE scenario. The container shutdown flushes metrics; the timeout gives the agent time to receive and process them.

Add allocation_key assertion to flag eval metrics test

a2fd84f

Assert that feature_flag.result.allocation_key tag is present with value "default-allocation" on successful flag evaluations.

leoromanovsky force-pushed the leo.romanovsky/ffe-eval-metrics branch from d342bb1 to a33486a Compare March 11, 2026 19:13

leoromanovsky mentioned this pull request Mar 11, 2026

feat(php): Add /ffe endpoint and OTel SDK to PHP weblog #6475

Draft

leoromanovsky changed the title ~~feat(ffe): add flag evaluation metrics E2E tests (Go + PHP)~~ feat(ffe): add flag evaluation metrics E2E tests (Go + PHP + Node) Mar 11, 2026

leoromanovsky changed the title ~~feat(ffe): add flag evaluation metrics E2E tests (Go + PHP + Node)~~ feat(go): Enable flag evaluation metrics E2E tests for Go; fix reason=static Mar 11, 2026

leoromanovsky added 2 commits March 11, 2026 16:53

fix(manifests): restore alphabetical order for test_flag_eval_metrics…

81b3a6d

… entries

fix(golang): restore golang.yml to main, re-apply only FFE test_flag_…

584d91e

…eval_metrics entry

gh-worker-dd-devflow-36fce6 bot added the mergequeue-status: waiting label Mar 11, 2026

Merge branch 'main' into leo.romanovsky/ffe-eval-metrics

d3da698

gh-worker-dd-devflow-36fce6 bot added mergequeue-status: removed and removed mergequeue-status: waiting labels Mar 12, 2026

Merge branch 'main' into leo.romanovsky/ffe-eval-metrics

0951079

gh-worker-dd-devflow-36fce6 bot added mergequeue-status: waiting mergequeue-status: queued mergequeue-status: in_progress and removed mergequeue-status: removed mergequeue-status: waiting mergequeue-status: queued labels Mar 12, 2026

gh-worker-dd-mergequeue-cf854d bot merged commit 93ab576 into main Mar 12, 2026
2721 of 2726 checks passed

gh-worker-dd-mergequeue-cf854d bot deleted the leo.romanovsky/ffe-eval-metrics branch March 12, 2026 23:09

gh-worker-dd-devflow-36fce6 bot added mergequeue-status: done and removed mergequeue-status: in_progress labels Mar 12, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(go): Enable flag evaluation metrics E2E tests for Go; fix reason=static#6410

feat(go): Enable flag evaluation metrics E2E tests for Go; fix reason=static#6410
gh-worker-dd-mergequeue-cf854d[bot] merged 14 commits intomainfrom
leo.romanovsky/ffe-eval-metrics

leoromanovsky commented Mar 2, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 2, 2026 •

edited

Loading

Uh oh!

datadog-datadog-prod-us1 bot commented Mar 3, 2026 •

edited by datadog-official bot

Loading

Uh oh!

leoromanovsky commented Mar 12, 2026

Uh oh!

gh-worker-devflow-routing-ef8351 bot commented Mar 12, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

leoromanovsky commented Mar 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Changes

Decisions

Uh oh!

github-actions bot commented Mar 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

datadog-datadog-prod-us1 bot commented Mar 3, 2026 • edited by datadog-official bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

leoromanovsky commented Mar 12, 2026

Uh oh!

gh-worker-devflow-routing-ef8351 bot commented Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

leoromanovsky commented Mar 2, 2026 •

edited

Loading

github-actions bot commented Mar 2, 2026 •

edited

Loading

datadog-datadog-prod-us1 bot commented Mar 3, 2026 •

edited by datadog-official bot

Loading

gh-worker-devflow-routing-ef8351 bot commented Mar 12, 2026 •

edited

Loading