sync with open source how#118

Draft
lesterhaynes wants to merge 8440 commits into linkedin:li_trunk from apache:master

@lesterhaynes

Please add a meaningful description for your change here


Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:

  • Mention the appropriate issue in your description (for example: addresses #123), if applicable. This will automatically add a link to the pull request in the issue. If you would like the issue to automatically close on merging the pull request, comment fixes #<ISSUE NUMBER> instead.
  • Update CHANGES.md with noteworthy changes.
  • If this contribution is large, please file an Apache Individual Contributor License Agreement.

See the Contributor Guide for more tips on how to make the review process smoother.

To check the build health, please visit https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md

GitHub Actions Tests Status (on master branch)

Build python source distribution and wheels
Python tests
Java tests
Go tests

See CI.md for more information about GitHub Actions CI.

dependabot bot and others added 15 commits January 28, 2026 10:16
…#37439)

Bumps [github.com/nats-io/nats-server/v2](https://github.com/nats-io/nats-server) from 2.12.3 to 2.12.4.
- [Release notes](https://github.com/nats-io/nats-server/releases)
- [Commits](nats-io/nats-server@v2.12.3...v2.12.4)

---
updated-dependencies:
- dependency-name: github.com/nats-io/nats-server/v2
  dependency-version: 2.12.4
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Add exception chaining to preserve error context

- Add 'from e' to exception re-raises in CloudSQLEnrichmentHandler
- Add exception chaining in processes.py for OSError and CalledProcessError
- Improve logging in core.py to preserve traceback context

This improves debuggability by preserving the full exception chain,
following Python PEP 3134 best practices.

Fixes #37422

* Fix yapf formatting for logging.warning statement

* Fix yapf formatting: put logging arguments on single line
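
The exception-chaining change above (`from e` on re-raise) can be illustrated with a minimal sketch. This is not the code from CloudSQLEnrichmentHandler or processes.py; `CommandError` and `run_command` are hypothetical names used only for illustration.

```python
import subprocess


class CommandError(RuntimeError):
    """Hypothetical wrapper error, used only for this illustration."""


def run_command(args):
    """Run a command, re-raising failures with the original cause attached."""
    try:
        return subprocess.run(args, check=True, capture_output=True)
    except OSError as e:
        # `from e` stores the original exception on __cause__, so the
        # traceback shows both the low-level failure and this wrapper.
        raise CommandError(f"could not start {args!r}") from e
    except subprocess.CalledProcessError as e:
        raise CommandError(f"{args!r} exited with {e.returncode}") from e
```

Without `from e`, Python still records the original error as implicit context, but the traceback reports it as an error that occurred while handling another one, not as the direct cause.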
…e cache directory (#37360)

* Fix cached wheels used in future runs

* address review comments

* run post tests

* add .github/trigger_files/beam_PostCommit_Python_Examples_Dataflow.json
* Fix python postcommit

* Trigger postcommit
* remove pubsublite from java sdk

* revert build file change impacting pubsub

* remove unused dependencies
Bumps [github.com/aws/aws-sdk-go-v2/feature/s3/manager](https://github.com/aws/aws-sdk-go-v2) from 1.21.0 to 1.21.1.
- [Release notes](https://github.com/aws/aws-sdk-go-v2/releases)
- [Changelog](https://github.com/aws/aws-sdk-go-v2/blob/main/changelog-template.json)
- [Commits](aws/aws-sdk-go-v2@v1.21.0...v1.21.1)

---
updated-dependencies:
- dependency-name: github.com/aws/aws-sdk-go-v2/feature/s3/manager
  dependency-version: 1.21.1
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* remove groovy pubsublite dependencies

* remove checkstyle suppressions

* remove role config for pubsublite service and update role files
#37428)

* Refactor: Unify batching args in ModelHandler constructors

- Added 'max_batch_weight' and 'element_size_fn' to __init__ of all ModelHandlers (PyTorch, Sklearn, TF, ONNX, XGBoost, TensorRT, Hugging Face, vLLM, VertexAI).
- Updated subclasses to delegate these args to 'super().__init__' or internal batching kwargs.
- Removed 'with_element_size_fn' builder method from base class to enforce API consistency.
- Updated tests to reflect the new API signature.

* Address review comments: refactor tests and fix linting
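
As a rough sketch of what weight-based batching with these two arguments could look like: the parameter names `max_batch_weight` and `element_size_fn` come from the commit message, but the function below and its exact semantics are assumptions for illustration, not Beam's batching implementation.

```python
from typing import Callable, Iterable, Iterator, List, TypeVar

T = TypeVar("T")


def batch_by_weight(
    elements: Iterable[T],
    max_batch_weight: int,
    element_size_fn: Callable[[T], int] = lambda _: 1,
) -> Iterator[List[T]]:
    """Group elements so each batch's total weight stays within the limit.

    Assumed semantics: an element heavier than the limit gets its own batch.
    """
    batch: List[T] = []
    weight = 0
    for element in elements:
        size = element_size_fn(element)
        # Flush the current batch first if this element would exceed the limit.
        if batch and weight + size > max_batch_weight:
            yield batch
            batch, weight = [], 0
        batch.append(element)
        weight += size
    if batch:
        yield batch
```

With the default `element_size_fn`, every element weighs 1, so `max_batch_weight` behaves like a maximum batch size.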
…iguration (#37356)

* [#37198] Make withBackOffSupplier public to enable bounded retry configuration

Users need to configure bounded backoff to prevent infinite retry loops.
Making withBackOffSupplier public allows users to set
FluentBackoff.DEFAULT.withMaxRetries(n) and control retry behavior.

Changes:
- Changed withBackOffSupplier() visibility from package-private to public
- Added comprehensive integration test with zero-delay BoundedBackOff
- Test verifies: responses empty, 1 failure emitted, call count = maxRetries+1

The test uses a serializable BoundedBackOff class with assertions on both
PAssert (pipeline outputs) and Metrics (retry counts) to ensure bounded
retry behavior works correctly.

Fixes #37198
Related to #37176

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

* Add explicit public modifier to withSleeperSupplier method

Address review feedback from @damccorm to explicitly add
the public keyword to withSleeperSupplier() for consistency.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Fix Spotless formatting for withSleeperSupplier method

Break long line to comply with code style requirements.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

---------

Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>
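
The relationship the bounded-retry test checks (call count = maxRetries + 1) can be sketched in a language-neutral way. The Python below is an illustration only; `call_with_bounded_retries` is a hypothetical helper, not the Java `withBackOffSupplier` API or `FluentBackoff`, and it uses zero delay between attempts.

```python
def call_with_bounded_retries(fn, max_retries):
    """Call fn until it succeeds or max_retries retries are used up.

    Returns (result, attempts). Re-raises the last error once retries run out.
    """
    attempts = 0
    while True:
        attempts += 1
        try:
            return fn(), attempts
        except Exception:
            # The first call is not a retry, so a caller that always fails
            # is invoked max_retries + 1 times before the error propagates.
            if attempts > max_retries:
                raise
```

A function that always fails is therefore called once initially and then once per retry, which is why an unbounded backoff loops forever and a bounded one stops after a fixed number of calls.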
Bumps [github.com/lib/pq](https://github.com/lib/pq) from 1.10.9 to 1.11.1.
- [Release notes](https://github.com/lib/pq/releases)
- [Changelog](https://github.com/lib/pq/blob/master/CHANGELOG.md)
- [Commits](lib/pq@v1.10.9...v1.11.1)

---
updated-dependencies:
- dependency-name: github.com/lib/pq
  dependency-version: 1.11.1
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
arunpandianp and others added 5 commits February 2, 2026 10:02
* Upgrade GCP Bom to LTS version

* license
#37112)

* Allow multiprocessshared to spawn process and delete directly with obj

* Remove oom protection

* Resolve comments

* Rename unsafe_hard_delete for the proxy object to prevent collision

* Remove support for proxy on proxy to avoid complexity

* Fix import order

* Update reap test to be compatible with Windows

* Update print to logging

* Try to tearDown test in a cleaner way

* Try patching atexit call to prevent hanging on Windows

* Try weakref so Windows can GC the process

* Try GC manually to make sure p is cleaned up

* Use a different way to check if parent is alive

* Close the pipe atexit as well
* Disable build isolation for workflow tarball by default. Add an env var to enable it if needed.

* Apply build isolation setting to all pip install commands. Use experiment to control.

* Only search experiment in experiment options.
Amar3tto and others added 30 commits March 9, 2026 23:52
* ITs for RESTCatalog using BLMS

* update rest catalog config

* use top-level gcs bucket for warehouse
…t cannot be decoded successfully (#37762)

Such messages will log an error but are otherwise discarded.
Update PaneInfoCoder to throw a CoderException instead of an ArrayIndexOutOfBoundsException.
Fix PreCommit Python ML tests with ML deps installed
Bumps [minimatch](https://github.com/isaacs/minimatch) from 3.1.2 to 3.1.5.
- [Changelog](https://github.com/isaacs/minimatch/blob/main/changelog.md)
- [Commits](isaacs/minimatch@v3.1.2...v3.1.5)

---
updated-dependencies:
- dependency-name: minimatch
  dependency-version: 3.1.5
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
…37795)

* Fix #37738: handle Debezium DELETE records without valueSchema

* refactor: replace fully qualified class names with imports in KafkaConnectSchemaTest.
* Loosen GRPC requirements.

* Change link to prevent throttling.

* Change link to prevent throttling.
* Support inferring schemas from Python dataclasses

* Address comments; Revert native_type_compatibility _TypeMapEntry change

* Add unit test for named tuple and dataclasses encoded by RowCoder and passing through GBK

* Fix lint
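
To give a feel for what inferring a schema from a dataclass involves, here is a minimal standard-library sketch. It is not Beam's implementation; `Purchase` and `infer_fields` are hypothetical names, and a real schema would also have to map each Python type to a schema field type.

```python
import dataclasses
import typing


@dataclasses.dataclass
class Purchase:  # hypothetical example type
    item: str
    quantity: int
    price: float


def infer_fields(cls):
    """Return (name, type) pairs for a dataclass, in declaration order."""
    if not dataclasses.is_dataclass(cls):
        raise TypeError(f"{cls!r} is not a dataclass")
    # get_type_hints resolves string annotations to actual types.
    hints = typing.get_type_hints(cls)
    return [(f.name, hints[f.name]) for f in dataclasses.fields(cls)]
```

Calling `infer_fields(Purchase)` yields the field names paired with `str`, `int`, and `float`, which is the information a row-style coder needs to encode each field.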
* Pin cloudml benchmark deps to avoid pip resolution-too-deep on Dataflow

* Reduce Dataflow inactivity timeout risk for TFT CloudML benchmark

* Tighten CloudML TFT benchmark requirements

* focus fix on dependency bounds only
… InvalidLink checks (#37773)

* Fix InvalidInlineTag, InvalidParam, InvalidBlockTag and InvalidLink javadocs

* Fix JdbcUtil after merge

* spotless

* changes

* leave ignore block

* Fix InvalidLink and restore InvalidBlockTag to disabledChecks

* Remove duplicate entry
Co-authored-by: Claude <cvandermerwe@google.com>
Co-authored-by: Claude <cvandermerwe@google.com>
* Fix Vertex AI image embedding test

* Removed unused import