
Conversation

@taylorobyen

Summary

NIFI-15305 Fixes PutDatabaseRecord's inconsistent timestamp parsing when handling epoch timestamps.

Epoch timestamps are supposed to be in milliseconds, but when fractional milliseconds are included, the value is incorrectly interpreted as seconds, resulting in dates very far in the future.

Behavior prior to my patch:
Correct handling of a timestamp with only whole milliseconds:

{"ts": 1765056655230}

Timestamp loaded into PostgreSQL: 2025-12-06 16:30:55.230 -0500

Incorrect handling of a timestamp that includes fractional milliseconds:

{"ts": 1765056655230.746}

Timestamp loaded into PostgreSQL: 57902-06-03 07:20:30746. -0400

This bug was introduced in #8332, which aimed to allow microsecond precision for epoch timestamps. Before #8332, fractional milliseconds were truncated because the timestamp was always converted from String to long.

My implementation allows for microsecond precision while handling all epoch timestamps as milliseconds.
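
In practice, that means treating the whole number as epoch milliseconds and the fraction as sub-millisecond precision. A minimal sketch of this interpretation (illustrative Java only, not the actual patch):

import java.math.BigDecimal;
import java.sql.Timestamp;
import java.time.Instant;

// Whole part = epoch milliseconds, fractional part = sub-millisecond precision
final BigDecimal epochMillis = new BigDecimal("1765056655230.746");
final long millis = epochMillis.longValue();               // 1765056655230
final int nanosWithinMilli = epochMillis.remainder(BigDecimal.ONE)
        .movePointRight(6).intValue();                     // 0.746 ms -> 746000 ns
final Timestamp ts = Timestamp.from(
        Instant.ofEpochMilli(millis).plusNanos(nanosWithinMilli));
// 2025-12-06 21:30:55.230746 UTC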

Tracking

Please complete the following tracking steps prior to pull request creation.

Issue Tracking

Pull Request Tracking

  • Pull Request title starts with Apache NiFi Jira issue number, such as NIFI-00000
  • Pull Request commit message starts with Apache NiFi Jira issue number, such as NIFI-00000

Pull Request Formatting

  • Pull Request based on current revision of the main branch
  • Pull Request refers to a feature branch with one commit containing changes

Verification

Please indicate the verification steps performed prior to pull request creation.

Build

  • Build completed using ./mvnw clean install -P contrib-check
    • JDK 21
    • JDK 25

Licensing

  • New dependencies are compatible with the Apache License 2.0 according to the License Policy
  • New dependencies are documented in applicable LICENSE and NOTICE files

Documentation

  • Documentation formatting appears as expected in rendered files

Contributor

@exceptionfactory left a comment


Thanks for proposing this improvement @taylorobyen.

On initial read, this appears to break expected behavior. As indicated in the changes for NIFI-12710, the parsing logic expects that a floating point number contains seconds before the period, and nanoseconds after the period.

The proposed changes appear to alter the behavior, expecting milliseconds instead of seconds before the period.

Can you provide more detail on the expected use case where the input field contains milliseconds.nanoseconds?

There may be options to support a change based on maximum expected numbers of seconds versus milliseconds, but current behavior should be preserved.
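
As a sketch of that expectation (illustrative Java, not the actual NiFi implementation), a value such as 1764673335.503607 is read as epoch seconds before the period and fractional seconds after it:

import java.math.BigDecimal;
import java.time.Instant;

final BigDecimal epochSeconds = new BigDecimal("1764673335.503607");
final long seconds = epochSeconds.longValue();             // 1764673335
final long nanos = epochSeconds.remainder(BigDecimal.ONE)
        .movePointRight(9).longValue();                    // 503607000
final Instant instant = Instant.ofEpochSecond(seconds, nanos);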

// Less precise timestamp than other tests as double is less precise than BigDecimal
final double timestamp = 1764673335503.607;

final BigDecimal bd = new BigDecimal(Double.toString(timestamp));
Contributor


Is there a particular reason for converting the timestamp double to a String, as opposed to just passing it to the BigDecimal constructor?

Author


Passing a double directly to the BigDecimal constructor preserves the binary floating-point approximation rather than the intended decimal value. I’ve updated this to use BigDecimal.valueOf() instead.
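
A quick illustration of the difference (standard Java behavior):

import java.math.BigDecimal;

System.out.println(new BigDecimal(0.1));
// 0.1000000000000000055511151231257827021181583404541015625
System.out.println(BigDecimal.valueOf(0.1));  // goes through Double.toString
// 0.1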

@taylorobyen
Author

taylorobyen commented Dec 25, 2025

If it is intended behavior to parse the timestamp differently based on the input, I think the reader controller services should have their descriptions updated. NIFI-12710 modified the parsing under the hood, but no description was updated to indicate to the end user that the behavior had changed.

When I upgraded my production instances from NiFi 1.20.0 to NiFi 2.5.0 and noticed that my timestamps were being parsed incorrectly, the first place I checked was the JsonTreeReader timestamp field to see whether parsing had changed. It wasn't until I checked out the project and read the source code that I found I needed to provide my milliseconds as integers for my timestamps to parse correctly. The current description from JsonTreeReader only mentions milliseconds (other readers, such as CSV, have the same description):

[Screenshot: JsonTreeReader Timestamp Format property description]

My use case for these timestamps is that I load JSON payloads into my PostgreSQL database whose timestamp fields are generated in Python, e.g.:

ms = time.time() * 1000

which will generate something like 1766663148057.1487. To work around the current behavior, I'm rounding the value down to an integer so that the timestamps are parsed correctly.

Instead of inferring the epoch unit from the numeric value, it may be better to let the user explicitly specify the unit, defaulting to milliseconds to preserve existing behavior. For example, the timestamp format could accept epoch_seconds, epoch_millis, or epoch_micros. This avoids ambiguity and is less error-prone than inference.
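
A rough sketch of that idea (the unit names and method below are hypothetical, not an existing NiFi option):

import java.math.BigDecimal;
import java.time.Instant;

static Instant parseEpoch(final BigDecimal value, final String unit) {
    return switch (unit) {
        case "epoch_seconds" -> Instant.ofEpochSecond(value.longValue(),
                value.remainder(BigDecimal.ONE).movePointRight(9).longValue());
        case "epoch_millis" -> Instant.ofEpochMilli(value.longValue())
                .plusNanos(value.remainder(BigDecimal.ONE).movePointRight(6).longValue());
        case "epoch_micros" -> Instant.ofEpochSecond(0, value.movePointRight(3).longValue());
        default -> throw new IllegalArgumentException("Unsupported unit: " + unit);
    };
}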

@exceptionfactory
Contributor

Thanks for the reply @taylorobyen.

After evaluating the details, the core of the current problem is that the conversion process does not handle floating point numbers, represented as strings, with the same level of sanity checking as long numbers.

It is important to preserve current parsing behavior in a way that also addresses the current issue, if at all possible. With that goal in mind, I put together an alternative pull request that addresses the core issue in #10697. If you are able to evaluate that approach and test it in your environment, that would be helpful for comparison.

@exceptionfactory
Contributor

Thanks again for raising this issue and proposing an initial approach @taylorobyen. I'm closing this in favor of the alternative in #10697, which preserves existing behavior but handles this scenario. If you observe additional issues, feel free to raise a new Jira issue for evaluation.
