Skip to content

fix: store real kafka partition+offset in Iceberg via KafkaRecordDeserializer#9

Closed
AndrewAct wants to merge 1 commit into
mainfrom
fix/kafka-offset
Closed

fix: store real kafka partition+offset in Iceberg via KafkaRecordDeserializer#9
AndrewAct wants to merge 1 commit into
mainfrom
fix/kafka-offset

Conversation

@AndrewAct

Copy link
Copy Markdown
Owner

What Changed

Replace SimpleStringSchema with a custom KafkaRecordDeserializationSchema in both normalize.py and cdc_symbol_config.py so that kafka_partition and kafka_offset written to Iceberg silver/bronze layers reflect the actual Kafka record position instead of the placeholder -1.

The -1 fallback path in iceberg_reader.py is retained for backward compatibility with rows written before this fix.

…rializer

Replace SimpleStringSchema with a custom KafkaRecordDeserializationSchema in
both normalize.py and cdc_symbol_config.py so that kafka_partition and
kafka_offset written to Iceberg silver/bronze layers reflect the actual Kafka
record position instead of the placeholder -1.

The -1 fallback path in iceberg_reader.py is retained for backward
compatibility with rows written before this fix.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@AndrewAct AndrewAct closed this May 17, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant