My use case: I am setting up data replication from PostgreSQL to BigQuery. The source connector is the Debezium PostgreSQL connector (for CDC), and the sink connector is the BigQuery connector. It works quite well. However, because the BigQuery connector writes data (with the before/after schema) in append-only mode, I have to write views in BigQuery to de-duplicate the data and extract the latest rows (removing deleted rows and taking the latest `after` fields).
Is there any way to de-duplicate the data on the fly as it is written from Kafka into BigQuery?
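
For illustration, the logic that such a de-duplicating view has to implement can be sketched in Python. This is a minimal sketch, not the actual BigQuery SQL and not part of either connector. It assumes each appended row keeps the Debezium envelope fields `op`, `before`, `after`, and `ts_ms`, that the table has a primary key column named `id` (hypothetical), and that `ts_ms` is good enough to order events for a given key. The function name `latest_rows` is also hypothetical.

```python
from typing import Any, Dict, Iterable, List


def latest_rows(events: Iterable[Dict[str, Any]], key: str = "id") -> List[Dict[str, Any]]:
    """Collapse append-only change events into the current rows of the table."""
    latest: Dict[Any, Dict[str, Any]] = {}
    for event in events:
        # Delete events ("d") carry the key in `before`; all others carry it in `after`.
        image = event["before"] if event["op"] == "d" else event["after"]
        row_key = image[key]
        current = latest.get(row_key)
        # Keep the most recent event per key; on a tie, the later-arriving event wins.
        if current is None or event["ts_ms"] >= current["ts_ms"]:
            latest[row_key] = event
    # Drop keys whose most recent event is a delete and return the `after` images.
    return [e["after"] for e in latest.values() if e["op"] != "d"]


# Example: row 1 is created and then updated, row 2 is created and then deleted.
events = [
    {"op": "c", "before": None, "after": {"id": 1, "name": "a"}, "ts_ms": 1},
    {"op": "u", "before": {"id": 1, "name": "a"}, "after": {"id": 1, "name": "b"}, "ts_ms": 2},
    {"op": "c", "before": None, "after": {"id": 2, "name": "x"}, "ts_ms": 3},
    {"op": "d", "before": {"id": 2, "name": "x"}, "after": None, "ts_ms": 4},
]
print(latest_rows(events))
```

Doing this "on the fly" would mean applying the same keep-latest-per-key, drop-deletes rule before or during the write to BigQuery instead of in a view at query time.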