[PostgreSQL / SQL Server sources] Change Data Capture (CDC) implementation questions #52058
Unanswered
Kévin (kev-datams) asked this question in Connector Questions
Replies: 1 comment
Hello Marcos Marx (@marcosmarxm), I created this discussion as we agreed. Could you please tag people who may be able to help? 🙏 FYI Francisco Carneiro (@FVidalCarneiro), juoll-data, ichrzanowska
-
Hello Airbyte team,
We are considering using `CDC` to retrieve data from both `PostgreSQL` and `SQLServer` sources (with 3 days of log retention). […] `FULL SYNC`, our main goal is to absolutely minimize the impact on production DB instances (CPU, RAM, locks, tempdb size, etc.).

💡 As we have some huge tables to sync, we are counting on the Airbyte `resumable full refresh` capability to be able to: […]

❓ Regarding this feature, we have some questions:

- […] `PostgreSQL` and `SQLServer`?
- how `resumable full refresh` concretely operates (technical mechanisms / cursor usage / snapshot isolation / temp disk flush / ...) on:
  - `PostgreSQL`?
  - `SQLServer`?
- considering a first sync lasting a long time (e.g. 48 hours), what are the risks on production DB instances? (CPU, RAM, locks, tempdb size, ...)
- we observed the `LSN commit behaviour` parameter on the `PostgreSQL` connector:
  - the `While reading data` value will: […]
  - […] `SQLServer`, does it mean it is:
    - […] `While reading data` behavior? (if so, how?)
    - […] `After loading data in the destination` behavior? (if so, how?)

I guess the above questions are common to many people wishing to synchronize large volumes of data, and the answers might be of great interest to them! 🙏
Thanks a lot for any clarifications 🔥
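To make the PostgreSQL side of the `LSN commit behaviour` question more concrete, here is a minimal sketch of how the WAL backlog held by a replication slot could be estimated during a long first sync. It assumes the two LSN strings come from `pg_current_wal_lsn()` and the `confirmed_flush_lsn` column of `pg_replication_slots`. The helper names `parse_pg_lsn` and `retained_wal_bytes` are hypothetical, and the sketch says nothing about when Airbyte actually advances the slot, which is exactly what the question asks.

```python
def parse_pg_lsn(lsn: str) -> int:
    """Convert a PostgreSQL LSN such as '16/B374D848' to a 64-bit byte position.

    An LSN is printed as two hexadecimal numbers separated by a slash:
    the high 32 bits and the low 32 bits of the WAL position.
    """
    high, low = lsn.split("/")
    return (int(high, 16) << 32) | int(low, 16)


def retained_wal_bytes(current_lsn: str, confirmed_flush_lsn: str) -> int:
    """Bytes of WAL written after the position the slot consumer has confirmed.

    Both arguments are assumed to be read from the source database
    (hypothetical inputs here; no database connection is made).
    """
    return parse_pg_lsn(current_lsn) - parse_pg_lsn(confirmed_flush_lsn)


if __name__ == "__main__":
    # Illustrative values only, not taken from a real instance.
    backlog = retained_wal_bytes("1/00000000", "0/F0000000")
    print(f"WAL not yet confirmed by the slot: {backlog / 1024 ** 2:.0f} MiB")
```

If the confirmed position only moves once data is loaded in the destination, this number would be expected to keep growing for the whole duration of a long sync, which is why the choice between the two behaviors matters for disk usage on the source.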