Data Management Hub - Raw zone Data lake #265
Replies: 7 comments 1 reply
-
This is not in line with the principles of this. |
Beta Was this translation helpful? Give feedback.
-
This is a pattern considered by 2 banking customers (both are in the design
phase) - where they want to land the raw data into the management zone --
to be owned by IT team and then the Domain Owners will pull data from the
raw zone in the management zone to the Data Landing Zone for curation and
transformation.
…On Thu, Dec 16, 2021 at 5:52 PM Marvin Buss ***@***.***> wrote:
This is not in line with the principles of this.
The Data Management Zone is not here for data storage. The Data Management
Zone is about governance and data management and not about integrating,
storing or processing data. Please let us know why this can't be done in
the Landing Zones and what is blocking.
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#257 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AQP3545NX6WLO46WG45WAL3URHKY3ANCNFSM5KF7AQQQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
|
Beta Was this translation helpful? Give feedback.
-
Why not land the data in a Data Landing Zone? The data can still be picked up by Domain Owners once the data has landed there. The Data Management Zone is solely for governance and data management purposes and not data storage or processing. This is recommended because of clear separation of duties and ownership in the data platform. Also, the data in the management zone would always have to land in the same region, as the Vnet and all the other data management zone services are tied to a specific region. Would this not be a problem in that scenario? Also, how will cost be split across domains and businesses for these datasets? Will this the paid centrally? Usually, this is something that is supposed to be split across business groups within an organization. |
Beta Was this translation helpful? Give feedback.
-
This is my understanding from the customer today -
Bank is going to use central india as primary and south India as DR .
1. they need to land all the raw data - as identifying domains etc may take
time - hence they need an area to land all the raw data and the start
thinking about how to segregate to the different domain - so as a temporary
solution could be spin up a data landing zone for this — however again this
could lead to them continuing to use the central landing zone and never
come out of it — this was the same same pattern asked for by the Australian
bank —
Citing same reason that they want to have a centralised control on all the
raw data
2. There will be common data that may be needed by all /some of the domains
eg .. customer data etc — will not be specific to a single domain — is
there a pattern for the common data ?
…On Fri, 17 Dec 2021 at 3:01 PM, Marvin Buss ***@***.***> wrote:
Why not land the data in a Data Landing Zone? The data can still be picked
up by Domain Owners once the data has landed there.
The Data Management Zone is solely for governance and data management
purposes and not data storage or processing. This is recommended because of
clear separation of duties and ownership in the data platform. Also, the
data in the management zone would always have to land in the same region,
as the Vnet and all the other data management zone services are tied to a
specific region. Would this not be a problem in that scenario?
Also, how will cost be split across domains and businesses for these
datasets? Will this the paid centrally? Usually, this is something that is
supposed to be split across business groups within an organization.
—
Reply to this email directly, view it on GitHub
<#257 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AQP35444PBR4PBLG55DKIZDURL7PRANCNFSM5KF7AQQQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
Hi @namrata01Apr,
|
Beta Was this translation helpful? Give feedback.
-
I will move this to a discussion for now. Let's continue the discussion in the "Discussions" Tab. |
Beta Was this translation helpful? Give feedback.
-
@marvinbuss Do we have a reference implementation for the core service provider pattern as well? Or is the reference implementation limited to the harmonized mesh pattern |
Beta Was this translation helpful? Give feedback.
-
2 customers that i am working on have asked for a centralized data lake ( only the raw zone) in the data management hub - Can this be included in the template?
Beta Was this translation helpful? Give feedback.
All reactions