Adrian
1 min readMar 20, 2024

--

So, I have an ERP system and a whole ecosystem of other systems around it. How is supposed to help me a set of data products to address the data quality issues? Do I need to create a series of data products (e.g. to replace a data warehouse or data lake) to address the issues?

Moreover, data quality is judged in respect to a set of needs that can vary across systems. How do I address this?

From what I understand, a data product will make sure that the data it receives is consisted with its internal expectations. What happens if suddenly new data with different rules need to be consumed? I suppose the values will be rejected. How can this address the quality existing in source systems?

Can you estimate how much logic needs to be included in a product to process the data via its pipelines, to validate the data, to cope with exceptions or to implement the changes?

My general feeling is that a data product will become a little monster that attempts to handle too much than to be pragmatic. In the end I need the data as reflected in the source system, and they should be clean.

--

--

Adrian
Adrian

Written by Adrian

IT professional/blogger with more than 24 years experience in IT - Software Engineering, BI & Analytics, Data, Project, Quality, Database & Knowledge Management

No responses yet