Data mesh is a hot topic. In fact, the data mesh has been identified as a top trend of 2021. So what’s all the noise about? A lot. Here’s an introduction to get you started and on the path to learning more.
The data mesh represents a paradigm shift: the concept is to design and develop data architectures around distributed ownership rather than a centralized data warehouse or data lake. At a high level, a data mesh is a decentralized data architecture broken into smaller portions and oriented around data domains. Run by data experts and data owners, it serves up data products and uses a common, self-serve data infrastructure with centralized governance and standardization.
Thoughtworks technology consultant Zhamak Dehghani first introduced the data mesh concept in the blog post How to Move Beyond a Monolithic Data Lake to a Distributed Data Mesh. The idea spread quickly, and the post has become the de facto reference for describing the data mesh.
“The data mesh platform is an intentionally designed distributed data architecture, under centralized governance and standardization for interoperability, enabled by a shared and harmonized self-serve data infrastructure.”
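To make the definition above more concrete, here is a minimal Python sketch of the three pieces it names: domain-owned data products, centralized governance, and a shared self-serve layer. Every name in it (`DataProduct`, `GovernancePolicy`, `MeshCatalog`, the required columns) is a hypothetical illustration, not part of any data mesh standard or tool, and a real platform would involve far more than an in-memory registry.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class DataProduct:
    """A dataset published and owned by a domain team (hypothetical structure)."""
    name: str
    domain: str
    owner: str
    schema: dict  # column name -> type name


@dataclass
class GovernancePolicy:
    """Centralized standards every data product must meet (assumed rules)."""
    required_columns: tuple = ("id", "updated_at")

    def violations(self, product: DataProduct) -> list:
        problems = []
        if not product.owner:
            problems.append("missing owner")
        for column in self.required_columns:
            if column not in product.schema:
                problems.append(f"missing column: {column}")
        return problems


@dataclass
class MeshCatalog:
    """Shared self-serve registry: domains publish, consumers discover."""
    policy: GovernancePolicy
    products: dict = field(default_factory=dict)

    def publish(self, product: DataProduct) -> None:
        # Governance is applied centrally, but ownership stays with the domain.
        problems = self.policy.violations(product)
        if problems:
            raise ValueError(f"{product.name} rejected: {', '.join(problems)}")
        self.products[(product.domain, product.name)] = product

    def discover(self, domain: str) -> list:
        return sorted(p.name for p in self.products.values() if p.domain == domain)


catalog = MeshCatalog(policy=GovernancePolicy())
catalog.publish(DataProduct(
    name="orders",
    domain="sales",
    owner="sales-data-team",
    schema={"id": "int", "updated_at": "timestamp", "total": "decimal"},
))
print(catalog.discover("sales"))  # prints ['orders']
```

The design choice to notice is that the catalog never stores or transforms the data itself. It only checks that each domain's product meets the shared standard, which is one way to read "centralized governance" alongside "distributed ownership".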
There are many reasons the data mesh is moving from a “fringe” idea to mainstream consideration, but here are some of the biggest ones.
A data mesh applies product thinking: data is treated as having its own intrinsic value rather than as a byproduct, and domain experts manage their own data products. This also removes the intermediary (often IT), allowing direct interaction between those who deeply understand the data and the business stakeholders who deeply understand the use case for it.