dataproductpoc-docs

Data Marketplace

A data marketplace is the shop window for data products.
The users who visit the data marketplace are called “data consumers”.

Data Consumer Types

In laymans terms, a data consumer can be:-

Alignment to the business

Another significant factor that the data mesh/data product concept introduces is the need to align data products and datasets to specific business areas.

So, if you work in Acme Department A, you should be able to view data products and datasets that are relevant only to your department and not have to wade through data products and datasets that are not relevant.

As with any business organisation, there are organisational hierarchies to consider, so some data consumers should be entitled to be able to see all data products and datasets relevant to the entire organisation, others just those that are relevant to a particular division, others just those that are relevant to a particular deparment, and within a department you may also want to restrict datasets from certain users.

Data Marketplace audience requirements

So that only permitted and appropriate data products and datasets are provided to a data consumer, we will need to:-

These identifications will be set up via the data product admin website, but the data marketplace will need to be able to use the information.

Metadata

Metadata simply means information that we need to store so that it can be presented back to the data consumer via the data marketplace, so that they fully understand what the data product and datasets can provide.

There are going to be various types of metadata that need to be captured to support the data marketplace that relate to:-

Data Consumer Metadata

Data consumer metadata needs to be stored so that we know everything about our target audience so that the correct data products are provided to them:- The following should be considered as a minimum set:-

Data Product Metadata

A particular business area can create 1 or more data products. Each data product will have a data product owner Each data product can provide 1 or more datasets. The data product metadata should cover just what is required for a data product and not what is required for each dataset The following should be considered to be the minimum required:-

Dataset Metadata

Dataset metadata will provide the data consumer with full details as to what they can expect from the data

The following should be considered a minimum set of metadata:-

Technology requirements

The data marketplace will need to:-

Application requirements

The data marketplace should:-

Off-the-shelf Data Marketplace

If you’re using Collibra as your data governance tool, it provides a data marketplace which can utilise the data product & dataset metadata within Collibra. Bear in mind that you may not be able to tailor such a solution sufficiently for your data consumers, however.