Amazon DataZone is an AWS data management service that addresses the problem of connecting data producers with data consumers. It makes it easy for customers to catalog, discover, share, and govern data stored across AWS, on-premises, and third-party sources. It does this through four key features.

  1. Data catalog: Where data producers publish their data, known as data assets, so that consumers can search for and subscribe to them.
  2. Projects: Individuals or teams in the organization can form separate projects. A project can publish data and request access to data.
  3. Portal: A web UI that provides the mechanisms you need to work with the catalog and the projects themselves.
  4. Governance: The governance layer is where a data consumer, after searching the data catalog, requests access to a particular data asset. Producers or data owners can then approve or reject the request based on the consumer's need to access the data, and they can record why they are granting access. This approval workflow is the kind of governance that modern data architectures need.

Amazon DataZone Demo

How to create a domain

Step 1. Sign in to the AWS Console. Navigate to Amazon DataZone and click the Create domain button.

Step 2. Enter a name for the domain, for example, Sales. Then click the Create domain button.
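
The same domain could also be created from a script instead of the console. The following is a minimal sketch, assuming the boto3 DataZone client and its `create_domain` call. The helper name `create_sales_domain` and the execution role ARN are hypothetical placeholders, and the client is passed in as an argument so the function can be tried without AWS credentials.

```python
from typing import Any


def create_sales_domain(client: Any, execution_role_arn: str, name: str = "Sales") -> dict:
    """Create a DataZone domain and return its id and data portal URL.

    Assumption: `client` behaves like boto3.client("datazone").
    Assumption: `execution_role_arn` points to an IAM role that DataZone may assume.
    """
    response = client.create_domain(
        name=name,
        domainExecutionRole=execution_role_arn,
        description=f"{name} domain created from a script",
    )
    # Domain creation is asynchronous, so the portal URL may not be usable right away.
    return {"id": response["id"], "portal_url": response.get("portalUrl")}
```

With real credentials, `client` would typically be built with `boto3.client("datazone")`, and the returned id is what later calls would use as the domain identifier.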

Access the data portal to create a producer project

Step 1. Click the Open data portal URL in the bottom right corner. The portal opens in a new tab and takes you to the data catalog page.

Step 2. Navigate to the left panel and click the plus button to the right of MY PROJECTS.

Step 3. Enter a name for the project, for example, sales-producer. Select the domain Sales. Then click the CREATE PROJECT button.
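
For readers who prefer scripting, project creation can be sketched as follows. This assumes the boto3 DataZone client and its `create_project` call. The helper name `create_datazone_project` and the domain id shown in the usage note are hypothetical.

```python
from typing import Any


def create_datazone_project(client: Any, domain_id: str, project_name: str) -> str:
    """Create a project inside an existing domain and return the new project id.

    Assumption: `client` behaves like boto3.client("datazone").
    """
    response = client.create_project(
        domainIdentifier=domain_id,  # id of the domain that will own the project
        name=project_name,
    )
    return response["id"]
```

A call such as `create_datazone_project(client, "dzd_example", "sales-producer")` would mirror the console steps above. The same helper works for any other project in the same domain.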

Produce and publish data

Step 1. Navigate to the right panel and click the Query data link. There, you can create the data you need in the query editor.

Step 2. Go back to the portal and click the Publish data button.

Step 3. Select the automatic publishing method. Then give it the name sales-catalog and provide the other necessary information.

Step 4. Copy the database name from the query editor and paste it into the Database name field. Click CREATE PUBLISHING JOB.

Step 5. Go to the project in the left panel, then open the PUBLISHING tab. Click the hyperlink for the table catalog_sales.

The window that appears shows the metadata, schema, and subscriptions for the table. You can also edit the metadata and schema there.

Create a consumer project

Step 1. Navigate to the left panel and click the plus button to the right of MY PROJECTS.

Step 2. Name the project sales-consumer. Then select the domain Sales.

Step 3. Click the CREATE PROJECT button.

Subscribe to data

Step 1. As a consumer, the first thing to do is subscribe to the table published by the producer. Search for the sales catalog, find the listing named Sales Catalog of Anycompany, and click the SUBSCRIBE button.

Step 2. Select the project sales-consumer as the subscribing project. Then enter a reason for the subscription request.

Step 3. Click the SUBSCRIBE button.
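
The subscription request made in these steps can also be sketched in code. This assumes the boto3 DataZone client and its `create_subscription_request` call. The helper name `request_subscription` and all ids passed to it are hypothetical. In practice the listing id would come from a catalog search.

```python
from typing import Any


def request_subscription(
    client: Any,
    domain_id: str,
    listing_id: str,
    consumer_project_id: str,
    reason: str,
) -> dict:
    """Ask for access to one catalog listing on behalf of a consumer project.

    Assumption: `client` behaves like boto3.client("datazone").
    """
    response = client.create_subscription_request(
        domainIdentifier=domain_id,
        requestReason=reason,  # free-text justification shown to the data owner
        subscribedListings=[{"identifier": listing_id}],
        subscribedPrincipals=[{"project": {"identifier": consumer_project_id}}],
    )
    return {"request_id": response["id"], "status": response["status"]}
```

The returned request id is what a producer-side script would need in order to approve or reject the request.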

Step 4. Navigate to the left panel and click the project sales-producer. Click the PUBLISHING tab.

Step 5. Go to SUBSCRIPTION REQUESTS, click the notification in the top right corner, and hit the APPROVE button.
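
On the producer side, the approval could be scripted along these lines. This is a sketch assuming the boto3 DataZone client and its `accept_subscription_request` call. The helper name `approve_subscription` and the ids are hypothetical.

```python
from typing import Any


def approve_subscription(client: Any, domain_id: str, request_id: str, comment: str) -> str:
    """Approve a pending subscription request and return its new status.

    Assumption: `client` behaves like boto3.client("datazone").
    """
    response = client.accept_subscription_request(
        domainIdentifier=domain_id,
        identifier=request_id,    # id of the subscription request to approve
        decisionComment=comment,  # records why access is being granted
    )
    return response["status"]
```

The decision comment is the scripted counterpart of stating why access is granted, which keeps the governance trail intact.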

Step 6. Go to the sales-consumer project and refresh until the status changes to Granted.

Step 7. Click Granted to see which table in the catalog you now have access to.
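
Refreshing until the status changes is a polling loop, which can be sketched generically. The helper `wait_for_status` below is hypothetical and not tied to any particular DataZone call: the caller supplies a `fetch_status` function. The default target `"GRANTED"` simply mirrors the label used in this walkthrough and is an assumption, not a confirmed API value.

```python
import time
from typing import Callable


def wait_for_status(
    fetch_status: Callable[[], str],
    target: str = "GRANTED",
    attempts: int = 10,
    delay_seconds: float = 5.0,
) -> bool:
    """Poll `fetch_status` until it returns `target` or the attempts run out.

    Returns True if the target status was seen, False otherwise.
    """
    for attempt in range(attempts):
        if fetch_status() == target:
            return True
        # Sleep between checks, but not after the final attempt.
        if attempt < attempts - 1:
            time.sleep(delay_seconds)
    return False
```

A bounded number of attempts keeps a script from waiting forever if the grant fails or the request is rejected.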

Conclusion

With Amazon DataZone, an organization can manage and govern access to its data assets using fine-grained controls. Consumers and others in the organization can access data with the right level of privileges.

Metclouds Technologies helps you manage your data with Amazon DataZone.