Data warehouses are used for storing structured, processed data.
This type of data is used by Peak for analysis and decision making.
Peak requires a data warehouse to be configured so that core features such as Data Sources and SQL Explorer can be used.
This article describes how to connect a Redshift data warehouse to Peak.
Contents
- Process overview
- Getting to the screens
- Entering the data warehouse details
- Amazon Redshift configuration
- Data lake
- Reviewing your connection
Process overview
To connect Peak to Redshift data warehouse, the following steps will need to be completed:
A Redshift cluster must be created for your Peak organization
This is performed by the Peak support team.
It must be completed before you start configuring the Redshift data warehouse.Enter your data warehouse configuration details in Peak
There are four steps to complete during this process:Details
This step lets you name your data warehouse and specify the type of data warehouse that you want to use (in this case, Redshift).Configuration
This step lets you specify the region and the endpoint for your Redshift cluster.Data Lake
This step lets you link your data warehouse to a data lake.
They must be in the same region.Review
Once you have entered all of your configuration details, this step lets you review everything before saving.
Getting to the screens
To connect to a new data warehouse:
Go to Dock > Data Bridge.
Click ADD DATA WAREHOUSE.
Entering the data warehouse details
Name your data warehouse connection.
The name must be unique to your Peak organization.
Only alphanumeric characters and underscores are allowed.
The name cannot be changed after the connection has been set up.Choose Amazon Redshift and then click NEXT to move to the Configuration stage.
Amazon Redshift configuration
This step lets you specify the region and the endpoint for your Redshift cluster.
Select a region for your data warehouse region.
This specifies where the data warehouse will be physically located.
Make sure that your chosen region complies with your local storage regulations.
Select an endpoint for your Redshift cluster from the dropdown.
The endpoint is the URL where your Redshift cluster is hosted.
The dropdown lists the Redshift endpoint URLs that are available for the selected region.
Data lake
This step lets you link your data warehouse to a data lake.
Linking the two gives you faster speeds, reduced latency and more flexibility.
Select the required on from the drop-down and click NEXT.
Your data warehouse and data lake must be in the same region.
Reviewing your connection
Before you complete the configuration process, you can review the details you have given at each stage of the process.
To make changes, click Edit next to the option you want to change.
Once the details are correct, click FINISH.
You will be taken back to the Data Bridge listing screen and your newly configured data warehouse will be shown as ‘Active’.