Please complete the following steps in the listed order to configure your installed instance of the
|D s platform|
Deploy running environment cluster and
D s node Info
NOTE: The running environment cluster can be deployed as part of the installation process. You can also integrate the platform with a pre-existing cluster. Details are below.
on the node.
D s platform
For more information, see Install for Azure.
Create registered application
You must create an Azure Active Directory (AAD) application and grant it the desired access permissions, such as read/write access to resources and read/write access to the Azure Key Vault secrets.
This service principal is used by the
After you have registered, acquire the following information:
These properties are applied later in the configuration process.
Configure the Platform
Configure for HDI
If you are integrating the
Configure for Azure Databricks
You can integrate the
Configure base storage layer
For Azure installations, you can set your base storage layer to be HDFS or WASB.
Configure for Key Vault
For authentication purposes, the
Configure for SSO
If needed, you can integrate the
Configure for ADLS Gen2
Enable read-only or read-write access to ADLS Gen2. For more information, see Enable ADLS Gen2 Access.
Configure for ADLS Gen1
Enable read-only or read-write access to ADLS Gen1. For more information, see Enable ADLS Gen1 Access.
Configure for WASB
Enable read-only or read-write access to WASB. For more information on integrating with WASB, see Enable WASB Access.
Configure relational connections
If you are integrating
Create encryption key file
An encryption key file must be created on the
Create Hive connection
You can create a connection to the Hive instance on the HDI cluster with some modifications.
In addition to the other Hive connection properties, please specify the following values for the properties listed below:
Connections are created through the Connections page. See Connections Page.
For additional details on creating a connection to Hive, see Create Hive Connections.
A Hive connection can also be created using the above property substitutions via programmatic methods.
Create Azure SQL Database connection
For more information, see Create Azure SQL Database Connections.
Create Azure SQL DW connection
For more information, see Create SQL DW Connections.
- Load a dataset from the cluster.
- Perform a few simple steps on the dataset.
- Click Run Job in the Transformer page.
- When specifying the job:
- Click the Profile Results checkbox.
- Select Spark.
- When the job completes, verify that the results have been written to the appropriate location.