You must create an Azure Active Directory (AAD) application and grant it the desired access permissions, such as read/write access to the ADLS resource and read/write access to the Azure Key Vault secrets.
Info |
---|
NOTE: If you are integrating with Azure Databricks and are Managed Identities for authentication, please skip this section. That configuration is covered in a later step. |
This service principal is used by the for access to all Azure resources. For more information, see https://docs.microsoft.com/en-us/azure/azure-resource-manager/resource-group-create-service-principal-portal.After you have registered, acquire the following information: Azure Property | Location | Use |
---|
Application ID | Acquire this value from the Registered app blade of the Azure Portal. | Applied to configuration: azure.applicationid . | Service User Key | Create a key for the Registered app in the Azure Portal. | Applied to configuration: azure.secret . Info |
---|
NOTE: If you are using Azure AD to integrate with an Azure Databricks cluster, the Azure AD secret value stored in azure.secret must begin with an alphanumeric character. This is a known issue. |
| Directory ID | Copy the Directory ID from the Properties blade of Azure Active Directory. | Applied to configuration: azure.directoryId . | Excerpt Include |
---|
| Azure Install Create Registered Application |
---|
| Azure Install Create Registered Application |
---|
nopanel | true |
---|
|
These properties are applied later in the configuration process. If you are integrating the with a pre-existing HDI cluster, additional configuration is required. See Configure for HDInsight. Info |
---|
NOTE: If you created a new HDI cluster as part of the installation, all required is listed below. |
You can integrate the with Azure Databricks. For more information, see Configure for Azure Databricks.For Azure installations, you can set your base storage layer to be HDFS or WASB. Info |
---|
NOTE: The base storage layer must be set after installation. After it has been configured, it cannot be modified. |
Azure storage | webapp.storageProtocol setting | hdfs.protocolOverride setting |
---|
WASB | wasbs | (empty) | ADLS Gen2 | abfss | (empty) | ADLS Gen1 | hdfs | adl | See Set Base Storage Layer. For authentication purposes, the must be integrated with an Azure Key Vault keystore. See Configure Azure Key Vault.If needed, you can integrate the with Azure AD for Single-Sign On to the platform. See Configure SSO for Azure AD.Enable read-only or read-write access to ADLS Gen2. For more information, see Enable ADLS Gen2 Access. Enable read-only or read-write access to ADLS Gen1. For more information, see Enable ADLS Access. Enable read-only or read-write access to WASB. For more information on integrating with WASB, see Enable WASB Access. If you are integrating with relational datastores, please complete the following configuration sections.An encryption key file must be created on the . This key file is shared across all relational connections. See Create Encryption Key File.You can create a connection to the Hive instance on the HDI cluster with some modifications. - High Availability: Natively, Azure supports high availability for HiveServer2 via Zookeeper. Host and port information in the JDBC URL must be replaced with a Zookeeper quorum.
In addition to the other Hive connection properties, please specify the following values for the properties listed below: Property | Description |
---|
Host | Use your Zookeeper quorum value. For the final node of the list, omit the port number. Example: Code Block |
---|
zk1.cloudapp.net:2181,zk2.cloudapp.net:2181,zk3.cloudapp.net |
| Port | Set this value to 2181 . | Connect String options | In addition to any options required for your environment, include the following option: Code Block |
---|
/;serviceDiscoveryMode=zooKeeper;zooKeeperNamespace=hiveserver2 |
| Database | Enter your Hive database name. | Connections are created through the Connections page. See Connections Page. For additional details on creating a connection to Hive, see Create Hive Connections. A Hive connection can also be created using the above property substitutions via programmatic methods. For more information, see Create Azure SQL Database Connections. For more information, see Create SQL DW Connections. |