The following are the Azure deployment scenarios.
|Base Storage Layer||Storage - WASB||Storage - ADLS||Storage - Azure SQL Database||Storage - SQL DW||Cluster|
install for WASB
install for ADLS
install for ADLS Gen2
Legend and Notes:
|Deployment Scenario||Description of the Azure-connected deployment|
Location where the is installed in this scenario.
|Base Storage Layer|
When the is first installed, the base storage layer must be set.
|Storage - WASB|
For read/write access to WASB, the base storage layer must be set to WASB.
|Storage - ADLS Gen1|
For read/write access to ADLS (Gen1), the base storage layer must be set to HDFS.
|Storage - ADLS Gen2||For read/write access to ADLS Gen2, the base storage layer must be set to ABFSS.|
|Storage - Azure SQL Database||For Azure installs, you can optionally create a connection to an Azure SQL Database instance.|
|Storage - SQL DW||For Azure installs, you can optionally create a connection to an Azure-hosted instance of SQL DW.|
List of Hadoop cluster types that are supported for integration and job execution at scale.
For more information, see Install for Azure in the Install Guide.
The following table describes the different Azure components that can host or integrate with the . Combinations of one or more of these items constitute one of the deployment scenarios listed in the following section.
|Azure Service||Description||Base Storage Layer||Other Required Azure Services|
Microsoft Azure deployments can integrate with an HDI cluster, which can be pre-existing or created at the time of deployment.
Base storage layer can be HDFS (for ADLS) or WASB.
Optionally, you can integrate the with an Azure Databricks cluster.
|Base storage layer can be HDFS (for ADLS) or WASB.|
Windows Azure Storage Blobs (WASB) extends HDFS to enable access to storage blobs that have not been deployed into the HDI cluster.
Base Storage Layer = WASB
HDI or Azure Databricks
Active Data Lake Store (ADLS) provides a highly scalable file-based storage system within HDI cluster.
Base Storage Layer = HDFS
HDI or Azure Databricks
|ADLS Gen2||Azure Data Lake Storage Gen2 is a set of capabilities dedicated to big data analytics, built on Azure Blob storage.||Base Storage Layer = ABFSS|
The following database connections are optional.
You can read from and write to Hive, a data warehouse built on top of HDI.
You can read from and write to SQL Data Warehouse, a scalable data warehouse solution for Azure.
|Azure SQL Database|
You can read from Azure SQL Database, a SQL Server variant for Azure.