Page tree

 

Contents:


Azure Deployment Scenarios

The following are the Azure deployment scenarios.

Deployment Scenario

Trifacta node installation

Base Storage LayerStorage - WASBStorage - ADLS Gen1/Gen2Storage - Azure SQL DatabaseStorage - SQL DWCluster

Trifacta® Wrangler Enterprise install for WASB

AzureWASBread/writeread onlyreadread/write

 

  • HDI
  • Azure Databricks

 

Trifacta Wrangler Enterprise install for ADLS Gen1

AzureHDFS read only read/writereadread/write

 

  • HDI
  • Azure Databricks

 

Trifacta Wrangler Enterprise install for ADLS Gen2

AzureABFSSread only
  • read/write to ADLS Gen2
  • read only to ADLS Gen1
readread/write
  • Azure Databricks

Legend and Notes:

ColumnNotes
Deployment ScenarioDescription of the Azure-connected deployment

Trifacta node installation

Location where the Trifacta node is installed in this scenario.

Base Storage Layer

When the Trifacta platform is first installed, the base storage layer must be set.

NOTE: After you have begun using the product, you cannot change the base storage layer.

Storage - WASB

For read/write access to WASB, the base storage layer must be set to WASB.

Storage - ADLS Gen1

For read/write access to ADLS Gen1, the base storage layer must be set to HDFS.

Storage - ADLS Gen2For read/write access to ADLS Gen2, the base storage layer must be set to ABFSS.
Storage - Azure SQL DatabaseFor Azure installs, you can optionally create a connection to an Azure SQL Database instance.
Storage - SQL DWFor Azure installs, you can optionally create a connection to an Azure-hosted instance of SQL DW.
Cluster

List of Hadoop cluster types that are supported for integration and job execution at scale.

  • The Trifacta platform can integrate with at most one cluster. It cannot integrate with two different clusters at the same time.
  • Smaller jobs can be executed on the Trifacta Photon running environment, which is hosted on the Trifacta node itself.
  • For more information, see Running Environment Options in the Configuration Guide.

Azure Installations

For more information, see Install for Azure in the Install Guide.

Azure Integrations

The following table describes the different Azure components that can host or integrate with the Trifacta platform. Combinations of one or more of these items constitute one of the deployment scenarios listed in the following section.

Azure ServiceDescriptionBase Storage LayerOther Required Azure Services
HDI

Microsoft Azure deployments can integrate with an HDI cluster, which can be pre-existing or created at the time of deployment.

Base storage layer can be HDFS (for ADLS Gen1) or WASB.

 
Azure Databricks

Optionally, you can integrate the Trifacta platform with an Azure Databricks cluster.

Base storage layer can be HDFS (for ADLS Gen1), ABFSS (for ADLS Gen2), or WASB. 
WASB

Windows Azure Storage Blobs (WASB) extends HDFS to enable access to storage blobs that have not been deployed into the HDI cluster.

Base Storage Layer = WASB

HDI or Azure Databricks

Key Vault

ADLS Gen1
Active Data Lake Store (ADLS) provides a highly scalable file-based storage system within HDI cluster.

Base Storage Layer = HDFS

HDI or Azure Databricks

Key Vault

ADLS Gen2Azure Data Lake Storage Gen2 is a set of capabilities dedicated to big data analytics, built on Azure Blob storage.Base Storage Layer = ABFSS

Azure Databricks

Key Vault

The following database connections are optional.

Database NameDescription
Hive

You can read from and write to Hive, a data warehouse built on top of HDI.

NOTE: Access to Hive is not supported on Azure Databricks.



SQL DW

You can read from and write to SQL Data Warehouse, a scalable data warehouse solution for Azure.


Azure SQL Database

You can read from Azure SQL Database, a SQL Server variant for Azure.


This page has no comments.