Page tree

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Current »

 

Contents:


Azure Deployment Scenarios

The following are the Azure deployment scenarios.

Deployment Scenario

Trifacta node installation

Base Storage LayerStorage - WASBStorage - ADLSStorage - SQL DBStorage - SQL DWCluster

Trifacta® Wrangler Enterprise install for WASB

AzureWASBread/writeread onlyreadread/write

 

  • HDI
  • Azure Databricks

 

Trifacta Wrangler Enterprise install for ADLS

AzureHDFS read only read/writereadread/write

 

  • HDI
  • Azure Databricks

 

Legend and Notes:

ColumnNotes
Deployment ScenarioDescription of the Azure-connected deployment

Trifacta node installation

Location where the Trifacta node is installed in this scenario.

Base Storage Layer

When the Trifacta platform is first installed, the base storage layer must be set.

NOTE: After you have begun using the product, you cannot change the base storage layer.

 

For more information, see Set Base Storage Layer.

Storage - WASB

For read/write access to WASB, the base storage layer must be set to WASB. For more information, see Enable WASB Access.

Storage - ADLSFor read/write access to ADLS, the base storage layer must be set to HDFS. For more information, see Enable ADLS Access.
Storage - SQL DBFor Azure installs, you can optionally create a connection to an Azure-hosted instance of SQL DB. For more information, see Create SQL DB Connections.
Storage - SQL DWFor Azure installs, you can optionally create a connection to an Azure-hosted instance of SQL DW. For more information, see Create SQL DW Connections .
Cluster

List of Hadoop cluster types that are supported for integration and job execution at scale.

  • The Trifacta platform can integrate with at most one cluster. It cannot integrate with two different clusters at the same time.
  • Smaller jobs can be executed on the Trifacta Photon running environment, which is hosted on the Trifacta node itself.
  • For more information, see Running Environment Options.

Azure Installations

For more information, see Install for Azure.

Azure Integrations

The following table describes the different Azure components that can host or integrate with the Trifacta platform. Combinations of one or more of these items constitute one of the deployment scenarios listed in the following section.

Azure ServiceDescriptionBase Storage LayerOther Required Azure Services
HDI

Microsoft Azure deployments can integrate with an HDI cluster, which can be pre-existing or created at the time of deployment. See Configure for HDInsight.

Base storage layer can be HDFS (for ADLS) or WASB.

 
Azure Databricks

Optionally, you can integrate the Trifacta platform with an Azure Databricks cluster. See Configure for Azure Databricks.

Base storage layer can be HDFS (for ADLS) or WASB. 
WASB

Windows Azure Storage Blobs (WASB) extends HDFS to enable access to storage blobs that have not been deployed into the HDI cluster. See Enable WASB Access.

Base Storage Layer = WASB

HDI cluster

WASB

Key Vault

ADLS
Active Data Lake Store (ADLS) provides a highly scalable file-based storage system within HDI cluster. See Enable ADLS Access.

Base Storage Layer = HDFS

HDI cluster

ADLS

Key Vault

The following database connections are optional.

Database NameDescription
Hive

You can read from and write to Hive, a data warehouse built on top of HDI.

SQL DW

You can read from and write to SQL Data Warehouse, a scalable data warehouse solution for Azure.

SQL DB

You can read from SQL DB, a SQL Server variant for Azure.

  • No labels

This page has no comments.