This install process applies to installing on an Azure infrastructure that you manage.
Azure Marketplace deployments:
NOTE: Content in this section does not apply to deployments from the Azure Marketplace, which provide fewer deployment and configuration options. For more information, see the Azure Marketplace.
NOTE: All hardware in use for supporting the platform is maintained within the enterprise infrastructure on Azure.
For more information on deployment scenarios, see Supported Deployment Scenarios for Azure.
For more information on the limitations of this deployment scenario, see Product Limitations.
Depending on which of the following Azure components you are deploying, additional pre-requisites and limitations may apply. Please review these sections as well.
Before you begin, please verify that you have completed the following:
Cluster sizing: Before you begin, you should allocate sufficient resources for the cluster. For guidance, please contact your .
The required set of ports must be enabled for listening. See System Ports.
This node should be dedicated for .
Limitations: For more information on limitations of this scenario, see Product Limitations in the Install Preparation area.
Deploy and provision a cluster of one of the supported types. The supports integrations with multiple cluster types.
NOTE: Before you deploy, you should review cluster sizing options. For guidance, please contact your .
Primary storage of the cluster may be set to an existing Azure Data Lake Store or Blob Storage.
For more information, see Supported Deployment Scenarios for Azure.
In your Azure infrastructure, you must deploy a suitable VM for the installation of the .
The operating system requirements for the VM for installing the platform vary depending on the type of job execution cluster with which you are running.
|Cluster Type||Supported O/S for VM||Notes|
must be installed on an edge node of the HDInsight cluster.
|Azure Databricks||CentOS and Ubuntu|
For more information on the supported EMR distributions, see Supported Deployment Scenarios for Azure.
Create the following directories, which are specified by parameter in the platform.
|Default HDFS path||Platform configuration property|
Change the ownership of the above directories to
trifacta:trifacta or the corresponding values for the S3 user in your environment.
Additional users may be required. For more information, see Required Users and Groups in the Install Preparation area.
Please complete these steps listed in order:
Install Software: Install the software on the node you created.
NOTE: You must follow the instructions provided for Ubuntu installation.
See Install Software.
Install Databases: The platform requires several databases for storing metadata.
NOTE: The software assumes that you are installing the databases on a PostgreSQL server on the same node as the software. If you are not or are changing database names or ports, additional configuration is required as part of this installation process.
For more information, see Install Databases.
As soon as you login, you should change the password on the admin account. In the left menu bar, select Settings > Admin Settings. Scroll down to Manage Users. For more information, see Change Admin Password.
Tip: At this point, you can access the online documentation through the application. In the left menu bar, select Help menu > Product Docs. All of the following content, plus updates, is available online. See Documentation below.
After you have completed the above topics, you can complete the configuration for your deployment below.
NOTE: The following configuration topics are not part of this installation guide. See links below.
You can access complete product documentation online and in PDF format. From within the product, select Help menu > Product Docs.
After you have accessed the documentation, the following topics are relevant to Azure deployments. Please review them in order.
|Supported Deployment Scenarios for Azure||Matrix of supported Azure components.|
Top-level configuration topic on integrating the platform with Azure.
|Configure for HDInsight|
Review this section if you are integrating the with a pre-existing HDI cluster.
|Configure for Azure Databricks||Review this section if you are integrating with a pre-existing Azure Databricks cluster.|
|Enable ADLS Access||Configuration to enable access to ADLS.|
|Enable WASB Access||Configuration to enable access to WASB.|
|Verify Operations||You should be able to verify platform operations by running a simple job at this time.|
To enable, see Enable Relational Connections.
Azure-specific relational connections:
|Configure SSO for Azure AD|
How to integrate the with Azure Active Directory for Single Sign-On.