This guide takes you through the steps for installing Trifacta® software on CentOS or Red Hat.
For more information on supported operating system versions, see Product Support Matrix in the Planning Guide.
Before you install software, please review and verify the following.
NOTE: Except for database installation and configuration, all install commands should be run as the root user or a user with similar privileges. For database installation, you will be asked to switch to the database user account.
- Review key sections of the Planning Guide:
- Review the System Requirements and verify that all required components have been installed.
- Verify that all required System Ports are opened on the node.
- Review the System Dependencies in the Planning Guide.
- Cluster Configuration: Additional steps are required to integrate the Trifacta platform with the cluster. See Prepare Hadoop for Integration with the Platform in the Planning Guide.
- Acquire your License Key.
- Install and verify operations of the datastore, if used.
NOTE: Access to the Spark cluster is required.
- Verify access to the server where the Trifacta platform is to be installed.
Required version of RPM for CentOS
The installer for the Trifacta platform on CentOS/RHEL requires RPM version 4.11.3-40. Please upgrade if necessary.
NOTE: CentOS/RHEL 7.4 and earlier releases may ship an older version of RPM. With an older version, the installer may fail to launch.
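The RPM requirement above can be checked before launching the installer. The following is a minimal sketch, not part of the product: the function names `version_ge` and `check_rpm_version` are hypothetical, the sketch assumes that 4.11.3-40 is a minimum rather than an exact match, and it relies on GNU `sort -V` for version ordering.

```shell
# Sketch only: helper names are illustrative and not provided by the platform.

# Succeeds when version $1 is greater than or equal to version $2.
# GNU sort -V orders version strings; the smaller of the two is printed first.
version_ge() {
  [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

# Version stated in this guide; assumed here to be a minimum.
REQUIRED_RPM="4.11.3-40"

# Compares the installed rpm package against the required version.
check_rpm_version() {
  local installed
  installed="$(rpm -q --queryformat '%{VERSION}-%{RELEASE}' rpm)" || return 1
  if version_ge "$installed" "$REQUIRED_RPM"; then
    echo "rpm $installed meets the requirement"
  else
    echo "rpm $installed is older than $REQUIRED_RPM; consider: yum update rpm" >&2
    return 1
  fi
}
```

Run `check_rpm_version` on the node where the platform is to be installed. A non-zero exit status suggests that RPM should be upgraded first.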
Tip: The Python setup tools can be useful for debugging startup issues. To install:
1. Install Dependencies
Without Internet access
If you have not done so already, you may download the dependency bundle with your release directly from Trifacta. For more information, see Install Dependencies without Internet Access.
With Internet access
Use the following to add the hosted package repository for CentOS/RHEL, which will automatically install the proper packages for your environment.
2. Install JDK
By default, the Trifacta node uses OpenJDK for accessing Java libraries and components. In some environments, basic setup of the node may include installation of a JDK. Please review your environment to verify that an appropriate JDK version has been installed on the node.
NOTE: Use of Java Development Kits other than OpenJDK is not currently supported. However, the platform may work with the Java Development Kit of your choice, as long as it is compatible with the supported version(s) of Java. For more information, see System Requirements in the Planning Guide.
Tip: OpenJDK is included in the offline dependencies, which can be used to install the platform without Internet access. For more information, see Install Dependencies without Internet Access.
The following commands can be used to install OpenJDK. These commands can be modified to install a separate compatible version of the JDK.
NOTE: If java-1.8.0-openjdk-devel is not included, the batch job runner service, which is required, fails to start.
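As an illustration of an OpenJDK install that includes the development package, here is a hedged sketch. The package name `java-1.8.0-openjdk-devel` comes from this guide; the companion package `java-1.8.0-openjdk`, the function name `install_openjdk`, and the `DRY_RUN` switch are assumptions made for this example. Check System Requirements for the Java version that your release supports.

```shell
# Sketch only: install_openjdk and DRY_RUN are illustrative names.
# Installs the OpenJDK 8 runtime together with the -devel package.
install_openjdk() {
  local pkgs="java-1.8.0-openjdk java-1.8.0-openjdk-devel"
  if [ "${DRY_RUN:-0}" = "1" ]; then
    # Print the command instead of running it.
    echo "yum install -y $pkgs"
    return 0
  fi
  # Word splitting of $pkgs is intentional: one argument per package.
  yum install -y $pkgs
}
```

Calling `DRY_RUN=1 install_openjdk` prints the command for review. Calling `install_openjdk` as root performs the install.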
By default, the JAVA_HOME environment variable is configured to point to a default install location for the OpenJDK package.
NOTE: If you have installed a JDK other than the OpenJDK version provided with the software, you must set the JAVA_HOME environment variable on the Trifacta node to point to the correct install location.
The property value must be updated in the following locations:
Edit the following file:
- Save changes.
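When a different JDK is installed, its install location can usually be derived from the `java` launcher on the PATH. The sketch below shows one way to do that. The function name `java_home_from_binary` is hypothetical, and the stripping of a trailing `/jre` directory assumes the layout commonly used by Java 8 packages.

```shell
# Sketch only: java_home_from_binary is an illustrative helper.
# Prints the JDK home directory for a given java launcher path.
java_home_from_binary() {
  local real_path home
  # Resolve symlinks such as /usr/bin/java -> /etc/alternatives/java -> ...
  real_path="$(readlink -f "$1")" || return 1
  # The launcher lives in <home>/bin/java, so go up two levels.
  home="$(dirname "$(dirname "$real_path")")"
  # Java 8 packages often place the launcher in <jdk>/jre/bin/java.
  case "$home" in
    */jre) home="${home%/jre}" ;;
  esac
  echo "$home"
}
```

For example, `export JAVA_HOME="$(java_home_from_binary "$(command -v java)")"` sets the variable for the current shell only. The platform configuration described above must still be updated separately.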
3. Install Trifacta package
NOTE: If you are installing without Internet access, you must reference the local repository. The command to execute the installer is slightly different. See Install Dependencies without Internet Access.
NOTE: Installing the Trifacta platform in a directory other than the default one is not supported or recommended.
Install the package with yum, using root:
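The exact package file name depends on your release and is not reproduced here. As a hedged sketch of an install from a downloaded RPM, the hypothetical helper below checks that the file exists and that the caller is root before handing the file to `yum`. It assumes the node has Internet access; for installs without Internet access, follow Install Dependencies without Internet Access instead.

```shell
# Sketch only: install_local_rpm is an illustrative helper, and the
# path passed to it is whatever RPM file Trifacta provided for your release.
install_local_rpm() {
  local rpm_file="$1"
  if [ ! -f "$rpm_file" ]; then
    echo "package not found: $rpm_file" >&2
    return 1
  fi
  if [ "$(id -u)" -ne 0 ]; then
    echo "this step must be run as root" >&2
    return 1
  fi
  # yum installs the local file and resolves its dependencies from the
  # configured repositories.
  yum install -y "$rpm_file"
}
```

Call it with the full path to the package file, for example `install_local_rpm /path/to/package.rpm`, where the path is a placeholder.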
4. Verify Install
The product is installed in the following directory:
The platform must be made aware of the location of Java.
- Edit the following file:
Update the following parameter value:
- Save changes.
5. Install License Key
Please install the license key provided to you by Trifacta. See License Key.
6. Install Hadoop dependencies
If you are integrating with a supported Hadoop cluster, you must install the dependencies for the Hadoop cluster on the Trifacta node. For more information, see Install Hadoop Dependencies.
7. Store install packages
For safekeeping, you should retain all install packages that have been installed with this Trifacta deployment.
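One possible way to retain the packages is to copy them to an archive directory and record checksums, so that the files can be verified later. This is a sketch under assumptions: the function name `archive_packages` and both directory arguments are hypothetical, and only files ending in `.rpm` are collected.

```shell
# Sketch only: archive_packages is an illustrative helper.
# Copies all .rpm files from $1 to $2 and writes a SHA256SUMS manifest.
archive_packages() {
  local src_dir="$1" dest_dir="$2" pkg found=0
  mkdir -p "$dest_dir" || return 1
  for pkg in "$src_dir"/*.rpm; do
    # Skip the unexpanded pattern when no packages are present.
    [ -e "$pkg" ] || continue
    cp -p "$pkg" "$dest_dir"/ || return 1
    found=1
  done
  if [ "$found" -ne 1 ]; then
    echo "no .rpm files found in $src_dir" >&2
    return 1
  fi
  # Record checksums next to the copies.
  ( cd "$dest_dir" && sha256sum -- *.rpm > SHA256SUMS )
}
```

The archived files can be checked later by running `sha256sum -c SHA256SUMS` inside the archive directory.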
Install and configure Trifacta databases
The Trifacta platform requires installation of several databases. If you have not done so already, you must install and configure these databases. See Install Databases in the Databases Guide.
After installation is complete, additional configuration is required to make the platform operational. See Install Configuration.
Install Desktop Application
You can deploy the Wrangler Enterprise desktop application as a desktop client to enable end-users to connect to the Trifacta application without using one of the supported web browsers. See Install Desktop Application.