This section contains hardware and software requirements for successful installation of Designer Cloud Enterprise Edition.
Edge Node Requirements
If the Designer Cloud Powered by Trifacta platform is installed in a Hadoop cluster, it is typically installed on an edge node.
NOTE: The Designer Cloud Powered by Trifacta platform does not require edge node installation.
Hardware Requirements
Minimum hardware:
Item | Required |
---|---|
Number of cores | 8 cores |
RAM | 64 GB. NOTE: The platform requires 12 GB of dedicated RAM to start and perform basic operations. |
Disk space to install software | 4 GB |
Total free disk space | 16 GB |
Recommended hardware:
Item | Recommended |
---|---|
Number of cores | 16 cores |
RAM | 128 GB. NOTE: The platform requires 12 GB of dedicated RAM to start and perform basic operations. |
Disk space to install software | 16 GB |
Total free disk space | 100 GB |
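As a rough pre-installation check, the minimum values in the hardware table can be compared against a candidate node. The following Python sketch is illustrative only and is not part of the product. The names `MINIMUMS`, `check_hardware`, and `probe_local_node` are hypothetical, and the RAM probe assumes a Linux host where `os.sysconf` exposes `SC_PHYS_PAGES`.

```python
import os
import shutil

# Minimum values copied from the "Minimum hardware" table.
MINIMUMS = {"cores": 8, "ram_gb": 64, "free_disk_gb": 16}


def check_hardware(cores, ram_gb, free_disk_gb, minimums=MINIMUMS):
    """Return a list of shortfalls; an empty list means all minimums are met."""
    measured = {"cores": cores, "ram_gb": ram_gb, "free_disk_gb": free_disk_gb}
    return [
        f"{name}: found {measured[name]}, need at least {required}"
        for name, required in minimums.items()
        if measured[name] < required
    ]


def probe_local_node(path="/"):
    """Measure cores, RAM (GiB), and free disk (GiB) on the local Linux node."""
    gib = 1024 ** 3
    ram_bytes = os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES")
    free_bytes = shutil.disk_usage(path).free
    return os.cpu_count() or 0, ram_bytes / gib, free_bytes / gib


# Example use on the candidate node:
#   problems = check_hardware(*probe_local_node())
```

The operating system usually reports slightly less memory than is physically installed, so a small RAM shortfall reported by this sketch is a prompt to inspect the node rather than a definitive failure.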
Operating System Requirements
The following operating system distributions are supported for the Alteryx node.
NOTE: The Designer Cloud Powered by Trifacta platform requires 64-bit versions of any operating system.
- CentOS 6.4 - 6.x, 7.1, 7.2, 7.4
- RHEL 6.4 - 6.x, 7.1, 7.2, 7.4
NOTE: If you are installing on CentOS/RHEL 7.1, you must be connected to an online repository for some critical updates. Offline installation is not supported for these operating system distributions.
NOTE: For security reasons, RHEL 7.3 is not supported for installation of Release 5.0 or later of the Designer Cloud Powered by Trifacta platform. Please upgrade to RHEL 7.4 or later.
- Ubuntu 14.04 (codename Trusty) and 16.04 (codename Xenial)
NOTE: For Ubuntu installations, some packages must be manually installed. Instructions are provided later in the process.
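The supported-release list above can be encoded as a small lookup for use in a pre-installation script. This is a minimal sketch, not a product utility: the function name `is_supported_os` is hypothetical, the caller is expected to supply the distribution name and version string, and the range "6.4 - 6.x" is interpreted here as any 6.N release with N of 4 or higher.

```python
import re

# Supported releases transcribed from the list above.
SUPPORTED_EL7 = {"7.1", "7.2", "7.4"}
SUPPORTED_UBUNTU = {"14.04", "16.04"}


def is_supported_os(distro, version):
    """Return True if (distro, version) matches the supported list above."""
    distro = distro.strip().lower()
    match = re.match(r"(\d+)\.(\d+)", version.strip())
    if not match:
        return False
    major, minor = int(match.group(1)), int(match.group(2))
    if distro in ("centos", "rhel"):
        if major == 6:
            # Assumption of this sketch: "6.4 - 6.x" means 6.4 and later 6.N.
            return minor >= 4
        return f"{major}.{minor}" in SUPPORTED_EL7
    if distro == "ubuntu":
        return f"{major}.{minor:02d}" in SUPPORTED_UBUNTU
    return False
```

Only the first two components of the version string are compared, so a point release such as `7.4.1708` is treated as 7.4.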
NOTE: During normal operations, the platform may maintain a high number of open files, which may exceed the default limit defined by the operating system. Before you begin using the system, you should raise this limit to 64000. For more information on raising the ulimit, see Miscellaneous Configuration.
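To confirm whether the open-file limit already meets the recommended value, the limit for the current process can be read with Python's standard `resource` module (Unix only). The sketch below is illustrative; the helper names are hypothetical, and the result reflects the user and shell that run the script, which may differ from the account that runs the platform.

```python
import resource

REQUIRED_OPEN_FILES = 64000  # value recommended in the note above


def open_file_limit_ok(soft_limit, required=REQUIRED_OPEN_FILES):
    """Return True if the soft open-file limit is unlimited or at least `required`."""
    if soft_limit == resource.RLIM_INFINITY:
        return True
    return soft_limit >= required


def current_open_file_limits():
    """Return the (soft, hard) RLIMIT_NOFILE values for the current process."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


# Example use:
#   soft, hard = current_open_file_limits()
#   print(open_file_limit_ok(soft))
```

Running the check as the same user that runs the platform gives the most representative answer.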
NOTE: If you are enabling SSO and want to use an Apache Server as a reverse proxy for the Alteryx node, you may need to upgrade to Apache Server. See Configure SSO for AD-LDAP.
For more information on RPM dependencies, see System Dependencies.
Other Software Requirements
The following software components must be present.
Java
Java 1.8
NOTE: Additional requirements related to the Java JDK are listed in the Hadoop Components section below.
NOTE: If you are integrating your Alteryx instance with S3, you must install the Oracle JRE 1.8 onto the Alteryx node. No other version of Java is supported for S3 integration. See Enable S3 Access.
NOTE: OpenJDK 1.8 is officially supported. It must be installed on the Alteryx node during the installation process. See Installation Steps.
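One way to verify the Java requirement on a node is to parse the banner printed by `java -version`. The following sketch is an illustration under stated assumptions: the helper names are hypothetical, and the parser relies on the common banner format in which Java 8 reports itself as `1.8.x` (for example, `openjdk version "1.8.0_292"`).

```python
import re
import subprocess


def parse_java_version(banner):
    """Extract (major, minor) from `java -version` output, or None if not found."""
    match = re.search(r'version "(\d+)(?:\.(\d+))?', banner)
    if not match:
        return None
    return int(match.group(1)), int(match.group(2) or 0)


def is_java_8(banner):
    """Java 8 reports itself as 1.8.x in the version banner."""
    return parse_java_version(banner) == (1, 8)


def local_java_banner():
    """Run `java -version`; the JVM writes its banner to stderr."""
    result = subprocess.run(
        ["java", "-version"], capture_output=True, text=True, check=False
    )
    return result.stderr


# Example use on a node where `java` is on the PATH:
#   print(is_java_8(local_java_banner()))
```

The banner also names the vendor build (for example, OpenJDK), which is relevant when the S3 integration note above applies.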
PostgreSQL
PostgreSQL 9.3 is used by the Designer Cloud Powered by Trifacta platform for storing metadata and the user's Wrangle recipes. The software is installed on the Alteryx node as part of the installation process.
- PostgreSQL 9.3
For more information on installing and configuring the database, see Set up the Databases.
Other Software
NOTE: For Ubuntu installations, the following packages must be manually installed using Ubuntu-specific versions. Instructions and version numbers are provided later in the process.
- NginX 1.12.2
- NodeJS 6.12.2
Root User Access
Installation must be executed as the root user on the Alteryx node.
SSL Access
(Optional) If users are connecting to the Designer Cloud Powered by Trifacta platform over SSL, an SSL certificate must be created and deployed. See Install SSL Certificate.
Internet Access
(Optional) Internet access is not required for installation or operation of the platform. However, if the server does not have Internet access, you must acquire additional software as part of the disconnected install. For more information, see Install Dependencies without Internet Access.
Hadoop Cluster Requirements
The following requirements apply if you are integrating the Designer Cloud Powered by Trifacta platform with your enterprise Hadoop cluster.
NOTE: If you have upgrades to your Hadoop cluster planned for the next year, you should review those plans with Support prior to installation. For more information, please contact Alteryx Support.
Supported Hadoop Distributions
NOTE: The Designer Cloud Powered by Trifacta platform only supports the latest major release and its minor releases of each distribution.
The Designer Cloud Powered by Trifacta platform supports only the versions of required components that are included in a supported distribution. Use of non-default versions of required components is not supported, even if those versions are upgrades.
The Designer Cloud Powered by Trifacta platform supports the following minimum Hadoop distributions:
Vendor | Supported Versions | Link |
---|---|---|
Cloudera | | Supported Deployment Scenarios for Cloudera |
Hortonworks | | Supported Deployment Scenarios for Hortonworks |
Node Requirements
Each cluster node must have the following software:
Java JDK 1.8
Hadoop Component Access
The Alteryx deployment must have access to the following.
Java and Spark version requirements
The following matrix identifies the supported versions of Java and Spark on the Hadoop cluster.
Notes:
- Java must be installed on each node of the cluster.
- If you are integrating with an EMR cluster, there are specific version requirements for EMR. See Configure for Spark.
- By default, Cloudera may be installed with Java JDK 1.7 or earlier. If you are upgrading to Java JDK 1.8, you must do so on each node of the cluster. For more information, see https://www.cloudera.com/documentation/enterprise/latest/topics/cdh_ig_jdk_installation.html.
- The versions of Java on the Alteryx node and the Hadoop cluster do not have to match.
Java version | Spark 2.1 | Spark 2.2 |
---|---|---|
Java 1.7 | Supported. | Not supported. |
Java 1.8 | Supported. | Required. |
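For scripted validation of a cluster plan, the matrix above can be expressed as a lookup table. This is a minimal sketch with hypothetical names (`JAVA_SPARK_MATRIX`, `java_spark_status`, `combination_allowed`); it covers only the four combinations listed in the table.

```python
# Support matrix transcribed from the table above: (java, spark) -> status.
JAVA_SPARK_MATRIX = {
    ("1.7", "2.1"): "Supported",
    ("1.7", "2.2"): "Not supported",
    ("1.8", "2.1"): "Supported",
    ("1.8", "2.2"): "Required",
}


def java_spark_status(java_version, spark_version):
    """Look up the documented status; combinations outside the table return None."""
    return JAVA_SPARK_MATRIX.get((java_version, spark_version))


def combination_allowed(java_version, spark_version):
    """True when the table lists the pair as Supported or Required."""
    return java_spark_status(java_version, spark_version) in ("Supported", "Required")
```

Returning `None` for unlisted combinations keeps the sketch from implying support for versions that the table does not mention.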
Other components
- HDFS Namenode
- WebHDFS
  NOTE: In HDFS, Append Mode must be enabled. See Prepare Hadoop for Integration with the Platform.
  NOTE: If you are enabling high availability failover, you must use HttpFS instead of WebHDFS. See Enable Integration with Cluster High Availability.
- For Map Reduce 1:
  - NOTE: As of Release 2.7, Map Reduce 1 has been deprecated. For more information, please see End of Life and Deprecated Features.
  - JobTracker is running.
- For YARN:
  - ResourceManager is running.
  - ApplicationMaster's range of ephemeral ports is open to the Alteryx node.
- Hive Server 2:
  - Hive Server 2 is supported for metadata publishing. Additional configuration is required. See Configure for Hive.
  - WebHCat is not supported.
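To confirm that the node can reach WebHDFS, a request URL for the WebHDFS REST API can be built and then fetched with any HTTP client. The sketch below only constructs the URL. The function name `webhdfs_url` is hypothetical, and the host, port, path, and user in the example are placeholders, not values taken from this document. HttpFS exposes the same REST interface, so the same URL shape applies with the HttpFS host and port.

```python
from urllib.parse import quote, urlencode


def webhdfs_url(host, port, path, op="GETFILESTATUS", user=None, scheme="http"):
    """Build a WebHDFS REST URL: scheme://host:port/webhdfs/v1/<path>?op=..."""
    params = {"op": op}
    if user:
        # Simple (non-Kerberos) authentication passes the user as a query parameter.
        params["user.name"] = user
    encoded_path = quote(path.lstrip("/"))
    return f"{scheme}://{host}:{port}/webhdfs/v1/{encoded_path}?{urlencode(params)}"


# Example with placeholder values:
#   webhdfs_url("namenode.example.com", 50070, "/trifacta", user="trifacta")
```

A successful `GETFILESTATUS` response indicates that the endpoint is reachable and the path exists. On a Kerberos-enabled cluster, the `user.name` parameter is not sufficient and the request must be authenticated instead.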
Hadoop System Ports
For more information, see System Ports.
Site Configuration Files
Hadoop cluster configuration files must be copied into the Alteryx deployment. This step is especially important in a YARN deployment. See Configure for Hadoop.
Security Requirements
- Kerberos supported:
  - If Kerberos is enabled, a keytab file must be accessible to the Designer Cloud Powered by Trifacta platform.
  - See Set up for a Kerberos-enabled Hadoop cluster.
- If Kerberos and secure impersonation are not enabled:
  - A user [hadoop.user (default=trifacta)] must be created on each node of the Hadoop cluster.
  - A directory [hadoop.dir (default=trifacta)] must be created on the cluster.
  - The user [hadoop.user] must have full access to the directory, which enables storage of the transformation recipe back into HDFS.
  - See Configure for Hadoop.
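For the non-Kerberos case, the directory steps above can be sketched as HDFS shell commands. The Python helper below only assembles the commands and does not run them. The function name is hypothetical, and the directory location `/trifacta` is an assumption: this document gives the default name `trifacta` but not the full HDFS path. Creating the operating system user on each node is outside the scope of this sketch.

```python
import shlex


def hdfs_setup_commands(hadoop_user="trifacta", hadoop_dir="/trifacta"):
    """Return HDFS CLI commands (as argument lists) that create the directory
    and make `hadoop_user` its owner. The path is an assumed example."""
    return [
        ["hdfs", "dfs", "-mkdir", "-p", hadoop_dir],
        ["hdfs", "dfs", "-chown", "-R", hadoop_user, hadoop_dir],
    ]


def as_shell_lines(commands):
    """Render the argument lists as shell-quoted lines for review."""
    return [" ".join(shlex.quote(arg) for arg in cmd) for cmd in commands]


# Example: print the commands for review before running them as an HDFS superuser.
#   print("\n".join(as_shell_lines(hdfs_setup_commands())))
```

Printing the commands for review, rather than executing them directly, leaves the choice of path and the account that runs them to the cluster administrator.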
Cluster Configuration
For more information on integration with Hadoop, see Prepare Hadoop for Integration with the Platform.
User Requirements
Users must access the Designer Cloud Powered by Trifacta platform through the Google Chrome browser. For more information on user system requirements, see Desktop Requirements.
I/O Requirements