This section contains the hardware and software requirements for successful installation of the platform.

Platform Node Requirements

Node Installation Requirements

If the platform is installed in a Hadoop environment, the software must be installed on an edge node of the cluster.


NOTE: If you are installing the platform into a Docker container, a different set of requirements applies. For more information, see Install for Docker in the Install Guide.

Hardware Requirements

Minimum hardware:

Item                             Required
Number of cores                  8 cores, x86_64
RAM                              64 GB
Disk space to install software   4 GB
Total free disk space            16 GB

The platform requires 12 GB of dedicated RAM to start and perform basic operations.

Space requirements by volume:

  • /opt - 10 GB
  • /var - Remainder

Recommended hardware:

Item                             Recommended
Number of cores                  16 cores, x86_64
RAM                              128 GB
Disk space to install software   16 GB
Total free disk space            100 GB

The platform requires 12 GB of dedicated RAM to start and perform basic operations.

Space requirements by volume:

  • /opt - 10 GB
  • /var - Remainder
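The minimum and recommended figures above can be checked programmatically before installation. The following is a minimal sketch, not part of the product: the names PROFILES, check_hardware, and probe_node are hypothetical, the thresholds are copied from the tables above, and GB values are treated as GiB for simplicity.

```python
import os
import platform
import shutil

# Thresholds copied from the hardware tables (GB treated as GiB here).
PROFILES = {
    "minimum": {"cores": 8, "ram_gb": 64, "free_disk_gb": 16},
    "recommended": {"cores": 16, "ram_gb": 128, "free_disk_gb": 100},
}


def check_hardware(cores, machine, ram_gb, free_disk_gb, profile="minimum"):
    """Return a list of shortfalls; an empty list means the node passes."""
    need = PROFILES[profile]
    problems = []
    if machine != "x86_64":
        problems.append(f"architecture is {machine}, expected x86_64")
    if cores < need["cores"]:
        problems.append(f"{cores} cores found, {need['cores']} required")
    if ram_gb < need["ram_gb"]:
        problems.append(f"{ram_gb:.0f} GB RAM found, {need['ram_gb']} required")
    if free_disk_gb < need["free_disk_gb"]:
        problems.append(
            f"{free_disk_gb:.0f} GB free disk found, {need['free_disk_gb']} required"
        )
    return problems


def probe_node(path="/"):
    """Read cores, architecture, RAM, and free disk from a Linux node."""
    gib = 1024 ** 3
    # SC_PAGE_SIZE and SC_PHYS_PAGES are available through os.sysconf on Linux.
    ram_gb = os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES") / gib
    free_gb = shutil.disk_usage(path).free / gib
    return (os.cpu_count() or 0), platform.machine(), ram_gb, free_gb
```

On the node itself, check_hardware(*probe_node()) returns the list of shortfalls against the minimum profile. The operating system usually reports slightly less physical memory than the nominal installed amount, so a small tolerance on the RAM comparison may be appropriate.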

Operating System Requirements

The following operating systems are supported for the platform node. The platform requires 64-bit versions of any supported operating system.

CentOS/RHEL versions:

Notes on CentOS/RHEL installation:

Ubuntu versions:

Notes on Ubuntu installation:

For more information on RPM dependencies, see System Dependencies.

Database Requirements

The following database versions are supported by the platform for storing metadata and the user's recipes.

Supported database versions:

Notes on database versions:

For more information on installing and configuring the database, see Install Databases in the Databases Guide.

Other Software Requirements

The following software components must be present.

Java

Where possible, you should install the same version of Java on the platform node and on the cluster with which you are integrating.

Notes on Java versions:

Other Software

For Ubuntu installations, the following packages must be manually installed using Ubuntu-specific versions:

Instructions and version numbers are provided later in the process.

Root User Access

Installation must be executed as the root user on the platform node.
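An installation wrapper can stop early when it is not running as root. The sketch below assumes a Linux node; the function name is_root is hypothetical and is not part of the installer.

```python
import os


def is_root(euid):
    """Return True when the effective user ID is 0, which is root on Linux."""
    return euid == 0


def require_root():
    """Abort with a clear message unless the current process runs as root."""
    if not is_root(os.geteuid()):
        raise SystemExit("Installation must be executed as the root user.")
```

Calling require_root() at the top of a wrapper script fails fast instead of partway through the installation.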

SSL Access

(Optional) If users are connecting to the platform over SSL, an SSL certificate must be created and deployed. See Install SSL Certificate in the Install Guide.

Internet Access

(Optional) Internet access is not required for installation or operation of the platform. However, if the server does not have Internet access, you must acquire additional software as part of the disconnected install. For more information, see Install Dependencies without Internet Access in the Install Guide.

Hadoop Cluster Requirements

The following requirements apply if you are integrating the platform with an enterprise Hadoop cluster.

Supported Hadoop Distributions

The platform supports the following minimum Hadoop distributions.

Cloudera supported distributions

See Supported Deployment Scenarios for Cloudera in the Install Guide.

Hortonworks supported distributions

See Supported Deployment Scenarios for Hortonworks in the Install Guide.

EMR supported distributions

See Configure for EMR in the Configuration Guide.

HDInsight supported distributions

See Configure for HDInsight in the Configuration Guide.

Azure Databricks supported distributions

See Configure for Azure Databricks in the Configuration Guide.

Node Requirements

Each cluster node must have the following software:

Hadoop Component Access

The platform must have access to the following.

Java and Spark version requirements

The following matrix identifies the supported versions of Java and Spark on the Hadoop cluster. Where possible, you should install the same version of Java on the platform node and on the cluster with which you are integrating.

Notes:

            Spark 2.3    Spark 2.4
Java 1.8    Required.    Required.
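One way to confirm that the platform node and the cluster run the same Java version is to compare the output of java -version from each host. The sketch below is illustrative only: the function names are hypothetical, and it assumes the version banner has already been captured as text (java -version writes it to standard error).

```python
import re


def java_feature_version(version_output):
    """Extract the version family, such as "1.8", from a java -version banner."""
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        return None
    major, minor = match.group(1), match.group(2)
    # Legacy scheme: 1.8.0_281 -> "1.8". Newer scheme: 11.0.2 -> "11".
    if major == "1" and minor is not None:
        return f"1.{minor}"
    return major


def java_versions_match(node_output, cluster_output):
    """Return True when both banners report the same Java version family."""
    node_version = java_feature_version(node_output)
    return node_version is not None and node_version == java_feature_version(
        cluster_output
    )
```

A result of "1.8" from java_feature_version corresponds to the Java 1.8 row in the matrix above. The comparison looks only at the version family and ignores update numbers and vendor.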

Other components

Hadoop System Ports

For more information, see System Ports.

Site Configuration Files

Hadoop cluster configuration files must be copied into the platform. See Configure for Hadoop in the Configuration Guide.

Security Requirements

Cluster Configuration

For more information on integration with Hadoop, see Prepare Hadoop for Integration with the Platform.

User Requirements

Users must access the platform through one of the supported browser versions. For more information on user system requirements, see Desktop Requirements.

I/O Requirements

See Supported File Formats in the User Guide.