Supported Cloudera Distributions

NOTE: By default, Cloudera may be installed with Java JDK 1.7 or earlier. If so, you must upgrade each node in the cluster to Java JDK 1.8. For more information, see https://www.cloudera.com/documentation/enterprise/latest/topics/cdh_ig_jdk_installation.html.
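As a rough aid for auditing nodes, the following sketch parses the text printed by `java -version` and flags installations older than JDK 1.8. The function names are hypothetical and not part of the Trifacta platform or Cloudera tooling. The sketch also assumes the usual `version "..."` output format. It only detects versions below 1.8 and makes no claim about whether releases newer than 1.8 are supported.

```python
import re


def java_major_version(version_output: str) -> int:
    """Extract the major Java version from `java -version` style output.

    Hypothetical helper for illustration only.
    """
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        raise ValueError("no Java version string found")
    first = int(match.group(1))
    second = int(match.group(2)) if match.group(2) else 0
    # Legacy scheme: "1.7.0_80" means Java 7.
    # Modern scheme: "11.0.2" means Java 11.
    return second if first == 1 else first


def needs_jdk_upgrade(version_output: str) -> bool:
    """Return True when the reported JDK is 1.7 or earlier."""
    return java_major_version(version_output) < 8
```

Collecting the `java -version` output from each node (for example, over SSH) is left to your own tooling. Note that `java -version` typically writes to standard error rather than standard output.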

For this release, the Trifacta® platform supports the following Cloudera versions. 

NOTE: Cloudera 6.0 and later require the use of native Hadoop libraries from the cluster. See Configure for Spark.

  • Cloudera 6.2.x (recommended)

  • Cloudera 6.1.x 

  • Cloudera 6.0.x 

    NOTE: Spark 2.4 is not supported on Cloudera 6.0. Please use Spark 2.2. See Configure for Spark.

  • Cloudera 5.16.x

    NOTE: Cloudera 5.14.x and 5.15.x are no longer supported. For best results, please upgrade your Hadoop distribution.
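The version list above can be expressed as a small pre-install check. This is a minimal sketch, not a tool shipped with the Trifacta platform: the names `SUPPORTED_CDH`, `major_minor`, and `compatibility_issues` are hypothetical, and the only Spark rule encoded is the one stated above (Spark 2.4 is not supported on Cloudera 6.0). Other Spark and Cloudera combinations are not validated here.

```python
# Supported Cloudera minor versions for this release, taken from the list above.
SUPPORTED_CDH = {(6, 2), (6, 1), (6, 0), (5, 16)}


def major_minor(version: str) -> tuple:
    """Reduce 'X.Y', 'X.Y.Z', or 'X.Y.Z.W' to the (X, Y) pair."""
    parts = version.strip().split(".")
    return int(parts[0]), int(parts[1])


def compatibility_issues(cdh_version: str, spark_version: str = None) -> list:
    """Return a list of human-readable problems; an empty list means none found."""
    issues = []
    cdh = major_minor(cdh_version)
    if cdh not in SUPPORTED_CDH:
        issues.append("Cloudera %d.%d.x is not a supported version." % cdh)
    # The one documented exception: Spark 2.4 on Cloudera 6.0.
    if cdh == (6, 0) and spark_version is not None:
        if major_minor(spark_version) == (2, 4):
            issues.append("Spark 2.4 is not supported on Cloudera 6.0; use Spark 2.2.")
    return issues
```

Because the check compares only the major and minor components, patch and point releases such as 5.16.2 or 6.2.0.1 are treated the same as their X.Y parent.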

Notes:

  • Update Date: July 29, 2019
  • The Trifacta platform supports all variants of patch or point releases (X.Y.* and X.Y.*.* releases) through the Hadoop vendor's backwards compatibility policy.
  • For individual versions of Hadoop components (such as HDFS, Spark, and Hive), the Trifacta platform supports the component version that is bundled with the vendor's package for the supported Hadoop distribution.
  • For more information on how to set up your Hadoop distribution, please consult the documentation provided with your distribution or contact your distribution vendor.

Supported Deployments

NOTE: Unless otherwise noted, all items listed below are supported across all Hadoop distribution versions listed above. Unlisted items are not supported. Please contact Trifacta Support or your sales representative for items not listed here.

Deployment System

NOTE: The Trifacta platform software must be installed on a gateway node of the Cloudera cluster. For more information, see System Requirements.


Item | Description
Physical On Premise Machines | Supported.
VMWare / VXServer | Supported.

NOTE: Deployment to an Amazon EC2 instance is supported. See Supported Deployment Scenarios for AWS.


Running Environment

Item | Description
Spark | Supported.
Trifacta Photon | Supported.

Platform Security

Item | Description
HDFS File Permissions | Supported.
HDFS Privileges | Supported through Sentry.
Hive Privileges | Supported through Sentry.
Kerberos-Enabled Hadoop Cluster | Supported. See Configure for Kerberos Integration.
Secure User Impersonation | Supported. See Configure for Secure Impersonation.

High Availability

Item | Description
Name Node, Resource Manager, HttpFS | Supported. See Enable Integration with Cluster High Availability.

Metadata Publishing

Item | Description
Cloudera Navigator | Not supported.
Hive Publishing | Supported. See Configure for Hive.
Redshift Publishing | Supported. See Run Job Page. See Publishing Dialog.

Supported File Formats

See Supported File Formats.

Connectivity

Hadoop Connectivity

The Trifacta platform supports connectivity for execution to the following Hadoop environments for this vendor's distributions. Connectivity exceptions are listed below:

Running Environment | HDFS Reader | HDFS Writer | Hive Reader w/ HiveServer2
Spark | Supported. | Supported. | Supported.

Profiling Environment | HDFS Reader | HDFS Writer
Profiling on Spark | Supported. | Supported.

External Connectivity

Storage Platform | HDFS Reader | HDFS Writer
S3 | Supported. | Supported.

Storage Platform | Amazon S3 Reader | Amazon S3 Writer
Spark Profiling | Supported. | Supported.

Notes

  • none.
