This section covers key known limitations of

NOTE: This list of limitations should not be considered complete.

Sampling

Internationalization

NOTE: Some functions do not correctly account for multi-byte characters. Multi-byte metadata values may not be consistently managed.

Size Limits


Sample Size Limits

Defaults for each running environment:

Job Size Limits

Execution on a Spark running environment is recommended for any files over 5GB in net data size, including join keys.  

Limitations by Integration

General

The product requires definition of a base storage layer, which can be HDFS or S3 for this version. This base storage layer must be defined during install and cannot be changed after installation. See Set Base Storage Layer.

LDAP

Hadoop

Amazon AMI

Amazon EMR

Microsoft Azure

S3

Redshift

None.

Hive

Spark

JDBC

 


Other Limitations