ADLS access leverages HDFS protocol and storage, so additional configuration is required.
Steps:
- You can apply this change through the Admin Settings Page (recommended) or
trifacta-conf.json
. For more information, see Platform Configuration Methods. Enable ADLS as the base storage layer:
"webapp.storageProtocol": "hdfs", "hdfs.enabled": true, "hdfs.protocolOverride": "adl",
Parameter Description webapp.storageProtocol Sets the base storage layer for the platform. Set this value to
hdfs
.NOTE: After this parameter has been saved, you cannot modify it. You must re-install the platform to change it.
hdfs.enabled For ADLS blob storage, set this value to true
.hdfs.protocolOverride For ADLS blob storage, this special parameter must be set to adl
.These parameters specify the Azure Data Lake for the platform:
"azure.adl.enabled": = "true" "azure.adl.store": = "adl://xxx.azuredatalakestore.net"
Parameter Description azure.adl.enabled To enable access to the Azure Data Lake, set this value to true
.azure.adl.store Specify the value of the Azure Data Lake Store here.
NOTE: Protocol should be set to
adl://
.Configure the appropriate Hadoop bundle JAR to use:
"hadoopBundleJar": "hadoop-deps/hdp-2.6/build/libs/hdp-2.6-bundle.jar",
Configure access to HDFS resources:
"hdfs.namenode.host": "xxx.azuredatalakestore.net", "hdfs.namenode.port": "443", "hdfs.webhdfs.host": "xxx.azuredatalakestore.net", "hdfs.webhdfs.ssl.enabled": "true", "hdfs.webhdfs.port": "443", "hdfs.highavailability.serviceName": "xxx.azuredatalakestore.net",
Parameter Description hdfs.namenode.host Hostname of the HDFS namenode hdfs.namenode.port Port number for the HDFS namenode hdfs.webhdfs.host Hostname of WebHDFS service hdfs.webhdfs.ssl.enabled If SSL has been enabled on the WebHDFS host, please set this value to true
. You are likely to need to set the port value to a non-default value.hdfs.webhdfs.port Port number of WebHDFS service hdfs.highavailability.serviceName Set this value the high availability service name for HDFS, if you have enabled integration with the cluster high availability. For more information, see Enable Integration with Cluster High Availability in the Configuration Guide. - Save your changes.
This page has no comments.