Page tree

 

Contents:


This section covers the following areas around general connectivity of the Trifacta® platform.

Additional configuration may be required for individual connection types. For more information, see Connection Types.

Enable

The platform automatically enables connectivity to relational databases for reading in datasets and writing results back out. 

NOTE: Relational connectivity requires the use of an encryption key file, which must be created and deployed before you create relational connections. For more information, see Create Encryption Key File in the Install Guide.

Data Service

The platform streams records from relational sources through the data service. These records are applied to transformation and sampling jobs on the Photon running environment, which is native to the Trifacta node.

Tip: In general, you should not have to modify settings for the data service. However, if you are experiencing general performance issues or issues with specific connection types, you may experiment with settings in the data service.

For more information, see Configure Data Service in the Configuration Guide.

Relational Features

Custom SQL Query

To enhance performance of your relational datasets, you can enable the use of custom SQL queries against your relational datasources, which allows you to pre-filter your datasets before you ingest them into the platform. This feature is enabled by default, but additional configuration can be applied. See Enable Custom SQL Query.

JDBC Ingestion

As needed, the platform can be configured to ingest data from your relational datasources to the base storage layer for faster execution of Spark-based jobs. See Configure JDBC Ingestion.

Configure Security

For more information, see Configure Security for Relational Connections.

Enable SSO Connections

If you have enabled Kerberos on the Hadoop cluster, you can leverage the Kerberos global keytab to enable SSO connections to relational sources. See Enable SSO for Relational Connections.

Type Inference

By default, the platform applies type inferencing to all imported datasources. However, for schematized sources, you may wish to disable type inferencing from the platform instead relying on the types provided from the source. 

Tip: You can also toggle the use of type inferencing for individual connections or for individual imported datasets.

For more information, see Configure Type Inference.

This page has no comments.