Page tree

Trifacta Dataprep


Contents:

On April 28, 2021, Google is changing the required permissions for attaching IAM roles to service accounts. If you are using IAM roles for your Google service accounts, please see Changes to User Management.

   

Contents:


The following settings can be customized for the user experience in your Cloud Dataprep by TRIFACTA INC. project. When you modify a setting, the change is immediately applied to the project. To access the page, select User menu > Admin console > Dataprep settings.

NOTE: Users may not experience the changed environment until each user refreshes the application page or logs out and in again.

Enablement Options:

NOTE: Any values specified in the Cloud Dataprep Settings page applies exclusively to the specific project and override any system-level defaults.

OptionDescription
Default

The default value is applied. This value may be inherited from higher level configuration.

Tip: You can review the default value as part of the help text.

Enabled

The setting is enabled.

NOTE: If the setting applies to a feature, the feature is enabled. Additional configuration may be required. See below.

DisabledThe setting is disabled.
EditClick Edit to enter a specific value for the setting.

General

Allow listing users

When enabled, individual users review and select from a list of all users in the Trifacta application.

NOTE: When this feature is disabled, users can access other users by providing the login email address of other users in screens such as the sharing dialogs.

Locale

Set the locale to use for inferring or validating data in the application, such as numeric values or dates. The default is United States.

NOTE: After saving changes to your locale, refresh your page. Subsequent executions of the data inference service use the new locale settings.

For more information, see Locale Settings.

Session duration

Specify the length of time in minutes before a session expires. Default is 10080 (one week).

API

Allow users to generate access tokens

Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA® INC.

When enabled, individual users can generate their own personal access tokens, which enable access to REST APIs. For more information, see Manage API Access Tokens.

Maximum lifetime for user generated access tokens (days)

Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA® INC.

Defines the maximum number of days that a user-generated access token is permitted for use in the product.

Tip: To permit generation of access tokens that never expire, set this value to -1.

For more information, see Manage API Access Tokens.

Connectivity

Custom SQL query

When enabled, users can create custom SQL queries to import datasets from relational tables. For more information, see Create Dataset with SQL.

Detect maximum column count in XLSX sheet

When you have enabled the Apache POI method for converting Excel files, you can enable this feature to force the conversion service to detect the maximum number of columns in an Excel sheet before beginning the conversion. This feature can improve detection of the structure of the sheet. For more information, see Import Excel Data.

Enable Apache POI based converter for Excel data conversion

When this setting is enabled, the conversion service uses the Java-based Apache POI converter to convert Excel data for ingestion into the product. 

Tip: You should enable this setting, unless you are experiencing problems during Excel conversion.


NOTE: If the Java-based converter is enabled, you should enable Detect maximum column count in XLSX sheet.

When disabled, the conversion service uses the Python-based converter, which has been available in the product previously.

For more information, see Import Excel Data.


Manage access to data using IAM permissions

Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA® INC.

When enabled, user access to data services in Google Cloud Platform, such as Google Cloud Storage and Bigquery, is determined by the permissions defined in a user's assigned IAM role.

NOTE: When this feature is enabled, all Cloud Dataprep Premium by TRIFACTA INC. users that belong to the project are automatically logged out of all Trifacta application sessions across all projects. For example, if a Cloud Dataprep Premium by TRIFACTA INC. user is logged into the product through another project, the user is logged out of their Trifacta application session when this feature is enabled. When each user logs in to the Trifacta application again, any changes to the user's permissions are applied. Since each each API request requires authentication in the header, API users are not automatically logged out.

For more information on IAM-based permissions, Required Dataprep User Permissions.

Flows, recipes, and plans

Column from examples

When enabled, users can access a tool through the column menus that enables creation of new columns based on example mappings from the selected column. For more information, see Overview of TBE.

Export

When enabled, users are permitted to export their flows and plans. Exported flows can be imported into other work areas or product editions. 

NOTE: If plans have been enabled in your project settings, enabling this flag applies to flows and plans.

For more information, see Export Flow.

For more information, see Export Plan.

Import

When enabled, users are permitted to import exported flows and plans.

NOTE: If plans have been enabled in your project settings, enabling this flag applies to flows and plans.

For more information, see Import Flow.

For more information, see Import Plan.

Schematized output

When enabled, all output columns are typecast to their annotated types. This feature is enabled by default.

Webhooks

Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.

When enabled, webhook notification tasks can be configured on a per-flow basis in Flow View page. Webhook notifications allow you to deliver messages to third-party applications based on the success or failure of your job executions. For more information, see Create Flow Webhook Task.

Job execution

Logical and physical optimization of jobs

Feature Availability: This feature is available in the following editions:

  • Cloud Dataprep Premium by TRIFACTA INC.
  • Cloud Dataprep Standard by TRIFACTA INC.

When enabled, the Trifacta application attempts to optimize job execution through logical optimizations of your recipe and physical optimizations of your recipes interactions with data.

This workspace setting can be overridden for individual flows.

Tip: You should keep this feature enabled. Please enable it at the project level and disable it only if needed at the flow level.

For more information, see Flow Optimization Settings Dialog.

Require a companion service account for running jobs

Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.

By default, Cloud Dataprep by TRIFACTA INC. utilizes a default compute service account for running jobs on Cloud Dataflow. Optionally, you can enable this feature, which requires each user in the project to provide their own companion service account to run jobs. This feature is disabled by default.

Pre-requisites:

  • Service accounts must be created in the Google Cloud platform.
  • Companion service accounts must have a minimum set of permissions.
  • For more information, see Google Service Account Management.

When this feature is enabled:

  • Project administrators can review and specify companion service accounts for individual users of the project. For more information, see Service Accounts Page.
  • Individual users can specify their companion service account. For more information see User Profile Page.
  • At runtime, an override service account can be applied if needed. See Run Job Page.

When this feature is disabled:

  • By default, all users of the project use the Compute Engine service account specified for the project.
  • If companion service accounts has been enabled, when it's disabled, the default service account for the project is used.
  • For more information, see Google Service Account Management.

Scheduling and parameterization

Plan feature

Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.

When enabled, users can create plans to execute sequences of recipes across one or more flows. For more information, see Plans Page.

For more information on plans and orchestration, see Overview of Operationalization.


Scheduling feature

When enabled, project users can schedule the execution of flows. See Add Schedule Dialog.

Publishing

JSON output format

When enabled, members can generate outputs in JSON format.

This page has no comments.