This section contains miscellaneous configuration topics to enable minor features of the platform.

Limits

For more information, see Configure Application Limits.

Configure CSV field delimiters

When you publish a CSV file, by default the fields in the file are comma-separated. Optionally, you can configure a different field delimiter.

Tip: During publication, you can specify whether the output file includes double quote marks around each field. For more information, see Run Job Page.

Steps:

  1. You can apply this change through the Admin Settings Page (recommended) or trifacta-conf.json. For more information, see Platform Configuration Methods.
  2. To enable the feature, verify that the following parameter has been set to true:

    "feature.publishDelimiterQuoteOption.enabled": true,
  3. If you wish to specify the delimiter character using a Unicode value, please set the following to true:

    "feature.publishDelimiterQuoteOption.enableUnicodeCSVDelimiters": true,
  4. Search for the following parameter:

    "webapp.outputCsvDelimiter"
  5. In the textbox, enter the string that is used as the delimiter.
    1. Strings can be more than one character in length.
    2. If you have enabled the Unicode CSV delimiters feature, you can enter Unicode characters in the following format:

      \uXXXX

      where XXXX is the Unicode value for the character.

      NOTE: The delimiter value is used as the default value for each job. The default can be overridden on a per-job basis. For more information, see Run Job Page.

  6. Save your changes.
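
To illustrate the delimiter formats described in the steps above, the following Python sketch decodes a `\uXXXX` value and joins fields with the result. The helper names `decode_delimiter` and `join_fields` are hypothetical and are not part of the platform. The sketch shows only how a configured string could map to the characters written between fields, and it ignores quoting.

```python
import re

def decode_delimiter(value: str) -> str:
    """Replace each literal \\uXXXX sequence with the character it names."""
    return re.sub(
        r"\\u([0-9a-fA-F]{4})",
        lambda m: chr(int(m.group(1), 16)),
        value,
    )

def join_fields(fields, delimiter: str) -> str:
    """Join one row of fields with a delimiter of any length (no quoting)."""
    return delimiter.join(str(f) for f in fields)

# U+007C is the pipe character.
pipe = decode_delimiter(r"\u007C")
row = join_fields(["id", "name", "city"], pipe)
print(row)

# A delimiter string can be longer than one character.
multi = join_fields(["id", "name", "city"], "||")
print(multi)
```

Python's built-in `csv` writer accepts only single-character delimiters, which is why this sketch joins the fields directly.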

Supported Extensions

You cannot upload a file with an unsupported or unlisted file extension. As needed, you can add extensions to the list of supported file formats, which enables the file to be uploaded. 

  • Extensions are case-sensitive. .xml and .XML must both be listed.
  • A leading period is required.

NOTE: Even if the file extension is added to this list, the platform may not be able to process the file. For more information, see Supported File Formats.


NOTE: The Trifacta platform prevents the uploading of files with extensions that are disabled through specific configuration flags. For example, if you disable BZIP2 or Avro files, you cannot re-enable them by adding their extensions to this configuration list.


Steps:

  1. You can apply this change through the Admin Settings Page (recommended) or trifacta-conf.json. For more information, see Platform Configuration Methods.
  2. Locate the following parameter and add file extensions. In the example, two new extensions have been added to the list:

    "webapp.client.additionalAllowedFileExtensions": [".log",".LOG"],
  3. Save your changes and restart the platform.
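
The two rules above (case-sensitive matching and a required leading period) can be sketched in Python as follows. The function `is_listed` is hypothetical and only mimics an exact-match lookup against the configured list; it is not the platform's actual upload check.

```python
from pathlib import PurePath

# Example value of "webapp.client.additionalAllowedFileExtensions".
ADDITIONAL_ALLOWED = [".log", ".LOG"]

def is_listed(filename: str, allowed=ADDITIONAL_ALLOWED) -> bool:
    """Return True if the file's extension exactly matches a listed entry."""
    suffix = PurePath(filename).suffix  # includes the leading period
    return suffix in allowed

print(is_listed("server.log"))  # lowercase entry is listed
print(is_listed("server.LOG"))  # uppercase entry is listed
print(is_listed("server.Log"))  # mixed case is not listed
```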

Feature Flags

Disable enforced type casting

For non-schematized outputs, Trifacta Wrangler Enterprise enforces casting of all values to the annotated data type of the column by default. For example, if the output value is -3.4 and the data type for the output column is Integer, the platform enforces Integer type casting and writes a null value instead.

This feature is managed by the Schematized output workspace setting:

Parameter Setting | Description | Implications on Mismatched Values
true | (Default) All output values are forced to match the data type of the output columns. | Mismatched values are written as null values.
false | All output values are written in their output form, regardless of the column's data type. This behavior is the default for Cloud Dataprep by TRIFACTA INC. and Trifacta Wrangler. | Mismatched values are written as-is.

Steps:

  1. You apply this change through the Workspace Settings Page. For more information, see Platform Configuration Methods.
  2. Set the following parameter accordingly:

    "Schematized output" : true,
  3. Save your changes and restart the platform.
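
The effect of the Schematized output setting on a mismatched value can be sketched in Python. The function `written_value` is hypothetical and simplified: it treats only Python `int` values as matching an Integer column, which is an assumption made for illustration and not the platform's type-inference logic.

```python
from typing import Optional, Union

Number = Union[int, float]

def written_value(value: Number, schematized_output: bool) -> Optional[Number]:
    """Return what is written to a column annotated as Integer (simplified)."""
    matches_type = isinstance(value, int)  # simplification: only ints match
    if matches_type or not schematized_output:
        return value  # written as-is
    return None  # mismatched value under enforced casting becomes null

print(written_value(-3.4, schematized_output=True))   # mismatch, casting enforced
print(written_value(-3.4, schematized_output=False))  # mismatch, casting not enforced
print(written_value(7, schematized_output=True))      # value matches the column type
```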

Enable deletion of jobs

By default, Trifacta Wrangler Enterprise does not permit the deletion of jobs. As needed, you can enable this feature by completing the following steps.

Steps:

  1. You can apply this change through the Admin Settings Page (recommended) or trifacta-conf.json. For more information, see Platform Configuration Methods.
  2. Set the following parameter to true:

    "feature.enableJobDeletion" : true,
  3. Save your changes and restart the platform.

When enabled, jobs can be deleted through the context menu on the Flow View page or the Jobs page.
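
If you edit trifacta-conf.json directly instead of using the Admin Settings Page, a change like this one can be scripted. The Python sketch below assumes that a dotted parameter name such as `feature.enableJobDeletion` corresponds to a path through nested JSON objects; that layout is an assumption, so verify it against your own file before relying on it. The helper `set_dotted` is hypothetical, and the sketch works on an in-memory example instead of a real configuration file.

```python
import json

def set_dotted(config: dict, dotted_key: str, value) -> dict:
    """Set a value at a dotted path, creating intermediate objects as needed."""
    *parents, leaf = dotted_key.split(".")
    node = config
    for name in parents:
        node = node.setdefault(name, {})
    node[leaf] = value
    return config

# In-memory stand-in for the relevant part of the configuration (assumed layout).
config = json.loads('{"feature": {"enableJobDeletion": false}}')
set_dotted(config, "feature.enableJobDeletion", True)
print(json.dumps(config, indent=2))
```

A restart of the platform is still required after the file is saved.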

Disable user modification of storage paths

By default, users are permitted to modify the paths for uploading datasets and outputting results. For more information, see Storage Config Page.

If needed, you can disable the ability for individual users to modify these paths. 

Steps:

  1. You apply this change through the Workspace Settings Page. For more information, see Platform Configuration Methods.
  2. Set the following parameter to false:

    Allow the user to modify their paths
  3. Save your changes.

For more information, see Workspace Admin Page.

Disable parameterization

By default, Trifacta Wrangler Enterprise supports the application of parameters to imported datasets. Datetime, wildcard, or variable parameters can be used to operationalize execution of jobs on different versions of the same dataset. For example, if a new dataset is dropped in the same folder each day with a different date value in the filename, you can apply parameters to the imported dataset to ensure that all matching datasets in the directory can be executed. For more information, see Overview of Parameterization.
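
The daily-file example can be pictured with a small Python sketch. It uses shell-style wildcards from Python's standard `fnmatch` module as a stand-in; the filenames and the pattern are hypothetical, and the platform's own parameter syntax may differ.

```python
from fnmatch import fnmatchcase

# Hypothetical daily drops in one folder; the date in the filename varies.
files = [
    "sales-2023-01-01.csv",
    "sales-2023-01-02.csv",
    "inventory-2023-01-01.csv",
]

# The wildcard stands in for the changing date value.
pattern = "sales-*.csv"
matches = [name for name in files if fnmatchcase(name, pattern)]
print(matches)
```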

To disable this feature, please complete the following steps.

Steps:

  1. You can apply this change through the Admin Settings Page (recommended) or trifacta-conf.json. For more information, see Platform Configuration Methods.
  2. Set the following parameter to false:

    "feature.parameterization.enabled" : false,
  3. Save your changes and restart the platform.

Enable uploading profile pictures

Optionally, you can configure the platform to allow users to upload images to be used as their profile pictures. See User Profile Page.

Steps:


  1. You can apply this change through the Admin Settings Page (recommended) or trifacta-conf.json. For more information, see Platform Configuration Methods.
  2. Set the following parameter to true:

    "webapp.enableProfilePicture" : true,
  3. Save your changes and restart the platform.

Disable flow sharing

The flow sharing feature enables a user to share a flow with another user for collaboration. In this manner, a single source of truth can be retained as multiple users collaborate to transform the flow's datasets into the desired output.

Flow sharing is enabled by default. For more information on disabling, see Workspace Admin Page.

Disable input swapping with datasets with parameters

By default, the Trifacta application enables swapping out a static dataset with a dataset with parameters. However, these changes can significantly affect overall data processing volumes. If needed, this feature can be disabled.

Steps:

  1. You can apply this change through the Admin Settings Page (recommended) or trifacta-conf.json. For more information, see Platform Configuration Methods.
  2. Set the following parameter to false:

    "feature.parameterization.enableSwapWithDynamicDataset": false,
  3. Save your changes and restart the platform.

Disable column lineage recipe highlighting

By default, the Trifacta application can display lineage highlighting of applicable steps for a selected column. When enabled, you can select Show related steps from a column's drop-down menu, and all recipe steps that reference the column are highlighted in the recipe panel. 

Please complete the following steps to disable this feature.

Steps:

  1. You can apply this change through the Admin Settings Page (recommended) or trifacta-conf.json. For more information, see Platform Configuration Methods.
  2. Set the webapp.enableColumnLineageScriptHighlighting configuration flag to false:

    "webapp.enableColumnLineageScriptHighlighting": false,
  3. Save your changes and restart the platform.

Configure publishing access controls

By default, the Trifacta application publishes job results with the access controls inherited from the default output home directory of the user who ran the job. 

In some cases, these access controls may not work. For example, if the user is working on a project with multiple users, those other users may not be able to access the job results even though the output is written into a directory on the backend datastore to which they all have access. 

As needed, you can change the Trifacta application to write job results into the target directory inheriting user access controls from the parent of the target directory. In this manner, permissions are inherited based on the target location.

NOTE: When this feature is enabled, job results are always written with permissions inherited from the parent directory of the target location.

Steps:

  1. You can apply this change through the Admin Settings Page (recommended) or trifacta-conf.json. For more information, see Platform Configuration Methods.
  2. Locate the following parameter and set it to true:

    "feature.copyFileToDestination": true,
  3. Save your changes and restart the platform.

Disable custom types

By default, you can create custom data types in Trifacta Wrangler Enterprise. When a column is set to a custom data type, the values in the column are validated against the type specification.

After a custom data type has been created, it cannot be removed from the platform.


  • For more information on creating custom data types using regular expression, see Create Custom Data Types Using RegEx.

    NOTE: Use of regular expressions is considered a developer-level skill.
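
As a rough illustration of validating column values against a regular-expression type specification, consider the following Python sketch. The `PRODUCT_CODE` pattern and the `validate` helper are hypothetical and use Python's `re` module; they do not reflect how the platform stores or evaluates custom types.

```python
import re

# Hypothetical custom type: a product code such as "AB-1234".
PRODUCT_CODE = re.compile(r"[A-Z]{2}-\d{4}")

def validate(values, pattern=PRODUCT_CODE):
    """Return (value, is_valid) pairs; a value is valid only on a full match."""
    return [(v, pattern.fullmatch(v) is not None) for v in values]

results = validate(["AB-1234", "ab-1234", "AB-12345"])
for value, is_valid in results:
    print(value, "valid" if is_valid else "mismatched")
```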

To disable this feature, please complete the following steps.

Steps:

  1. You can apply this change through the Admin Settings Page (recommended) or trifacta-conf.json. For more information, see Platform Configuration Methods.
  2. Set the following to false:

    "feature.enableCustomTypes": false,
  3. Save your changes and restart the platform.

Disable machine learning transform suggestions

There is a known issue in which editing a recipe can result in an error like the following appearing in the recipe panel:

Server Save Error.
Unable to save changes. You may lose your work.

This error may be caused by a conflict between your operation and the machine learning service's attempt to deliver ranked suggestions to the user interface. Disabling this feature may remove the error.

NOTE: This change disables the platform's use of machine learning to improve transformation suggestions based on your actions. It does not impact the base platform's predictive interaction, which feeds your selections through a default set of algorithms to deliver suggested transformations.

Steps:

  1. You can apply this change through the Admin Settings Page (recommended) or trifacta-conf.json. For more information, see Platform Configuration Methods.
  2. Locate the following parameter and set it to false:

    "feature.mlTransformSuggestions.enabled": false,
  3. Save your changes and restart the platform.

Other Configuration Topics

