- Transform by Example
- Snowflake Connector
- Output Parameterization
- And much more!
Transform by Example
Transform by Example expands the native, guided step creation in Trifacta Self-Managed Enterprise Edition. For any existing column value, you can type out the desired output value, and Trifacta Self-Managed Enterprise Edition assembles a program in the background to get you there:
After you enter an example on the first row, Trifacta Self-Managed Enterprise Edition infers the kind of transformation you're trying to do. It applies this transformation to your input column and previews what your data will look like once the step is saved. If you're not satisfied with the prediction, you can add more examples for different input records to fine-tune the transformation. You can toggle between the full column view and a pattern view that shows the output for each of the pattern groups present in that column. When you are satisfied with the results, you can add the transformation to your recipe, which can be executed at scale on your full dataset.
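The by-example idea can be illustrated with a toy Python sketch. This is not how Trifacta Self-Managed Enterprise Edition builds its program; it assumes a small, hypothetical set of candidate transformations (`CANDIDATES`) and picks the first one that agrees with every example supplied so far. The function names `infer_transform` and `preview` are also illustrative only.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical candidate transformations. A real engine would search a far
# larger space of programs; these five exist only for illustration.
CANDIDATES: Dict[str, Callable[[str], str]] = {
    "uppercase": str.upper,
    "lowercase": str.lower,
    "first_word": lambda s: s.split()[0] if s.split() else "",
    "last_word": lambda s: s.split()[-1] if s.split() else "",
    "digits_only": lambda s: "".join(ch for ch in s if ch.isdigit()),
}


def infer_transform(examples: List[Tuple[str, str]]) -> Optional[str]:
    """Return the name of the first candidate consistent with every example."""
    for name, fn in CANDIDATES.items():
        if all(fn(source) == wanted for source, wanted in examples):
            return name
    return None  # no candidate explains all of the examples


def preview(column: List[str], examples: List[Tuple[str, str]]) -> Optional[List[str]]:
    """Apply the inferred transformation to a whole column, or None if none fits."""
    name = infer_transform(examples)
    if name is None:
        return None
    return [CANDIDATES[name](value) for value in column]
```

In this sketch, a single example such as `("ABC", "ABC")` is ambiguous and resolves to the first matching candidate. Adding a second example for a different input record narrows the choice, which mirrors the fine-tuning described above.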
For more information, see Overview of TBE.
Macros
Macros provide a repeatable way to accomplish repetitive or common tasks in Trifacta Self-Managed Enterprise Edition. In the example shown below, we use three steps to create a macro to remove outliers. Here are the steps bundled up into the macro:
- Create a column of the standard deviations,
- Create a column of the mean, and
- Create a formula to flag outliers based on whether or not the value falls more than 3.5 standard deviations from the mean.
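The three steps above can be sketched in plain Python. This is an illustration rather than the recipe steps themselves: the function name `flag_outliers`, the output column names, and the use of the population standard deviation (`statistics.pstdev`) are assumptions. The source column is passed in as a parameter.

```python
from statistics import mean, pstdev
from typing import Dict, List


def flag_outliers(
    rows: List[Dict[str, float]],
    column: str,
    threshold: float = 3.5,
) -> List[Dict[str, object]]:
    """Add stdev, mean, and is_outlier fields to each row for the given column."""
    values = [row[column] for row in rows]

    # Step 1: standard deviation of the column (population form is an assumption).
    col_stdev = pstdev(values)
    # Step 2: mean of the column.
    col_mean = mean(values)

    flagged = []
    for row in rows:
        # Step 3: flag values more than `threshold` standard deviations from the mean.
        distance = abs(row[column] - col_mean)
        is_outlier = col_stdev > 0 and distance > threshold * col_stdev
        flagged.append(
            {**row, "stdev": col_stdev, "mean": col_mean, "is_outlier": is_outlier}
        )
    return flagged
```

Because the column name is an argument, the same function can be reused on any numeric column. Note that with the population standard deviation, a single extreme value can only exceed 3.5 standard deviations when the column has enough rows (roughly 15 or more).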
Below, we create a macro out of these three steps, with the original column as a parameter that can be changed from recipe to recipe. Rather than recreating these three steps from scratch or manually copying them from a separate recipe, we can locate the macro in our library of macros directly from the Transformer page to reduce the busy work:
As needed, you can inspect a macro's underlying steps to verify that it behaves correctly. You can also parameterize values in macros, such as columns, numbers, strings, patterns, booleans, and more. If you need to tweak any step in a macro, you can convert the macro back to the original set of discrete steps and modify them.
Reusing a macro is easy; select it and enter the needed parameter values:
For more information, see Overview of Macros.
Snowflake Connector
This release includes a connector to Snowflake. Read data from Snowflake, wrangle in Trifacta Self-Managed Enterprise Edition, and publish the results back to Snowflake. For more information, see Enable Snowflake Connections.
Output Parameterization
You can now add parameters and variables to your output file paths. For example, the following appends the timestamp of the job execution time to the output filename:
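Outside the product, the same idea can be sketched in a few lines of Python. This is not the product's parameter syntax; the function name `timestamped_output_path`, the example path, and the timestamp format are all assumptions chosen for illustration.

```python
from datetime import datetime
from pathlib import PurePosixPath


def timestamped_output_path(
    base_path: str,
    run_time: datetime,
    fmt: str = "%Y%m%d-%H%M%S",  # assumed format, not a product default
) -> str:
    """Insert the job run timestamp between the file stem and its extension."""
    path = PurePosixPath(base_path)
    stamped_name = f"{path.stem}_{run_time.strftime(fmt)}{path.suffix}"
    return str(path.with_name(stamped_name))


# Hypothetical usage: one output file per job run.
print(timestamped_output_path("/output/sales.csv", datetime(2019, 8, 1, 14, 30, 0)))
# prints /output/sales_20190801-143000.csv
```

Placing the timestamp before the extension keeps each run's output distinct while leaving the file type recognizable to downstream tools.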
For more information, see Overview of Parameterization.
These are just the highlights of this release. To see all of what’s new in Release 6.4 for Trifacta Self-Managed Enterprise Edition, please see Release Notes 6.4.
For more information on previous releases, see Past Releases.