August 21, 2020
Support for PostgreSQL 12.3 for Trifacta databases.
NOTE: For this release, PostgreSQL 12.3 is supported only on supported versions of CentOS/RHEL 7. See Product Support Matrix.
NOTE: In a future release, support for PostgreSQL 9.6 will be deprecated. For more information, see Upgrade Databases for PostgreSQL.
- Support for configurable Azure AD endpoint and authority for SSO validation. For more information, see Configure SSO for Azure AD.
Changes in System Behavior
Schema information is retained:
When schematized datasources are ingested, schema information is now retained for publication of job results.
NOTE: In prior releases, you may have set column data types manually because this schema information was lost during the ingest process. You may need to remove these manual steps from your recipe. For more information, see Improvements to the Type System.
If you are upgrading your cluster to CDH 6.3.3, please set the following property to the value listed below:
Save your changes and restart the platform. For more information, see Admin Settings Page.
For more information, see Configure for Spark.
Key Bug Fixes
TD-53062: After upgrade, imported recipe has UDF steps converted to comments.
On Azure Databricks, creating a stratified sample fails.
Cannot run Azure Databricks jobs on ADLS-Gen1 cluster in user mode.
UnknownHostException error when generating Azure Databricks access token from Secure Token Service.
Cannot import some Parquet files into the platform.
Import data page is taking too long to load.
Closing the connections search bar removes search bar and loses sort order.
On upgrade, Spark is incorrectly parsing files of type "UTF-8 Unicode (with BOM)."
Import rules not working for remapping of WASB bucket name. For more information, see Define Import Mapping Rules.
Cannot import flow due to missing associated flownode error.
Server Save error when deleting a column.
Transformation engine unavailable due to prior crash.
After upgrade, you cannot edit recipes or run jobs on recipes that contain the optional
Optional file cleanup generates confusing error logging when it fails.
When modifying file privileges, the platform makes assumptions about database usernames.
On upgrade, the migration framework for the authorization service is too brittle for use with Amazon RDS database installations.
When flows are imported into the Deployment Manager, additional characters are inserted into parameterized output paths, causing job failures.
PostgreSQL connections may experience out of memory errors due to incorrectly specified fetch size and vendor configuration.
Can't import a flow that contains a reference in a flow webhook task to a deleted output.
Generic Hadoop folder is missing in
After upgrade, you cannot publish as a single-file to WASB to replace an existing output destination.
After upgrade, users cannot load recipes due to Requested Data Not Found error when loading samples.
After upgrading Cloudera cluster to version 6.3.3, you cannot run jobs due to the following error:
Please see "Cloudera support" above.
During upgrade, cross-migration fails for authorization service and its database with the following error:
After upgrade, ad-hoc publish to Hive fails.
After upgrade, you cannot unzip downloaded log files.
After upgrade, cross-migration validation fails for "groupsPolicies."
Tripache Vulnerabilities - CVE-2020-1927
New Known Issues
May 4, 2020
Have a question about the product? Use the new in-app chat feature to explore content or ask our support staff a question. If you need assistance, please reach out!
NOTE: User messaging may require enablement in your deployment. See Enable In-App Chat.
- Users can download log files related to their current session through the application. See Download Logs Dialog.
- Administrators have a separate admin dialog that enables log download by time frame, job identifier, or session identifier. See Admin Download Logs Dialog.
NOTE: If you are installing or upgrading a deployment of Trifacta Wrangler Enterprise that uses or will use a remote database service, such as Amazon RDS, for hosting the Trifacta databases, please contact Trifacta Customer Success Services. For this release, additional configuration may be required.
Support for installation on CentOS/RHEL 8. See System Requirements.
NOTE: SSO using SAML is not supported on CentOS/RHEL 8. See Configure SSO for SAML.
NOTE: Support for CentOS/RHEL 6 has been deprecated. Please upgrade to CentOS/RHEL 8.
- Support for installation on CentOS/RHEL 7.7. See System Requirements.
Support for EMR 5.28.1 and EMR 5.29.0
NOTE: EMR 5.28.0 is not supported, due to Spark compatibility issues.
NOTE: Support for EMR 5.8 - EMR 5.12 is deprecated. For more information, see End of Life and Deprecated Features.
- Support for Azure Databricks 6.2. See Configure for Azure Databricks.
Support for installation on Ubuntu 18.04 (Bionic Beaver). See System Requirements.
NOTE: Support for installation on Ubuntu 14.04 (Trusty) has been deprecated. See End of Life and Deprecated Features.
- Support for CDH 6.0 is deprecated. See End of Life and Deprecated Features.
- Support for Spark 2.2.x versions is deprecated. See End of Life and Deprecated Features.
- Improved performance for Spark profiling on Datetime and numeric columns with a low number of discrete values.
- Support for access to Kerberized clusters. See Configure for EMR.
Improved performance for Oracle, SQL Server, and DB2 connections. These performance improvements will be applied to other relational connections in future releases.
NOTE: For more information on enabling this feature, please contact Trifacta Customer Success Services.
- Azure Databricks Tables:
- Support for read/write on Delta tables.
- Support for read/write on external tables.
- Support for read from partitioned tables.
NOTE: To enable these additional read/write capabilities through Databricks Tables, the underlying connection was changed to use a Simba driver. In your connection definition, any Connect String Options that relied on the old Hive driver may not work. For more information, see Configure for Azure Databricks.
- Ingestion of large relational datasets is no longer a blocking operation. For more information, see Configure JDBC Ingestion.
- Redesigned Settings and Help menus. See Home Page.
Introducing plans. A plan is a sequence of tasks on one or more flows that can be scheduled.
NOTE: In this release, the only type of task that is supported is Run Flow.
- Introducing new Flow View. The Flow View page has been redesigned to improve the user experience and overall productivity. NOTE: This feature is in Beta release.
- Enhancements include:
- Drag and drop to reposition objects on the Flow View canvas, and zoom in and out to focus on areas of development.
- Perform joins and unions between objects on the Flow View canvas.
- Annotate the canvas with notes.
- You can toggle between new and classic views through the context menu in the corner of Flow View. See Flow View Page.
- As needed, Trifacta administrators can disable access to the new Flow View completely. See Miscellaneous Configuration.
Create flow parameters that you can reference in your flow. Flow parameters can be string literals, Trifacta patterns, or regular expression patterns.
NOTE: For this release, flow parameters can be applied into your recipes only.
As needed, you can apply overrides to the parameters in your flow or to downstream flows.
NOTE: Flow parameters do not apply to datasets or output objects, which have their own parameters. However, if you specify an override at the flow level, any parameters within the flow that use the same name receive the override value, including output object parameters and datasets with parameters.
- See Manage Parameters Dialog.
- For more information on parameters, see Overview of Parameterization.
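The name-matching behavior of flow-level overrides described above can be illustrated with a minimal sketch. This is not the platform's implementation: the `FlowObject` class, the `resolve_parameters` function, and the parameter names `region` and `suffix` are all hypothetical, chosen only to show that a flow-level override reaches every parameter that shares its name.

```python
from dataclasses import dataclass, field


@dataclass
class FlowObject:
    """Hypothetical stand-in for a recipe, dataset, or output object in a flow."""
    name: str
    parameters: dict = field(default_factory=dict)  # parameter name -> own value


def resolve_parameters(objects, flow_overrides):
    """Return the effective parameter values for each object.

    A parameter whose name matches a flow-level override takes the
    override value; every other parameter keeps its own value.
    """
    resolved = {}
    for obj in objects:
        resolved[obj.name] = {
            key: flow_overrides.get(key, value)
            for key, value in obj.parameters.items()
        }
    return resolved


# Illustrative data only: two objects that both define a "region" parameter.
objects = [
    FlowObject("recipe", {"region": "emea"}),
    FlowObject("output", {"region": "emea", "suffix": "daily"}),
]
effective = resolve_parameters(objects, {"region": "apac"})
print(effective)
```

In this sketch, the override on `region` applies to both the recipe and the output object, while `suffix` is untouched because no override shares its name.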
- Monitor job progress through each phase in the Jobs panel. See Flow View Page.
- Improved performance when loading the Transformer page and when navigating between the Flow View and Transformer pages.
- Join steps are now created in a larger window for more workspace. See Join Window.
- New column selection UI simplifies choosing columns in your transformations. See Transform Builder.
- Faster and improved method of surfacing transform suggestions based on machine learning.
NOTE: Azure Databricks 5.3 and 5.4 are no longer supported. Please upgrade to Azure Databricks 5.5 LTS or 6.x. See End of Life and Deprecated Features.
- Apply overrides to Spark properties for individual job execution. See Enable Spark Job Overrides.
- Execute jobs from SFTP sources on EMR and Azure Databricks. See Create SFTP Connections.
- When visual profiling is enabled for a job, you can now download your visual profile in PDF format. See Job Details Page.
Support for generating results and publishing to Tableau Hyper format.
NOTE: Tableau TDE format will be deprecated in a future release. Please switch to using Tableau Hyper format.
If you have upgraded to Tableau Server 10.5 or later, you may have a mix of TDE and Hyper files stored on the server. You can automatically upgrade the TDE files to Hyper, if needed. For more information, see https://help.tableau.com/current/online/en-us/extracting_upgrade.htm.
- If you are on Tableau Server 10.5 or later and you append to a TDE file, the file is automatically converted to Hyper format. This conversion cannot be reverted.
- See Create Tableau Server Connections.
- New functions to parse values against specific data types.
- New functions for calculating working days between two valid dates.
- New two-column statistical functions.
- See Changes to the Language.
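As a rough illustration of what a working-days calculation involves, the following Python sketch counts Monday-to-Friday dates between two dates. It is not the product's function: the name `working_days_between` is hypothetical, and the sketch assumes that holidays are ignored and that the end date is excluded. The product functions may treat endpoints and holidays differently; see Changes to the Language for their actual behavior.

```python
from datetime import date, timedelta


def working_days_between(start, end):
    """Count Monday-to-Friday dates from start up to, but not including, end.

    Assumptions for this sketch: holidays are ignored and the end date
    itself is not counted.
    """
    if end < start:
        raise ValueError("end must not be earlier than start")
    count = 0
    current = start
    while current < end:
        if current.weekday() < 5:  # 0-4 are Monday through Friday
            count += 1
        current += timedelta(days=1)
    return count


# Monday, May 4, 2020 through the following Monday: one full work week.
print(working_days_between(date(2020, 5, 4), date(2020, 5, 11)))
```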
- New content on getting started with sampling. See Sampling Basics.
- Feature overview: Overview of Sampling
- Best practices: https://community.trifacta.com/s/article/Best-Practices-Managing-Samples-in-Complex-Flows
Changes in System Behavior
Wrangler Enterprise desktop application:
The Wrangler Enterprise desktop application is no longer available in the software distribution and has been deprecated. Please switch to a supported browser version. For more information, see Desktop Requirements.
A Release 6.8 version of the Wrangler Enterprise desktop application can be made available upon request. For more information, please contact Trifacta Support.
- All Trifacta admin users are now workspace admins.
All workspace admins now have access to all user-created objects within the workspace.
NOTE: Workspace administrators can access some types of user-created objects in the workspace with the same level of access as the object owner. Under some conditions, workspace admins may have access to source datasets and generated results. See Workspace Admin Permissions.
- For more information, see Changes to Authorization.
API reference documentation is now available directly through the application. This release includes more supported endpoints and documented options. To access, select Help menu > API Documentation.
NOTE: API reference content is no longer available with the product documentation. Please use the in-app reference documentation instead.
Workflow documentation is still available with the product documentation. For more information, see API Reference.
- For details, see Changes to the APIs.
- The v3 versions of the API endpoints are no longer available in the platform. You must use v4 endpoints. See API Reference.
- Simplified connections endpoints.
The format of the supported WASB URIs has changed.
NOTE: If you were using the APIs to interact with WASB resources, you must update your resources to use the new format. See Changes to the APIs.
- See Changes to the APIs.
In a future release, custom dictionaries that rely on an uploaded file will be deprecated. The specific release vehicle has not been determined yet.
- Deprecation only affects the ability to create custom types using a file. Where possible, you can and should continue to create custom types using regular expressions. For more information, see Create Custom Data Types Using RegEx.
- The file-based feature will be replaced by a standardization-based option.
- Beginning in this release, this feature is disabled by default.
- For more information, see Create Custom Data Types.
- If you have upgraded to Release 7.1 or later, any parameter overrides that you have specified in your flows can be modified in the Overrides tab of the Manage Parameters dialog.
- For more information, see Manage Parameters Dialog.
WASB and ADLS:
Configuration to enable WASB and ADLS access has been streamlined and simplified.
NOTE: No action is required for upgrading customers.
The default port number for the secure token service has been changed from 8090. The new default port number is
NOTE: Your upgraded installation is forced to use this new port number. You can modify the value after installation or upgrade.
- The Send a Copy feature is no longer available in the product. Instead, you can make a copy of the flow and share it. See Flow View Page.
- All MODE functions return the lowest value in a set of values if there is a tie in the evaluation. See Changes to the Language.
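The tie-breaking rule for MODE described above can be sketched in Python. This is an illustration of the rule only, not the platform's code; the function name `mode_lowest_on_tie` is hypothetical.

```python
from collections import Counter


def mode_lowest_on_tie(values):
    """Return the most frequent value; if several values tie, return the lowest."""
    counts = Counter(values)
    if not counts:
        return None  # no values, so no mode
    highest = max(counts.values())
    # Among all values that reach the highest frequency, pick the smallest.
    return min(value for value, count in counts.items() if count == highest)


# 1 and 3 each appear twice, so the tie resolves to the lower value.
print(mode_lowest_on_tie([3, 1, 3, 1, 2]))
```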
Key Bug Fixes
By default, under SSO, manual logout and session expiration redirect to different pages. Manual logout directs you to SAML sign out, and session expiry produces a session expired page.
To redirect the user to a different URL on session expiry, an administrator can set the following parameter:
New Known Issues
You cannot update your AWS configuration for per-user or per-workspace mode via UI.
Workaround: You can switch to using AWS system mode with a single, system-wide configuration, or you can use the APIs to make changes. See API Workflow - Manage AWS Configurations.
Cannot select and apply custom data types through column Type menu.
Workaround: You can change the type of the column as a recipe step. Use the Change column type transformation. From the New type drop-down, select
TD-47784: When creating custom datasets using SQL from Teradata sources, the
Uploaded files (CSV, XLS, PDF) that contain a space in the filename fail to be converted.
Workaround: Remove the space in the filename and upload again.
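If many files are affected, the renaming can be scripted before upload. The helper below is a minimal sketch and is not part of the product; the name `space_free_name` is hypothetical, and replacing spaces with an underscore rather than deleting them is a choice made here for readability.

```python
from pathlib import Path


def space_free_name(filename, replacement="_"):
    """Return filename with spaces in its final path component replaced.

    Only the file name itself is changed; parent directories are left as-is.
    Pass replacement="" to delete the spaces instead.
    """
    path = Path(filename)
    return str(path.with_name(path.name.replace(" ", replacement)))


print(space_free_name("monthly report.csv"))
```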