May 12, 2021
|TD-60881||Incorrect file path and missing file extension in the application for parameterized outputs|
|TD-60378||Join and Union user interfaces are taking too long to respond.|
|TD-60187||Snowflake publishing fails during validation when both Stage database and External Stage are in use.|
|TD-59658||IAM roles passed through SAML does not update after Hotfix upgrade|
|TD-59633||Enabled session tag feature but running into "The security token included in the request is invalid" error|
|TD-59249||After ad-hoc publishing, job cleanup process deletes user's output directory on HDFS.|
|TD-59229||Uploaded CSV file fails with Parquet schema error.|
|TD-58932||Cannot read file paths with colons from EMR Spark jobs|
|TD-58591||Copied flow does not include output objects.|
|TD-58433||Some recipe steps missing in copied flow|
|TD-58036||Custom SQL query from Hive fails to run.|
jobs take a long time to write output without chunked encoding.
|TD-57653||Monitoring details not visible in dataset details and flow view|
|TD-57528||Slow ingest time for small XSLX files|
|TD-57512||Java-vfs-service is becoming unresponsive when fetching files for a parameterized dataset comprised of 10,000 files|
|TD-57264||Transformation engine crashes when specifying Group By parameter for List function.|
|TD-56739||Memory leak in java-vfs-service|
|TD-53375||S3 browsing 400 error|
December 7, 2020
Support for PostgreSQL 12.3 for on all supported operating systems.
NOTE: Support for PostgreSQL 9.6 will be deprecated in a future release.
Installation of database client is now required:
Beginning in this release, before you install or upgrade the database or perform any required database cross-migrations, you must install the appropriate database client first.
NOTE: Use of the database client provided with each supported database distribution is now a required part of any installation or upgrade of the .
NOTE: The MySQL database client cannot be provided by . It must be downloaded and installed separately. As a result, installation or upgrade of a Docker environment using MySQL requires additional support. For more information, please contact .
For more information:
Catalog support to be deprecated:
NOTE: Integrations with Alation and Waterline catalogs are likely to get deprecated in a future release.
Support for custom data types based on dictionary files to be deprecated:
NOTE: The ability to upload dictionary files and use their contents to define custom data types is scheduled for deprecation in a future release. This feature is limited and inflexible. Until an improved feature can be released, please consider using workarounds. For more information, see Validate Your Data.
You can create custom data types using regular expressions. For more information, see Create Custom Data Types.
Maintenance release updater script is deprecated:
The maintenance release updater script has been deprecated. This script could be used for performing maintenance upgrades:
Cannot import data from Azure Databricks. This issue is caused by an incompatibility between TLS v1.3 and Java 8, to which it was backported.
This issue is known to impact Marketplace installs of and can impact on-premises installs.
Non-default admin users are not automatically granted full workspace admin privileges on upgrade. These users may be able to see Workspace Settings and Admin Settings but are not granted access to edit roles and users.
For more information:
Access to S3 is disabled after upgrade.
When importing a dataset via API that is sourced from a BZIP file stored on a backend datastore such as S3, WASB, or ADLS Gen1/Gen2, the columns may not be properly split when the platform is permitted to detect the structure.
September 7. 2020
New Flow View is now generally available:
Annotate the canvas with notes.
Tip: The relative position of objects on the flow view canvas is preserved between screen updates. On refresh, the window on the canvas is repositioned based on the leftmost object on the canvas to focus on the flow to other objects from that one.
NOTE: Classic Flow View is no longer available.
See Flow View Page.
Support for PostgreSQL 12.3 for on CentOS/RHEL 7.
Support for Cloudera Data Platform.
NOTE: Installation requirements for Cloudera Data Platform are consistent with installation for CDH. The must be installed on a pre-existing Cloudera Data Platform.
There are minor differences in configuration. For more information, see Configure for Cloudera.
Support for EMR 5.30.1.
NOTE: Avoid EMR 5.30.0. Instead, please use EMR 5.30.1.
See Configure for EMR.
For long-loading relational datasets, you can monitor the ingest process through Flow View as you continue your work.
NOTE: This feature may require enablement in your deployment. For more information, see Configure JDBC Ingestion.
For more information, see Flow View Page.
Improved performance when browsing databases for tables to import.
Tip: Performance improvements are due to limiting the volume of table metadata that is imported when paging through available tables. This metadata can be retrieved when you hover over a table in the database browser.
For more information, see Database Browser.
Logical and physical optimizations when reading from relational sources during job execution, which includes column pruning push-down among other enhancements.
NOTE: This feature may need to be enabled in your workspace. See Workspace Settings Page.
This feature applies to the following relational connections in this release:
Collaborative suggestions allow users within a workspace to receive suggestions based on the transformations that have been recently created by themselves or by all members of the workspace. As more users generate transformations, the relevance of these suggestions to the data in the workspace continues to improve.
Support for job cancellation on EMR clusters. See Jobs Page.
NOTE: Additional configuration may be required. For more information, see Configure for EMR.
The is no longer available for installation and is not supported for use with the product. Please use one of the supported browser versions instead. For more information, see Desktop Requirements.
In previous releases, the Users section of the Admin Settings page was used to manage users.
Please upgrade to a supported distribution of either operating system. For more information, see System Requirements.
Access to S3 is now managed through the Java-based virtual file system. For more information, see Configure Java VFS Service.
NOTE: No configuration changes are required for upgrading customers. For more information, see Enable S3 Access.
When schematized datasources are ingested, schema information is now retained for publication of job results.
NOTE: In prior releases, you may have set column data types manually because this schema information was lost during the ingest process. You may need to remove these manual steps from your recipe. For more information, see Improvements to the Type System.
For social security numbers and credit card numbers, the methods by which these values are determined for purposes of masking sensitive Personally Identifiable Information (PII) has been expanded and improved. For more information, see Improvements to the Type System.
Optimizer service and database: During job execution on relational sources, the optimizer service assists in managing SQL queries efficiently so that smaller volumes of data are retrieved for the job. Queries are stored in the related database.
API: Unable to update awsConfig objects in per-user or per-workspace modes.
|TD-51229||When an admin user shares a flow that the admin user owns, a |
|TD-48915||Inserting special characters in an output filename results in a validation error in the the application and job failures.|
|TD-47696||Platform appears to fail to restart properly through Admin Settings page due to longer restarts of individual services.|
|TD-49559||Cannot select and apply custom data types through column Type menu.|
|TD-47473||Uploaded files (CSV, XLS, PDF) that contain a space in the filename fail to be converted.|
|TD-34840||Platform fails to provide suggestions for transformations when selecting keys from an object with many of them.|
Import of dataset from Alation catalog hangs.
If a flow is unshared with you, you cannot see or access the datasources for any jobs that you have already run on the flow. You can still access the job results.