Page tree

Trifacta Dataprep


Contents:

On April 28, 2021, Google is changing the required permissions for attaching IAM roles to service accounts. If you are using IAM roles for your Google service accounts, please see Changes to User Management.

   

Contents:


These release notes apply to the following product tiers of Cloud Dataprep by TRIFACTA® INC.:

  • Cloud Dataprep Premium by TRIFACTA INC.
  • Cloud Dataprep Standard by TRIFACTA INC.
  • Cloud Dataprep Legacy by TRIFACTA INC.

Differences are noted in the docs. 

Tip: You can see your product tier in the Trifacta application. Select Help menu > About Cloud Dataprep.

For release notes from previous releases, see Earlier Releases of Cloud Dataprep.

February 16, 2021

Release 8.0

Features

Tip: Add a profile picture to your account! For more information, see User Profile Page.

Flow templates:

Introducing flow templates, which are predefined flows with guidelines for creating the flow objects needed to solve a specific transformation and publication use case. These step-by-step guides leverage placeholders for flow objects to assist you in rapidly assembling your end-to-end flow pipeline.

The first available template simplifies the Data Warehouse Onboarding process, which simplifies the ingestion of datasets, transformation of them, and loading them into your data warehouse. From the Home page, you can quickly set up a pipeline from data lakes into data warehouses:

  • GCS to BigQuery pipelineUse this template to create a flow by importing a Google Cloud Storage, transforming the data, and publishing the outputs on the BigQuery. For more information, see Start with a Template.

Authorization:

  • Support for assigning companion service accounts to individual project users for job execution.
  • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.
  • This feature must be enabled. See Dataprep Settings Page.
  • NOTE: Additional configuration may be required. See Changes to User Management.
  • See Service Accounts Page.
APIs:

  • Individual workspace users can be permitted to create and use their own access tokens for use with the REST APIs. For more information, see Dataprep Settings Page.

Connectivity:

Import:

  • Improved method for conversion and ingestion of XLS/XSLX files. For more information, see Import Excel Data.

Recipe development:

  • The Flag for Review feature enables you to set review checkpoints in your recipes. You can flag recipe steps for review by other collaborators for review and approval. For more information, see Flag for Review.

Metric-based data quality rules:

Update Macros:

  • Replace / overwrite an existing macro's steps and inputs with a newly created macro.
  • Map new macro parameters to the existing parameters before replacing.
  • Edit macro input names and default values as needed. 

Job execution:

  • You can enable the Trifacta application to apply SQL filter pushdowns to your relational datasources to remove unused rows before their data is imported for a job execution. This optimization can significantly improve performance as less data is transferred during the job run. For more information, see Flow Optimization Settings Dialog.
  • Optimizations that were applied during the job run now appear in the Job Details Page. See Job Details Page.

Changes

None.

Deprecated

None.

Known Issues

  • TD-56830: Receive malformed_query: enter a filter criterion when importing table from Salesforce.

    • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.
    • NOTE: Some Salesforce tables require mandatory filters when they are queried. Mandatory filters are not currently supported for Salesforce connections


  • TD-56170: The Test Connection button for some relational connection types does not perform a test authentication of user credentials.

    • Workaround: Append the following to your Connect String Options:

      ;ConnectOnOpen=true
    • This option forces the connection to validate user credentials as part of the connection. There may be a performance penalty when this option is used.

Fixes

None.

January 12, 2021

Release 7.10

Features

In-app chat: Have a question about the product? Use the new in-app chat feature to explore content or ask a question to our support staff. If you need assistance, please reach out! 

 


In-app tours: Check out the new in-app tours, which walk you through the steps of wrangling your datasets into clean, actionable data. 

  • NOTE: The product tours that were accessible through the Home page of the Trifacta application are no longer available.

 


Import: The maximum permitted size of a file uploaded through Trifacta application has been increased from 100 MB to 1 GB. 

 


Import and Export Plans: You can import and export plans from one environment, workspace, or projects to others.

  • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.
  • For more information, see Export Plan.
  • For more information, see Import Plan.

 


Share Plans: Share plans with one or more users to work together on the same plan.

  • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.
  • For more information, see Share a Plan. 

 


Email notifications: Send email notifications to plan owners and collaborators based on the status of execution of plans.

 


Connectivity: Improved Salesforce connection type.

  • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.
  • For more information, see Create Salesforce Connections. 

 

Changes

IP address range whitelist:

  • The list of IP addresses for the Trifacta service has been changed.
    • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.
    • These addresses must be whitelisted to provide access to your relational datastores.
    • Please verify that the following address range is whitelisted for each relational datastore that you wish to access from the product:

      34.68.114.64/28
    • Please remove the following IP address ranges from your relational whitelists. They are no longer used by Cloud Dataprep by TRIFACTA INC.:

      104.198.44.13/32
      34.71.238.145/32
      104.198.217.74/32
      34.68.178.136/32
    • For more information, see Getting Started with Cloud Dataprep.

 


Changes to IAM roles for service accounts: Recently, Google announced changes to permissions required for attaching IAM roles to service accounts. If you are using IAM roles for your Google service accounts, these changes may require updating the permissions that you must enable in your IAM roles. For more information, see Changes to User Management.

 


Enable listing all users in the workspace: Beginning in this release, workspace administrators can choose to enable or disable the listing of all workspace users in the Trifacta application. For example, if you are sharing a flow with another user and this feature is enabled, you can browse the list of all workspace users and select users with whom to share.

  • NOTE: When enabled, users are listed according to their email addresses.
  • NOTE: For Cloud Dataprep Legacy by TRIFACTA INC., this feature is disabled by default.
  • For more information on enabling or disabling this feature, see Changes to Configuration.

 

Deprecated

None.

Known Issues

None.

Fixes

TD-53527: When importing a dataset via API that is sourced from a BZIP file stored on S3, the columns may not be properly split when the platform is permitted to detect the structure..

 

December 14, 2020

Release 7.9

Features

In-app chat: Have a question about the product? Use the new in-app chat feature to explore content or ask a question to our support staff. If you need assistance, please reach out! 

 


Plan View: Execute branching and parallel tasks, using success/failure criteria to determine next steps:

  • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.
  • Execute Plan using status rules: Starting in Release 7.9, you can execute tasks based on the previous task execution result. For more information, see Create a Plan.
  • Execute Parallel Plan tasks: In previous releases, plans were limited to a sequential order of task execution. Beginning in Release 7.9, you can create branches in the graph into separate parallel nodes, enabling the corresponding tasks to run in parallel. This feature enables you to have a greater level of control of your plans' workflows. For more information, see Create a Plan.
  • Zoom options: Zoom control options and keyboard shortcuts have been introduced in the plan canvas. For more information, see Plan View Page.
  • Filter Plan Runs: Filter your plan runs based on dates or plan types. For more information, see Plan Runs Page

 


Transform Builder: An All option has been added for selecting columns in the Transform Builder.  For more information, see Changes to the Language.

 


API access tokens: Individual project users can now be permitted to create and use their own access tokens for use with the REST APIs. For more information, see Dataprep Settings Page.

 

Manage data access: You can control user access to datastores such as Google Cloud Storage and BigQuery based on finer-grained permissions assigned to the user's IAM role.

  • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.
  • For more information, see Changes to User Management.

 

Changes

Changes to permissions: The set of required and optional permissions has changed for Cloud Dataprep Premium by TRIFACTA INC..

  • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.
  • The set of required permissions to use the product has been reduced to only the permissions required to access the Trifacta application and related features.
  • NOTE: Permissions required to access external, optional services such as BigQuery are considered optional.
  • Administrators may wish to adjust permissions accordingly. For more information, see Changes to User Management.
  • For general information on required and optional permissions, see Required Dataprep User Permissions.

 


Job results page changes: The old flow view in the dependency graph tab is replaced with the new flow view.

  • The dependencies tab is renamed as dependency graph tab.
  • For more information, see Job Details Page

 


Optimizer Service re-enabled: In the September release, the Optimizer service was introduced, but an issue was discovered, which caused us to disable the service temporarily. This issue has been fixed.

  • The optimizer service optimizes query execution against data sources to minimize use of Cloud Dataprep by TRIFACTA INC. resources, reduce compute costs, and improve overall job execution time.
  • No configuration is required.
  • Feature Availability: This feature is available in the following editions: Cloud Dataprep Premium by TRIFACTA INC.  and  Trifacta Wrangler Enterprise
  • You can apply optimizations for individual flows. For more information, see Flow Optimization Settings Dialog.

 

Deprecated

None.

Known Issues

None.

Fixes

TD-53475: Missing associated artifact error when importing a flow.

 


November 17, 2020

Release 7.8

Features

Plans:

  • Create HTTP tasks for your plans, which can be configured to issue a request to an API endpoint over HTTP.

    Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.

 


Flow View:

  • Get started adding and building objects in your flows with new object placeholders.

  • Automatically organize the nodes of your flow with a single click.
  • The viewport position and zoom level are now preserved when returning to a given flow.

  • See Flows Page.

 


Data quality rules:

  • Improvements to suggestions for data quality rules:

    • Existing Data quality rules panel is updated with View suggestions button.
    • Data quality rules are now categorized by suggestions for individual columns.
    • Some rule types support the May be missing checkbox. When it is selected, the data quality rule allows missing values to be acceptable for a specified column. For more information on data quality rules, see Data Quality Rules Panel.

 


Language:

  • Rename columns now supports uppercase or lowercase characters or shorten column names to a specified character length from the left or right. For more information, see Rename Columns.

 


API: Modify the source Google Cloud Storage bucket and path for a defined imported dataset. Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.

For more information, see API Workflow - Swap Datasets.

 

Changes

JDBC connection pooling disabled: The ability to create connection pools for JDBC-based connections has been disabled. It is likely to be removed in a future release. Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.

 


Deprecated Parameter History Panel Feature: As a part of collaborative suggestions enhancement, the support for Parameter History panel is deprecated from the software. For more information on collaborative suggestions feature, see Overview of Predictive Transformation.

 

Automatic random samples are disabled: In prior releases, a random sample of the data was automatically generated for display when a recipe with source data greater than 10MB was first loaded into the Transformer page. The Initial Sample, which is the first set of rows in the dataset was displayed by default, and this automatic random sample was available for manually selection if needed.

NOTE: Recent issues with long-running random sample jobs require that the generation of automatic random samples must be disabled until the issues are addressed.

You can still generate random samples manually. If sample generation takes too long, you can cancel it and select a different sampling type. For more information, see Samples Panel.

 


Classic Flow View no longer available: In Release 7.6, an improved version of Flow View was released. At the time of release, users could switch back to using the classic version. 

Beginning in this release, the classic version of Flow View is no longer available. 

Tip: The objects in your flows that were created in classic Flow View may be misaligned in the new version of Flow View. You can use auto-arrange to re-align your flow objects.

For more information, see Flow View Page.

 


Enhanced Flow and Flow View menu options: The context menu options for Flow View and Flow have been renamed and reorganized for better user experience.

 


Salesforce connector disabled temporarily: In Release 7.8, the Salesforce connector has been disabled temporarily. In a future release, it will be replaced with an improved version of the Salesforce connector. Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.

 

Deprecated

None.

Known Issues

TD-55503: When you swap datasets via API, existing samples are not discarded. These samples are invalid.

  • Workaround: This issue does not occur if you swap datasets through the Trifacta application. If it does occur via API, you can collect a new sample manually. See Samples Panel.

 

Fixes

TD-53318: Cannot publish results to relational targets when flow name or output filename or table name contains a hyphen (e.g. my - filename.csv).

 

October 2, 2020

Release 7.6 push 3

Features

None.

Changes

Disabled Optimizer Service: In the September release, the Optimizer service was introduced, which enabled users to apply advanced physical and logical optimizations for flow and job executions. Recently, an issue was discovered, which has caused us to disable the service temporarily.

  • This issue affected a very small number of users who were using the new feature. Now that the feature is disabled, impacted users should experience impacts only to performance of flow and job executions. Performance should be similar to pre-release of the service.
  • The Optimizer service was disabled through a configuration change that did not require any service interruptions. Users should not experience any loss of functionality or availability due to the work to resolve this issue.
  • The Trifacta Engineering team is actively working to resolve the issue. Thank you for patience. If you have further questions, please contact Trifacta Support.

 

Deprecated

None.

Known Issues

None.

Fixes

None.

October 1, 2020

Release 7.6 push 3

Features

None.

Changes

Shared VPCs: Across all product editions, you can now run jobs through another project by specifying a full URL for the shared VPC.

  • Previously, this capability was only available for Cloud Dataprep Premium by TRIFACTA INC.. This restriction has been lifted. It is now available for Cloud Dataprep Standard by TRIFACTA INC. and Cloud Dataprep Legacy by TRIFACTA INC., too.
  • For more information on applying the shared VPC to a job, see Dataflow Execution Settings.
  • For more information on applying a shared VPC to jobs in the project, see Project Settings Page.

    NOTE: You must create new or replace output objects to use these shared VPC settings across your project.


 

Deprecated

None.

Known Issues

None.

Fixes

None.

September 21, 2020

Release 7.6

Features

Plans:

  • Apply overrides to recipe parameters for your plans. See Plan View Page.

    Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.

 

In-app messaging: Be sure to check out the new in-app messaging feature, which allows us to share new features and relevant content to Cloud Dataprep by TRIFACTA INC. users. More developments coming soon!

 

Dataprep Settings:

  • Project administrators can configure settings that are applied to the Cloud Dataprep by TRIFACTA INC. project. For more information, see Dataprep Settings Page.

 


MySQL:

  • Support for connections to MySQL databases.
  • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.
  • See Create MySQL Connections.

 


Optimizer Service: The optimizer service optimizes query execution against data sources to minimize use of Cloud Dataprep by TRIFACTA INC. resources, reduce compute costs, and improve overall job execution time.

  • No configuration is required.
  • Feature Availability: This feature is available in the following editions: Cloud Dataprep Premium by TRIFACTA INC.  and  Trifacta Wrangler Enterprise
  • You can apply optimizations for individual flows. For more information, see Flow Optimization Settings Dialog.

 


Relational long-loading: For relational datasources that take time to load, you can continue to work while monitoring the loading process through the Import Data page, Flow View, or Dataset Details. For more information, see Overview of Job Monitoring.

 

Documentation:

  • Additional connect string options and troubleshooting information has been included for specific relational connections.
  • Feature Availability: This feature is available in the following editions: Cloud Dataprep Premium by TRIFACTA INC.  and  Cloud Dataprep Standard by TRIFACTA INC.
  • For more information, see Connection Types.

Changes

None.

Deprecated

None.

Known Issues

None.

Fixes

TD-52559: When publishing a single CSV file with headers using the append or overwrite publishing action, multiple instances of the header may be written in the output file.

 

TD-48915: Inserting special characters in an output filename results in a validation error in the the application and job failures.

 


August 4, 2020

Release 7.5

Features

New Flow View is now available:

  • Drag and drop to reposition objects on the Flow View canvas, and zoom in and out to focus on areas of development.
  • Perform joins and unions between objects on the Flow View canvas. 
  • Annotate the canvas with notes.

  • Tip: You can still access classic Flow View through the context menu in Flow View.

 


Data quality rules: Introducing data quality rules, which enable you to define data quality checks specific to your dataset. For more information, see Overview of Data Quality.

  • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.
  • Data quality rules are created in the Transformer page. For more information, see Data Quality Rules Panel.

 


Flow Sharing API

  • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.
  • When you share a flow via API, you can now pass in a user's email address. For more information, see Changes to the APIs.

 


Changes

Delete Table IAM permission no longer required

  • Previously, the following IAM permission was required: bigquery.tables.delete
  • This permission is no longer required.

    NOTE: If this permission is not included in a user's account, some publishing actions on BigQuery, such as drop and truncate, are not possible. For more information, see Required Dataprep User Permissions.

  • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.

 

Add new IP address to whitelist

  • Please add that the following range of IP addresses to the whitelist for the Trifacta services for access to relational datasources in your enterprise

    34.68.114.64/28
  • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.
  • For more information, see Getting Started with Cloud Dataprep.

 

Deprecated

None.

Known Issues

TD-50942: If a flow is unshared with you, you cannot see or access the datasources for any jobs that you have already run on the flow. You can still access the job results.

 

Fixes

TD-49559: Cannot select and apply custom data types through column Type menu.

 

TD-47473: Uploaded files (CSV, XLS) that contain a space in the filename fail to be converted.

 

TD-34840: Platform fails to provide suggestions for transformations when selecting keys from an object with many of them.

 

July 7, 2020

Release 7.1

Features

Introducing Cloud Dataprep Premium by TRIFACTA INC. and Cloud Dataprep Standard by TRIFACTA INC.: You can now upgrade your existing Cloud Dataprep by TRIFACTA INC. projects to unlock advanced features, such as broader API access and relational connectivity. To see the full set of new capabilities and use cases, see https://www.trifacta.com/products/pricing/cloud-dataprep/.

 

Relational connectivity: Connect to relational sources to import data and, where supported, write results. 

 


Advanced Cloud Dataflow execution options: Specify additional job execution options at the project level or for individual jobs. 

  • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA® INC.
  • Assign scaling algorithms for managing Google Compute Engine instances or define minimum and maximum workers to use.
  • Specify the service account and any billing labels to apply to your jobs.
  • For more information:

 


Introducing plans: A plan is a sequence of tasks on one or more flows that can be scheduled.

  • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.
  • NOTE: In this release, the only type of task that is supported is Run Flow.
  • For more information on plans, see Plans Page.

  • For more information on orchestration in general, see Overview of Operationalization.

 


Dataflow execution in non-local VPC: You can now execute your Cloud Dataflow jobs on a non-local or shared virtual private network (VPC).

  • NOTE: To accommodate a wider range of shared VPCs configuration, subnetworks must be specified by full URL. See Changes below.
  • Project owners can set these execution options for the entire project. See Project Settings Page.

 

Changes

Subnetwork specified by URL: When you are specifying the subnetwork where to execute your Cloud Dataflow jobs, you must now specify the subnetwork using a URL.

  • Tip: This feature can be used when Cloud Dataprep by TRIFACTA INC. is configured to execute Cloud Dataflow jobs to run within a shared VPC hosted in a project other than the current project.
  • Previously, you could specify the subnetwork by name. However, non-local subnetwork values could not be specified in this manner.
  • For more information, see Dataflow Execution Settings.

 

Deprecated

None.

Known Issues

Soft validation errors during job execution for imported flow: If you import a flow from Cloud Dataprep Premium by TRIFACTA INC. into Cloud Dataprep Standard by TRIFACTA INC., you may encounter soft validation errors during job execution. 

  • If the imported flow uses custom VPC mode, then the job execution may fail, since network may be inaccessible.
  • Workaround: 1) set the VPC mode to Auto or 2) set accessible VPC options before you run your job. See Dataflow Execution Settings.

 

Fixes

None.

June 4, 2020

Release 7.1

Features


Flow parameters: Create flow parameters that you can reference in the recipes of your flow. 

  • NOTE: For this release, flow parameters can be applied into your recipes only.
  • As needed, you can apply overrides to the parameters in your flow or to downstream flows. 
  • NOTE: Flow parameters do not apply to datasets or output objects, which have their own parameters. However, if you specify an override at the flow level, any parameters within the flow that use the same name receive the override value, including output object parameters and datasets with parameters.
  • See Manage Parameters Dialog.
  • For more information on parameters, see Overview of Parameterization.

 

Introducing new Flow View: The Flow View page has been redesigned to improve the user experience and overall productivity. 

NOTE: This feature is in Beta release.

  • Enhancements include:
    • Drag and drop to reposition objects on the Flow View canvas, and zoom in and out to focus on areas of development.
    • Perform joins and unions between objects on the Flow View canvas. 
    • Annotate the canvas with notes.
  • You can toggle between new and classic views through the context menu in the corner of Flow View. See Flow View Page.

 

Redesigned Settings and Help menus: See Home Page.

 


Report issue: If you are experiencing an issue with Cloud Dataprep by TRIFACTA INC., you can gather useful information from the application to deliver to Trifacta Support.

  • From the Help menu, select Report issue.

 

Transformer page:

  • Join steps are now created in a larger window for more workspace. See Join Window.
  • New column selection UI simplifies choosing columns in your transformations. See Transform Builder.

 


Transformer page performance:

  • Improved performance when loading the Transformer page and when navigating between the Flow View and Transformer pages.
  • Faster and improved method of surfacing transform suggestions based on machine learning.

 


PDF profiles: When visual profiling is enabled for a job, you can now download your visual profile in PDF format. See Job Details Page.

 

 

Changes

Parameter overrides: If you have upgraded to Release 7.1 or later, any parameter overrides that you have specified in your flows must be re-applied. For more information, see Manage Parameters Dialog.

 

Language: All MODE functions return the lowest value in a set of values if there is a tie in the evaluation.

 


API Documentation:

  • API reference documentation is now available directly through the application. This release includes more supported endpoints and documented options. To access, select Help menu > API Documentation.

  • NOTE: API reference content is no longer available with the product documentation. Please use the in-app reference documentation instead.

  • Workflow documentation is still available with the product documentation. For more information, see API Reference.

 

Deprecated

Send a Copy: You can no longer send a copy of a flow to another user.

  • New method: Create a copy of a flow and share it with the other user.
  • For more information, see Share Flow Dialog.

 

Re-run jobs using Cloud Dataflow templates: This feature is no longer available. Cloud data flow templates can no longer be used to re-run jobs.

  • New method: Please use the /v4/jobGroups endpoint to run and re-run jobs.
  • For more information, see API Reference.

 

Known Issues

TD-49559: Cannot select and apply custom data types through column Type menu.

  • Workaround: You can change the type of the column as a recipe step. Use the Change column type transformation. From the New type drop-down, select Custom. Then, enter the name of the type for the Custom type value.

 

TD-47473: Uploaded files (CSV, XLS, PDF) that contain a space in the filename fail to be converted.

  • Workaround: Remove the space in the filename and upload again.

 

Fixes

None.

April 16, 2020

Release 6.8.2

Features

Advanced Dataflow Settings:

  • Feature Availability: This feature is available in Cloud Dataprep Premium by TRIFACTA INC.
  • Project admins can apply advanced dataflow execution settings to all jobs in the project. See Project Settings Page.
  • These settings can be overridden with values applied to individual jobs or output objects. See Dataflow Execution Settings.
  • Via jobGroup API: See API Workflow - Run Job.

 

Changes

None.

Deprecated

None.

Fixes

TD-47149: Cannot edit settings when importing Google Sheets.

 

Known Issues

None.

February 14, 2020

Release 6.8.1-push2

This is the initial release of Cloud Dataprep Premium by TRIFACTA® INC..

Features


Changes

None.

Deprecated

None.

Fixes

TD-40348: When loading a recipe in an imported flow that references an imported Excel dataset, Transformer page displays Input validation failed: (Cannot read property 'filter' of undefined) error, and the screen is blank.

 

Known Issues

None.

February 12, 2020

Release 6.8.1

Features

Macros:

 


Improved RapidTarget: Improved matching logic and performance when matching columns through RapidTarget.

  • Align column based on the data contained in them, in addition to column name. 
  • This feature is enabled by default. For more information, see Overview of RapidTarget.

 


Download visual profile: You can download a JSON version of the visual profile. See Job Details Page.

 


Improved Date/Time format selection: See Choose Datetime Format Dialog.

  • Tip: Datetime formats in card suggestions now factor in the user's locale settings for greater relevance.

 


Enable or disable keyboard shortcuts: Individual users can now enable or disable keyboard shortcuts in the workspace or Transformer page. See User Profile Page.

 


Duplicate datasets while copying: You can optionally duplicate the datasets from a source flow when you create a copy of it. See Flow View Page.

 


Create a copy of your imported dataset: See Library Page.

 


Define all columns: Select columns, functions applied to your source, and constants to replace your current dataset. See Select

 


Search panel improvements: Improvements to the Search panel enable faster discovery of transformations, functions, and other objects. See Search Panel.

 


APIs: Apply overrides at time of job execution via API.

  • When you are running a job, you can override the default publication settings for the job using overrides in the request. For more information, see API Workflow - Run Job.

 


New functions:

 

Changes

Browser Support Policy:

  • For supported browsers, at the time of release, the latest stable version and the two previous stable versions are supported.

  • NOTE: Stable browser versions released after a given release of Cloud Dataprep by TRIFACTA INC. will NOT be supported for any prior version of Cloud Dataprep by TRIFACTA INC..  A best effort will be made to support newer versions released during the support lifecycle of the release.

 

Import/Export: Flows can now be exported and imported across products and versions of products.

 

Deprecated

Re-run jobs using Cloud Dataflow templates:

  • In prior releases, you could re-run a Cloud Dataprep by TRIFACTA INC. job by configuring the Cloud Dataflow template with input and output parameters for the job.
  • As of this release, Cloud Dataprep by TRIFACTA INC. will continue to generate Cloud Dataflow templates, but they are no longer recommended for use in programmatic execution of Cloud Dataflow jobs.
  • Instead, you can now run jobs and monitor them through exposed API endpoints. For more information, see API Reference.
  • Support for Cloud Dataflow templates will be decommissioned in March 2020 (previously planned for Dec 2019)

 

Known Issues

TD-47263: Importing an exported flow that references a Google Sheets or Excel source breaks connection to input source.

  • Workaround: If the importing user has access to the source, the user can re-import the dataset and then swap the source for the broken recipe.

 


TD-46185: Stepping backward to an early step in a recipe sometimes fails to properly update the state of the quality bar and histograms in the data grid.

  • Workaround: This issue is caused by caching of snapshot profiles from the data grid. The workaround is to reload the page through the browser.

 


TD-45122: API: re-running job using only the wrangleDataset identifier fails even if the original job succeeds when writeSettings were specified.

  • Workaround: Use a full jobGroups job specification each time that you run a job. See Cloud Dataprep by TRIFACTA INC. API Reference docs: Premium | Standard

 

Fixes

TD-44548: RANGE function returns null values if more than 1000 values in output.

 


TD-43877: Preview after a DATEFORMAT step does not agree with results or profile values.

 


TD-43849: Export flows are broken when recipe includes Standardization or Transform by Example tasks.

 


TD-42080: Cannot run flow that contains more than 10 recipe jobs

 

September 16, 2019

Release 6.4.1

Features

Introducing APIs: Manage job execution via API.

  • Cloud Dataprep by TRIFACTA INC. now supports API endpoints for programmatic execution and monitoring of Trifacta jobs. Beginning in this release, you can use token-based security to manage the launching and execution of Cloud Dataprep by TRIFACTA INC. jobs. For more information, see API Reference.
  • This API should be used as a replacement for Cloud Dataflow templates for programmatic invocation of Trifacta jobs. In addition, this feature includes support for dynamic functions and input & output destinations.
  • NOTE: Cloud Dataflow templates generated by Cloud Dataprep by TRIFACTA INC. are still supported but are no longer recommended for use.

 

Changes

None.

Deprecated

Re-run jobs using Cloud Dataflow templates:

  • In prior releases, you could re-run a Cloud Dataprep by TRIFACTA INC. job by configuring the Cloud Dataflow template with input and output parameters for the job.
  • As of this release, Cloud Dataprep by TRIFACTA INC. will continue to generate Cloud Dataflow templates, but they are no longer recommended for use in programmatic execution of Cloud Dataflow jobs.
  • Instead, you can now run jobs and monitor them through exposed API endpoints. For more information, see API Reference.
  • Support for Cloud Dataflow templates will be decommissioned in December 2019 March 2020.

 

Known Issues

TD-43284: When running a job via API, you cannot apply setting overrides, parameter values, or other execution settings as part of the job definition in the API. The job will be executed with the settings, parameter values and execution settings defined in the UI for that job.

  • You can run the job using default settings only through the API. Please the the Run Job UI in order to change the default settings.
  • For more information, see Cloud Dataprep by TRIFACTA INC. API Reference docs: Premium | Standard

 

Fixes

None.

September 11, 2019

Release 6.4.1

Features

Introducing recipe macros: User-defined macros enable saving and reusing sequences of steps. For more information, see Overview of Macros.

 

Introducing Transformation by Example: Transform by example output values for a column of values. See Transformation by Example Page.

 

Redesigned Recipe Panel: Multi-step operations and more robust copy and paste actions are now supported. See Recipe Panel.

 

Browse flow for joins: Browse your current flow for datasets or recipes to join into the current recipe. See Join Window.

 


Replace cell values: Build transformations to replace specific cell values. See Replace Cell Values.

 

Parameter overrides to destinations: Parameterize output paths and table and file names for dynamic destinations. See Run Job Page.

 

Specify VPC networks and sub-nets: You can specify your own Google VPC network and the sub-net IP address range to use for individual job execution or for your project. For more information, see Project Settings Page.

 

Broader support for metadata references: Broader support for metadata references. For Excel files, $filepath references now return the location of the source Excel file. Sheet names are appended to the end of the reference. See Source Metadata References.

 


Changes

PNaCl browser extension no longer supported: Please verify that all users of Cloud Dataprep by TRIFACTA INC. are using a supported version of Google Chrome, which automatically enables use of WebAssembly. For more information, see Desktop Requirements .

 

Documentation errata: In prior releases, the documentation listed UTF32-BE and UTF32-LE as supported file formats. These formats are not supported. Documentation has been updated to correct this error. See Supported File Encoding Types

 

Deprecated

None.

Known Issues

None.

Fixes

TD-40424: UTF-32BE and UTF-32LE are available as supported file encoding options. They do not work.

  • NOTE:  Although these options are available in the application, they have never been supported in the underlying platform. They have been removed from the interface.

 

TD-39296: Cannot run Cloud Dataflow jobs on datasets with parameters sourced from Parquet file or files. 

 

For release notes from previous releases, see Earlier Releases of Cloud Dataprep.


This page has no comments.