Page tree

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 32 Next »

Trifacta Dataprep



Contents:


   

Contents:


These release notes apply to the following product tiers of Dataprep by Trifacta®:

  • Dataprep Enterprise Edition by Trifacta
  • Dataprep Professional Edition by Trifacta
  • Dataprep Starter Edition by Trifacta
  • Dataprep Premium by Trifacta
  • Dataprep Standard by Trifacta
  • Dataprep Legacy by Trifacta

Tip: You can see your product tier in the Trifacta application. Select Help menu > About Cloud Dataprep.

For more information, see Product Editions.

For release notes from previous releases, see Earlier Releases of Cloud Dataprep.

July 20, 2021

Release 8.5

What's New

Tip: When you complete your Dataprep Enterprise Edition by Trifacta or Dataprep Professional Edition by Trifacta trial, you can choose to license a higher or lower tier product edition. For more information, see Product Editions.


Parameterization:

  • Create environment parameters to ensure that all users of the project or workspace use consistent references.

    NOTE: You must be a workspace administrator or project owner to create environment parameters.

    Tip: Environment parameters can be exported from one project or workspace and imported into another, so that these references are consistent across the enterprise.

  • Parameterize names of your storage buckets using environment parameters.

Schedules:

  • Project owners and workspace administrators can review, enable, disable, and delete schedules through the application.

    Feature Availability: This feature is not available in
    Dataprep Starter Edition by Trifacta only.

    See Schedules Page.

Flow View:

Job execution:

Connectivity:

Contribute to the future direction of connectivity: Click I'm interested on a connection card to upvote adding the connection type to the Trifacta application. See Create Connection Window.

  • Early Preview (read-only) connections available with this release:

    Feature Availability: This feature is available in the following editions:

    • Dataprep Enterprise Edition by Trifacta
    • Dataprep Professional Edition by Trifacta
    • Dataprep Premium by Trifacta

  • Apache Impala

Connectivity:

  • Connect to your relational database systems hosted on Cloud SQL. In the Connections page, click the Cloud SQL card for your connection type.
    Feature Availability: This feature is available in the following editions:

    • Dataprep Enterprise Edition by Trifacta®
    • Dataprep Professional Edition by Trifacta
    • Dataprep Premium by Trifacta


    For more information, see Create Connection Window

Connectivity:


API:

  • Cancel in-progress Dataflow jobs via API.

    Feature Availability: This feature is available in the following editions:

    • Dataprep Enterprise Edition by Trifacta
    • Dataprep Professional Edition by Trifacta
    • Dataprep Premium by Trifacta
    • Dataprep Standard by Trifacta

    See Changes to the APIs.

Language:

  • NUMVALUE function can be used to convert a String value formatted as a number into an Integer or Decimal value.
  • NUMFORMAT function now supports configurable grouping and decimal separators for localizing numeric values.
  • For more information, see Changes to the Language.


Performance:

  • Improved performance when browsing folders containing a large number of files on Base Storage

Changes

None.

Deprecated

None.

Known Issues

None.

Fixes

  • TD-62190: You may not be able to view the SQL that was used to execute a job within BigQuery. This issue is due to a regression in the new BigQuery console in which job identifiers containing dashes are not supported. A ticket has been filed with Google.

June 7, 2021

Release 8.4

What's New

Template Gallery:

  • Check out the new gallery of flow templates, which can be imported into your workspace. These templates are pre-configured to solve the most compelling loading and transformation use cases in the product. For more information, see www.trifacta.com/templates.
    • For more information on importing flows into your workspace, see Import Flow.
    • For more information on using a template in the product, see Start with a Template

Connectivity:

  • Early Preview (read-only) connections available with this release:

    Feature Availability: This feature is available in the following editions:

    • Dataprep Enterprise Edition by Trifacta
    • Dataprep Professional Edition by Trifacta
    • Dataprep Premium by Trifacta

  • Splunk
  • YouTube Analytics

Collaboration:


Support for delete actions on merge (upsert) operations in BigQuery:

When publishing to a BigQuery table, you can choose to update or, with this release, to delete matching records during a merge option. For more information, see BigQuery Table Settings.

Job execution:

You can choose to ignore the recipe errors before job execution and then review any errors in the recipe through the Job Details page.

Language:

Changes

Trifacta Photon limits on execution time

Trifacta Photon is an in-memory running environment that is hosted on the same node as Dataprep by Trifacta, which allows for faster execution suitable for small- to medium-sized jobs.

Feature Availability: This feature is not available in
Dataprep Legacy by Trifacta only.

NOTE: Jobs that are executed on Trifacta Photon may be limited to run for a maximum of 10 minutes, after which they fail with a timeout error. If your job fails due to this limit, please switch to running the job on Dataflow.

Trifacta Photon can be enabled or disabled by a project administrator. For more information, see Dataprep Project Settings Page.

Execution of scheduled jobs on Trifacta Photon is not supported

In conjunction with the previous change, execution of scheduled jobs is not supported on Trifacta Photon. Since Trifacta Photon jobs are now limited to 10 minutes of execution time, scheduled jobs have been automatically migrated to execution on Dataflow to provide better execution success. For more information, see Trifacta Photon Running Environment.

Deprecated

None.

Known Issues

  • TD-62190: You may not be able to view the SQL that was used to execute a job within BigQuery. This issue is due to a regression in the new BigQuery console in which job identifiers containing dashes are not supported. A ticket has been filed with Google.

Fixes

  • TD-60881:  Incorrect file path and missing file extension in the application for parameterized outputs
  • TD-60382: Date format M/d/yy is handled differently by PARSEDATE function on Trifacta Photon and Spark.

May 20, 2021

Release 8.3 - push 3

What's New

Connectivity:

  • Support for SFTP connections.

    Feature Availability: This feature is available in the following editions:

    • Dataprep Enterprise Edition by Trifacta
    • Dataprep Professional Edition by Trifacta
    • Dataprep Premium by Trifacta


    NOTE: This connection type is import only.

    For more information, see SFTP Connections.

Changes

Trifacta Photon enabled by default

Trifacta Photon is an in-memory running environment that is hosted on the same node as Dataprep by Trifacta, which allows for faster execution suitable for small- to medium-sized jobs.

Feature Availability: This feature is not available in
Dataprep Legacy by Trifacta only.

NOTE: Jobs executed in Trifacta Photon are executed within the Trifacta VPC. Data is temporarily streamed to the Trifacta VPC during job execution and is not persisted.

Beginning in this release, Trifacta Photon is enabled by default. Users can choose to run jobs on Trifacta Photon.

NOTE: For Dataprep Enterprise Edition by Trifacta, Trifacta Photon is enabled by default for new projects. For existing projects, a project administrator must still choose to enable it.

Trifacta Photon can be enabled or disabled by a project administrator. For more information, see Dataprep Project Settings Page.

Deprecated

None.

Known Issues

None.

Fixes

None.

May 10, 2021

Release 8.3

What's New

Running Environments:

Cancel Jobs in Dataflow:

You can cancel  Dataflow jobs directly from the product.

NOTE: In some cases, the product is unable to cancel the job from the application. In these cases, click View in Dataflow Job and from there you can cancel the job in progress .

Support for merge (upsert) operations in BigQuery:

When publishing to a BigQuery table, you can choose to write results using the merge option. When selected, you specify a primary key of fields and then decide how data is merged into the table. For more information, see BigQuery Table Settings.

Connectivity:

  • Early Preview (read-only) connections available with this release:

    Feature Availability: This feature is available in the following editions:

    • Dataprep Enterprise Edition by Trifacta
    • Dataprep Professional Edition by Trifacta
    • Dataprep Premium by Trifacta

  • Authorize.net
  • Cockroach DB
  • DB2

  • Google Data Catalog
  • Google Spanner

  • Magento
  • Redis
  • Shopify
  • Smartsheet
  • Trello
  • QuickBase

Job execution:

Introducing new filter pushdowns to optimize the performance of your flows during job execution. For more information, see Flow Optimization Settings Dialog.

Job results:

You can now preview job results and download them from the Overview tab of the Job details page. For more information, see Job Details Page.

Tip: You can also preview job results in Flow View. See View for Outputs.

Changes

Improved method of JSON import

Beginning in this release, the Trifacta application now uses the conversion service to ingest JSON files during import. This improved method of ingestion can save significant time wrangling JSON into records.

NOTE: The new method of JSON import is enabled by default but can be disabled as needed.

For more information, see Working with JSON v2.

Flows that use imported datasets created using the old method continue to work without modification.

NOTE: It is likely that support for the v1 version of JSON import is deprecated in a future release. You should switch to using the new version as soon as possible. For more information on migrating your flows and datasets to use the new version, see Working with JSON v1.

Future work on support for JSON is targeted for the v2 version only.

Optionally, you can re-enable the old version, which is useful for migrating to the new version.

Feature Availability: This feature is not available in
Dataprep Legacy by Trifacta only.

For more information on using the old version and migrating to the new version, see Working with JSON v1.

Deprecated

None.

Known Issues

  • TD-61478: Time-based data types are imported as String type from BigQuery sources.

Fixes

  • TD-60701: Most non-ASCII characters incorrectly represented in visual profile downloaded in PDF format.
  • TD-59854: Datetime column from Parquet file incorrectly inferred to the wrong data type on import.

April 26, 2021

Release 8.2 push2

What's New

Upgrade: Trial customers can upgrade through the Admin console. See Admin Console.

This is the initial release of for the following product tiers:

  • Dataprep Enterprise Edition by Trifacta
  • Dataprep Professional Edition by Trifacta
  • Dataprep Starter Edition by Trifacta

Changes

None.

Deprecated

None.

Known Issues

None.

Fixes

None.

April 14, 2021

Release 8.2

This is the initial release of for the following product tiers:

  • Dataprep Enterprise Edition by Trifacta
  • Dataprep Professional Edition by Trifacta
  • Dataprep Starter Edition by Trifacta

What's New

Photon:

Introducing Trifacta Photon, an in-memory running environment for running jobs. Embedded in the Dataprep by Trifacta, Trifacta Photon delivers improved performance in job execution and is best-suited for small- to medium-sized jobs.

Feature Availability: This feature is not available in
Dataprep Legacy by Trifacta only.

NOTE: Trifacta Photon must be enabled by a project owner. For more information, see Dataprep Project Settings Page.

  • When you choose to run a job, you can now choose to run a job on Trifacta Photon.
  • For more information, see Run Job Page .

Quick scan sampling:

  • Trifacta Photon also enables quick scan sampling. A quick scan sample generates an appropriate selection of rows from the dataset from which the sample was initiated. These samples are faster to generate. For more information, see Overview of Sampling.
  • For more information on generating samples, see Samples Panel.

Preferences:

  • Re-organized user account, preferences, and storage settings to streamline the setup process. See Preferences Page.

Connectivity:

  • Early Preview (read-only) connections available with this release:

    Feature Availability: This feature is available in the following editions:

    • Dataprep Enterprise Edition by Trifacta
    • Dataprep Professional Edition by Trifacta
    • Dataprep Premium by Trifacta


Plan metadata references:

Feature Availability: This feature is available in the following editions:

  • Dataprep Enterprise Edition by Trifacta
  • Dataprep Professional Edition by Trifacta
  • Dataprep Premium by Trifacta

Use metadata values from other tasks and from the plan itself in your HTTP task definitions.


Improved accessibility of job results:

The Jobs tabs have been enhanced to display the list of latest and the previous jobs that have been executed for the selected output.

For more information, see View for Outputs.

Sample Jobs Page:

You can monitor the status of all sample jobs that you have generated. Project administrators can access all sample jobs in the workspace. For more information, see Sample Jobs Page.

Simplified output and destination experience:

From the Home Page, you can quickly redesign your output and destination experience. The step-by-step procedures enables you to create an improved and streamlined output creation experience. For more information, see Start with a Template.

Changes

Improved methods for disabling the product:

Project owners can choose to disable Dataprep by Trifacta from within the product. For more information, see Enable or Disable Dataprep.

After the product has been disabled in a project, Trifacta data is placed in a hidden state for later purging. For more information on purging or restoring data, see Wipe Out Dataprep Data.

API:

The following API endpoints are scheduled for deprecation in a future release:

NOTE: Please avoid using the following endpoints.

/v4/connections/vendors
/v4/connections/credentialTypes
/v4/connections/:id/publish/info
/v4/connections/:id/import/info

These endpoints have little value for public use.

Deprecated

None.

Known Issues

  • TD-60701: Most non-ASCII characters incorrectly represented in visual profile downloaded in PDF format.

Fixes

  • TD-59236:  Use of percent sign (%) in file names causes Transformer page to crash during preview.
  • TD-59218:  BOM characters at the beginning of a file causing multiple headers to appear in Transformer Page.

March 16, 2021

Release 8.1

What's New

Connectivity:

  • Introducing Early Preview connections. In each release of cloud-based product editions, new connection types may be made available in read-only mode for users to begin exploring their datasets stored in the connected datastores.

    NOTE: Early Preview connection types are read-only and are subject to change before they may be made generally available.

    Feature Availability: This feature is available in
    Dataprep Premium by Trifacta only.
  • Early Preview connections available with this release:
    • Airtable
    • Cassandra
    • Freshdesk
    • Google Analytics
    • MailChimp

Specify column headers during import:

You can specify the column headers for your dataset during import. For more information, see Import Data Page.

Sample Jobs Page:

You can monitor the status of all sample jobs that you have generated. Project administrators can access all sample jobs in the workspace. For more information, see Sample Jobs Page.

Job results:

Results of data quality checks are now part of the visual profile PDF available with your job results. In the PDF, you can download the data quality results over the entire dataset .

Feature Availability: This feature is available in
Dataprep Premium by Trifacta only.


  • Visual profiling must be enabled for the job.
  • For more information, see Job Details Page.

Sharing:

  • Define permissions on individual objects when they are shared.

    NOTE: Fine-grained sharing permissions apply to flows and connections only.

    For more information, see Changes to User Management.

API:

  • You can now transfer ownership of assets created in Dataprep by Trifacta between users, based on their user identifiers or email addresses. For more information, see Changes to the APIs.
  • Customize connection types (connectors) to ensure consistency across all connections of the same type and to meet your enterprise requirements. For more information, see Changes to the APIs.

Macro updates:

You can replace an existing macro definition with a macro that you have exported to your local desktop.

NOTE: Before you replace the existing macro, you must export a macro to your local desktop. For more information, see Export Macro.

For more information, see Macros Page.

Changes

Freed IP address ranges:

The following IP address range is the only one in use by the Trifacta Service:

34.68.114.64/28

Please discontinue whitelisting any other IP address ranges for the Trifacta Service.

These ranges have been freed to the general Internet.

Changes to Preferences:

The Preferences area of the Trifacta application has been changed. For more information, see Changes to Configuration.

Deprecated

None.

Known Issues

  • TD-58523: Cannot import dataset with filename in Korean alphabet from HDFS.

    • Workaround: You can upload files with Korean characters from your desktop. You can also add a 1 to the end of the file on HDFS, and it can then be imported.

  • TD-55299: Imported datasets with encodings other than UTF-8 and line delimiters other than \n may generate empty outputs on Spark or Dataflow running environments.

  • TD-51516: Input data containing BOM (byte order mark) characters may cause Spark or Dataflow running environments to read data improperly and/or generate invalid results.

Fixes

  • TD-56170: The Test Connection button for some relational connection types does not perform a test authentication of user credentials.
  • TD-54440: Header sizes at intermediate nodes for JDBC queries cannot be larger than 16K.
    • Previously, the column names for JDBC data sources were passed as part of a header in a GET request. For very wide datasets, these GET requests often exceeded 16K in size, which represented a security risk.

February 16, 2021

Release 8.0

Features

Tip: Add a profile picture to your account! For more information, see User Profile Page.

Flow templates:

Introducing flow templates, which are predefined flows with guidelines for creating the flow objects needed to solve a specific transformation and publication use case. These step-by-step guides leverage placeholders for flow objects to assist you in rapidly assembling your end-to-end flow pipeline.

The first available template simplifies the Data Warehouse Onboarding process, which simplifies the ingestion of datasets, transformation of them, and loading them into your data warehouse. From the Home page, you can quickly set up a pipeline from data lakes into data warehouses:

  • GCS  to BigQuery pipelineUse this template to create a flow by importing a Google Cloud Storage, transforming the data, and publishing the outputs on the BigQuery. For more information, see Start with a Template.

Authorization:

APIs:

  • Individual workspace users can be permitted to create and use their own access tokens for use with the REST APIs. For more information, see Dataprep Project Settings Page.

Connectivity:

  • Support for connections to SharePoint Lists. See SharePoint Connections.
  • Support for using OAuth2 authentication for Salesforce connections. See Salesforce Connections.

  • Support for re-authenticating through connections that were first authenticated using OAuth2.

Import:

  • Improved method for conversion and ingestion of XLS/XSLX files. For more information, see Import Excel Data.

Recipe development:

  • The Flag for Review feature enables you to set review checkpoints in your recipes. You can flag recipe steps for review by other collaborators for review and approval. For more information, see Flag for Review.

Metric-based data quality rules:

Update Macros:

  • Replace / overwrite an existing macro's steps and inputs with a newly created macro.
  • Map new macro parameters to the existing parameters before replacing.
  • Edit macro input names and default values as needed. 

Job execution:

  • You can enable the Trifacta application to apply SQL filter pushdowns to your relational datasources to remove unused rows before their data is imported for a job execution. This optimization can significantly improve performance as less data is transferred during the job run. For more information, see Flow Optimization Settings Dialog.
  • Optimizations that were applied during the job run now appear in the Job Details Page. See Job Details Page.

Changes

None.

Deprecated

None.

Known Issues

  • TD-56830: Receive malformed_query: enter a filter criterion when importing table from Salesforce.

    • Feature Availability: This feature is available in Dataprep Premium by Trifacta only.
    • NOTE: Some Salesforce tables require mandatory filters when they are queried. Mandatory filters are not currently supported for Salesforce connections


  • TD-56170: The Test Connection button for some relational connection types does not perform a test authentication of user credentials.

    • Workaround: Append the following to your Connect String Options:

      ;ConnectOnOpen=true
    • This option forces the connection to validate user credentials as part of the connection. There may be a performance penalty when this option is used.

Fixes

None.


January 12, 2021

Release 7.10

Features

In-app chat: Have a question about the product? Use the new in-app chat feature to explore content or ask a question to our support staff. If you need assistance, please reach out!

In-app tours: Check out the new in-app tours, which walk you through the steps of wrangling your datasets into clean, actionable data. 

  • NOTE: The product tours that were accessible through the Home page of the Trifacta application are no longer available.

Import: The maximum permitted size of a file uploaded through Trifacta application has been increased from 100 MB to 1 GB.

Import and Export Plans: You can import and export plans from one environment, workspace, or projects to others.

  • Feature Availability: This feature is available in Dataprep Premium by Trifacta only.
  • For more information, see Export Plan.
  • For more information, see Import Plan.

Share Plans : Share plans with one or more users to work together on the same plan.

  • Feature Availability: This feature is available in Dataprep Premium by Trifacta only.
  • For more information, see Share a Plan.

Email notifications : Send email notifications to plan owners and collaborators based on the status of execution of plans.

Connectivity: Improved Salesforce connection type.

  • Feature Availability: This feature is available in Dataprep Premium by Trifacta only.
  • For more information, see Salesforce Connections.

Changes

IP address range whitelist:

  • The list of IP addresses for the Trifacta service has been changed.
    • Feature Availability: This feature is available in Dataprep Premium by Trifacta only.
    • These addresses must be whitelisted to provide access to your relational datastores.
    • Please verify that the following address range is whitelisted for each relational datastore that you wish to access from the product:

      34.68.114.64/28
    • Please remove the following IP address ranges from your relational whitelists. They are no longer used by Dataprep by Trifacta:

      104.198.44.13/32
      34.71.238.145/32
      104.198.217.74/32
      34.68.178.136/32
    • For more information, see Getting Started with Cloud Dataprep.

Changes to IAM roles for service accounts: Recently, Google announced changes to permissions required for attaching IAM roles to service accounts. If you are using IAM roles for your Google service accounts, these changes may require updating the permissions that you must enable in your IAM roles. For more information, see Changes to User Management.

Enable listing all users in the workspace: Beginning in this release, workspace administrators can choose to enable or disable the listing of all workspace users in the  Trifacta application . For example, if you are sharing a flow with another user and this feature is enabled, you can browse the list of all workspace users and select users with whom to share.

  • NOTE: When enabled, users are listed according to their email addresses.
  • NOTE: For Dataprep Legacy by Trifacta , this feature is disabled by default.
  • For more information on enabling or disabling this feature, see  Changes to Configuration.

Deprecated

None.

Known Issues

None.

Fixes

TD-53527:  When importing a dataset via API that is sourced from a BZIP file stored on S3, the columns may not be properly split when the platform is permitted to detect the structure.


December 14, 2020

Release 7.9

Features

In-app chat: Have a question about the product? Use the new in-app chat feature to explore content or ask a question to our support staff. If you need assistance, please reach out!

Plan View: Execute branching and parallel tasks, using success/failure criteria to determine next steps:

  • Feature Availability: This feature is available in Dataprep Premium by Trifacta only.
  • Execute Plan using status rules: Starting in Release 7.9, you can execute tasks based on the previous task execution result. For more information, see Create a Plan.
  • Execute Parallel Plan tasks: In previous releases, plans were limited to a sequential order of task execution. Beginning in Release 7.9, you can create branches in the graph into separate parallel nodes, enabling the corresponding tasks to run in parallel. This feature enables you to have a greater level of control of your plans' workflows. For more information, see Create a Plan .
  • Zoom options: Zoom control options and keyboard shortcuts have been introduced in the plan canvas. For more information, see Plan View Page .
  • Filter Plan Runs: Filter your plan runs based on dates or plan types. For more information, see Plan Runs Page.

Transform Builder: An All option has been added for selecting columns in the Transform Builder.  For more information, see Changes to the Language.

API access tokens: Individual project users can now be permitted to create and use their own access tokens for use with the REST APIs. For more information, see Dataprep Project Settings Page.

Manage data access: You can control user access to datastores such as Base Storage and BigQuery based on finer-grained permissions assigned to the user's IAM role.

  • Feature Availability: This feature is available in Dataprep Premium by Trifacta only.
  • For more information, see Changes to User Management.

Changes

Changes to permissions: The set of required and optional permissions has changed for Dataprep Premium by Trifacta.

  • Feature Availability: This feature is available in Dataprep Premium by Trifacta only.
  • The set of required permissions to use the product has been reduced to only the permissions required to access the Trifacta application and related features.
  • NOTE: Permissions required to access external, optional services such as BigQuery are considered optional.
  • Administrators may wish to adjust permissions accordingly. For more information, see Changes to User Management.
  • For general information on required and optional permissions, see Required Dataprep User Permissions.

Job results page changes: The old flow view in the dependency graph tab is replaced with the new flow view.

  • The dependencies tab is renamed as dependency graph tab.
  • For more information, see Job Details Page.

Optimizer service re-enabled: In the September release, the Optimizer service was introduced, but an issue was discovered, which caused us to disable the service temporarily. This issue has been fixed.

  • The Optimizer service optimizes query execution against data sources to minimize use ofDataprep by Trifacta resources, reduce compute costs, and improve overall job execution time.
  • No configuration is required.
  • Feature Availability: This feature is available in the following editions: Dataprep Premium by Trifacta  and  Trifacta
  • You can apply optimizations for individual flows. For more information, see Flow Optimization Settings Dialog.

Deprecated

None.

Known Issues

None.

Fixes

TD-53475:  Missing associated artifact error when importing a flow.


November 17, 2020

Release 7.8

Features

Plans:

  • Create HTTP tasks for your plans, which can be configured to issue a request to an API endpoint over HTTP.

    Feature Availability: This feature is available in
    Dataprep Premium by Trifacta only.

Flow View:

  • Get started adding and building objects in your flows with new object placeholders.

  • Automatically organize the nodes of your flow with a single click.
  • The viewport position and zoom level are now preserved when returning to a given flow.

  • See Flows Page.

Data quality rules:

  • Improvements to suggestions for data quality rules:

    • Existing Data quality rules panel is updated with View suggestions button.
    • Data quality rules are now categorized by suggestions for individual columns.
    • Some rule types support the May be missing checkbox. When it is selected, the data quality rule allows missing values to be acceptable for a specified column. For more information on data quality rules, see Data Quality Rules Panel.

Language:

  • Rename columns now supports  uppercase or lowercase characters or shorten column names to a specified character length from the left or right. For more information, see Rename Columns.

APIModify the source Base Storage bucket and path for a defined imported dataset.

Feature Availability: This feature is available in Dataprep Premium by Trifacta only.

For more information, see API Workflow - Swap Datasets.

Changes

JDBC connection pooling disabled: The ability to create connection pools for JDBC-based connections has been disabled. It is likely to be removed in a future release.

Feature Availability: This feature is available in Dataprep Premium by Trifacta only.

Deprecated Parameter History Panel Feature: As a part of collaborative suggestions enhancement, the support for Parameter History panel is deprecated from the software. For more information on collaborative suggestions feature, see Overview of Predictive Transformation.

Automatic random samples are disabled: In prior releases, a random sample of the data was automatically generated for display when a recipe with source data greater than 10MB was first loaded into the Transformer page. The Initial Sample, which is the first set of rows in the dataset was displayed by default, and this automatic random sample was available for manually selection if needed.

NOTE: Recent issues with long-running random sample jobs require that the generation of automatic random samples must be disabled until the issues are addressed.

You can still generate random samples manually. If sample generation takes too long, you can cancel it and select a different sampling type. For more information, see Samples Panel.

Classic Flow View no longer available: In Release 7.6, an improved version of Flow View was released. At the time of release, users could switch back to using the classic version. 

Beginning in this release, the classic version of Flow View is no longer available. 

Tip: The objects in your flows that were created in classic Flow View may be misaligned in the new version of Flow View. You can use auto-arrange to re-align your flow objects.

For more information, see Flow View Page.

Enhanced Flow and Flow View menu options: The context menu options for Flow View and Flow have been renamed and reorganized for better user experience.

Salesforce connector disabled temporarily: In Release 7.8, the Salesforce connector has been disabled temporarily. In a future release, it will be replaced with an improved version of the Salesforce connector.

Feature Availability: This feature is available in Dataprep Premium by Trifacta only.

Deprecated

None.

Known Issues

TD-55503: When you swap datasets via API, existing samples are not discarded. These samples are invalid.

  • Workaround: This issue does not occur if you swap datasets through the Trifacta application. If it does occur via API, you can collect a new sample manually. See Samples Panel.

Fixes

TD-53318: Cannot publish results to relational targets when flow name or output filename or table name contains a hyphen (e.g. my - filename.csv).


October 2, 2020

Release 7.6 push 3

Features

None.

Changes

Disabled Optimizer Service: In the September release, the Optimizer service was introduced, which enabled users to apply advanced physical and logical optimizations for flow and job executions. Recently, an issue was discovered, which has caused us to disable the service temporarily.

  • This issue affected a very small number of users who were using the new feature. Now that the feature is disabled, impacted users should experience impacts only to performance of flow and job executions. Performance should be similar to pre-release of the service.
  • The Optimizer service was disabled through a configuration change that did not require any service interruptions. Users should not experience any loss of functionality or availability due to the work to resolve this issue.
  • The Trifacta Engineering team is actively working to resolve the issue. Thank you for patience. If you have further questions, please contact  Trifacta Support.

Deprecated

None.

Known Issues

None.

Fixes

None.


October 1, 2020

Release 7.6 push 3

Features

None.

Changes

Shared VPCs: Across all product editions, you can now run jobs through another project by specifying a full URL for the shared VPC.

  • Previously, this capability was only available for Dataprep Premium by Trifacta. This restriction has been lifted. It is now available for Dataprep Standard by Trifacta and Dataprep Legacy by Trifacta, too.
  • For more information on applying the shared VPC to a job, see Dataflow Execution Settings .
  • For more information on applying a shared VPC to jobs in the project, see Execution Settings Page.

    NOTE: You must create new or replace output objects to use these shared VPC settings across your project.

Deprecated

None.

Known Issues

None.

Fixes

None.


September 21, 2020

Release 7.6

Features

Plans:

  • Apply overrides to recipe parameters for your plans. See Plan View Page.

    Feature Availability: This feature is available in
    Dataprep Premium by Trifacta only.

In-app messaging:  Be sure to check out the new in-app messaging feature, which allows us to share new features and relevant content to Dataprep by Trifacta users. More developments coming soon!

Project Settings:

  • Project administrators can configure settings that are applied to the Dataprep by Trifacta project. For more information, see Dataprep Project Settings Page.

MySQL:

  • Support for connections to MySQL databases.
  • Feature Availability: This feature is available in Dataprep Premium by Trifacta only.
  • See MySQL Connections.

Optimizer Service: The optimizer service optimizes query execution against data sources to minimize use of Dataprep by Trifacta resources, reduce compute costs, and improve overall job execution time.

  • No configuration is required.
  • Feature Availability: This feature is available in the following editions: Dataprep Premium by Trifacta  and  Trifacta
  • You can apply optimizations for individual flows. For more information, see Flow Optimization Settings Dialog.

Relational long-loading: For relational datasources that take time to load, you can continue to work while monitoring the loading process through the Import Data page, Flow View, or Dataset Details. For more information, see Overview of Job Monitoring.

Documentation:

  • Additional connect string options and troubleshooting information has been included for specific relational connections.
  • Feature Availability: This feature is available in the following editions: Dataprep Premium by Trifacta  and  Dataprep Standard by Trifacta
  • For more information, see Connection Types.

Changes

None.

Deprecated

None.

Known Issues

None.

Fixes

TD-52559:  When publishing a single CSV file with headers using the append or overwrite publishing action, multiple instances of the header may be written in the output file.

TD-48915: Inserting special characters in an output filename results in a validation error in the the application and job failures.


August 4, 2020

Release 7.5

Features

New Flow View is now available:

  • Drag and drop to reposition objects on the Flow View canvas, and zoom in and out to focus on areas of development.
  • Perform joins and unions between objects on the Flow View canvas. 
  • Annotate the canvas with notes.

  • Tip: You can still access classic Flow View through the context menu in Flow View.

Data quality rules: Introducing data quality rules, which enable you to define data quality checks specific to your dataset. For more information, see Overview of Data Quality.

  • Feature Availability: This feature is available in Dataprep Premium by Trifacta only.
  • Data quality rules are created in the Transformer page. For more information, see Data Quality Rules Panel.

Flow Sharing API

  • Feature Availability: This feature is available in Dataprep Premium by Trifacta only.
  • When you share a flow via API, you can now pass in a user's email address. For more information, see Changes to the APIs.

New functions:

Changes

Delete Table IAM permission no longer required

  • Previously, the following IAM permission was required: bigquery.tables.delete
  • This permission is no longer required.

    NOTE: If this permission is not included in a user's account, some publishing actions on BigQuery, such as drop and truncate, are not possible. For more information, see Required Dataprep User Permissions.

  • Feature Availability: This feature is available in Dataprep Premium by Trifacta only.

Add new IP address to whitelist

  • Please add that the following range of IP addresses to the whitelist for the Trifacta services for access to relational datasources in your enterprise

    34.68.114.64/28
  • Feature Availability: This feature is available in Dataprep Premium by Trifacta only.
  • For more information, see Getting Started with Cloud Dataprep.

Deprecated

None.

Known Issues

TD-50942: If a flow is unshared with you, you cannot see or access the datasources for any jobs that you have already run on the flow. You can still access the job results.

Fixes

TD-49559:  Cannot select and apply custom data types through column Type menu.

TD-47473: Uploaded files (CSV, XLS) that contain a space in the filename fail to be converted.

TD-34840: Platform fails to provide suggestions for transformations when selecting keys from an object with many of them.

Earlier Releases

For release notes from previous releases, see Earlier Releases of Cloud Dataprep .

  • No labels

This page has no comments.