...

For the latest release notes, see Release Notes for Dataprep by Trifacta.

March 16, 2021

Release 8.1

What's New

Connectivity:

  • Introducing Early Preview connections. In each release of cloud-based product editions, new connection types may be made available in read-only mode for users to begin exploring their datasets stored in the connected datastores.

    NOTE: Early Preview connection types are read-only and are subject to change before they may be made generally available.

    D s ed
    editionsgdppr

  • Early Preview connections available with this release:
    • Airtable
    • Cassandra
    • Freshdesk
    • Google Analytics
    • MailChimp

...

For more information, see Macros Page.

Changes

Freed IP address ranges:

The following IP address range is the only one in use by the service:

...

The Preferences area of the application has been changed. For more information, see Changes to Configuration.

Deprecated

None.

Known Issues

  • TD-58523: Cannot import dataset with filename in Korean alphabet from HDFS.

    • Workaround: You can upload files with Korean characters from your desktop. Alternatively, append a 1 to the end of the filename on HDFS, after which the file can be imported.

  • TD-55299: Imported datasets with encodings other than UTF-8 and line delimiters other than \n may generate empty outputs on Spark or Dataflow running environments.

  • TD-51516: Input data containing BOM (byte order mark) characters may cause Spark or Dataflow running environments to read data improperly and/or generate invalid results.
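One way to sidestep TD-55299 and TD-51516 is to normalize files before import. The following is a minimal sketch, not a product feature: the function name `normalize_to_utf8` is hypothetical, and it assumes you know the source encoding of the file. It re-encodes the data as UTF-8, drops a leading BOM, and converts line delimiters to \n.

```python
import codecs


def normalize_to_utf8(raw: bytes, source_encoding: str = "utf-8") -> bytes:
    """Return UTF-8 bytes without a BOM and with \\n line delimiters."""
    # "utf-8-sig" decodes UTF-8 and drops a leading BOM if one is present.
    if source_encoding.lower() in ("utf-8", "utf8"):
        source_encoding = "utf-8-sig"
    text = raw.decode(source_encoding)
    # Codecs such as "utf-16-le" keep the BOM as U+FEFF; drop it defensively.
    text = text.lstrip("\ufeff")
    # Convert \r\n and bare \r delimiters to \n.
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    return text.encode("utf-8")


sample = codecs.BOM_UTF8 + b"id,name\r\n1,Ada\r\n"
print(normalize_to_utf8(sample))  # b'id,name\n1,Ada\n'
```
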

Fixes

  • TD-56170: The Test Connection button for some relational connection types does not perform a test authentication of user credentials.
  • TD-54440: Header sizes at intermediate nodes for JDBC queries cannot be larger than 16K.
    • Previously, the column names for JDBC data sources were passed as part of a header in a GET request. For very wide datasets, these GET requests often exceeded 16K in size, which represented a security risk.
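To see why very wide datasets ran into the limit described in TD-54440, the sketch below estimates the size of a header that carries every column name. The comma-joined encoding and the function names are assumptions made for illustration; the actual header format used by the platform is not documented here.

```python
HEADER_LIMIT_BYTES = 16 * 1024  # the 16K limit named in TD-54440


def column_header_size(column_names):
    """Bytes needed for a comma-joined list of column names (assumed format)."""
    return len(",".join(column_names).encode("utf-8"))


def exceeds_header_limit(column_names, limit=HEADER_LIMIT_BYTES):
    """True if the joined column names would not fit within the limit."""
    return column_header_size(column_names) > limit


# Hypothetical wide table: 1000 columns with 24-character names.
wide = [f"customer_attribute_{i:05d}" for i in range(1000)]
print(column_header_size(wide), exceeds_header_limit(wide))
```
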

February 16, 2021

Release 8.0

Features

Tip: Add a profile picture to your account! For more information, see User Profile Page.

...

  • You can enable the application to apply SQL filter pushdowns to your relational datasources to remove unused rows before their data is imported for a job execution. This optimization can significantly improve performance as less data is transferred during the job run. For more information, see Flow Optimization Settings Dialog.
  • Optimizations that were applied during the job run now appear in the Job Details Page. See Job Details Page.
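The idea behind filter pushdown can be illustrated outside the product. The sketch below uses an in-memory SQLite table (the `orders` table and both function names are hypothetical) to compare importing every row and filtering afterward with pushing the filter into the SQL query, so that fewer rows leave the datasource. It illustrates the general technique only and does not show how the Optimizer service is implemented.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "EU", 10.0), (2, "US", 25.0), (3, "EU", 40.0), (4, "APAC", 5.0)],
)


def import_then_filter(conn, region):
    """Fetch every row, then filter in the client."""
    rows = conn.execute(
        "SELECT id, region, amount FROM orders ORDER BY id"
    ).fetchall()
    kept = [row for row in rows if row[1] == region]
    return kept, len(rows)  # second value: rows transferred


def filter_pushdown(conn, region):
    """Let the database apply the filter before any row is transferred."""
    rows = conn.execute(
        "SELECT id, region, amount FROM orders WHERE region = ? ORDER BY id",
        (region,),
    ).fetchall()
    return rows, len(rows)  # second value: rows transferred


print(import_then_filter(conn, "EU")[1], filter_pushdown(conn, "EU")[1])
```

Both functions return the same rows; only the number of rows transferred differs.
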

Changes

None.

Deprecated

None.

Known Issues

  • TD-56830: Receive malformed_query: enter a filter criterion when importing table from Salesforce.

    • D s ed
      editionsgdppr
      oneLinetrue
    • NOTE: Some Salesforce tables require mandatory filters when they are queried. Mandatory filters are not currently supported for Salesforce connections.

...

  • TD-56170: The Test Connection button for some relational connection types does not perform a test authentication of user credentials.

    • Workaround: Append the following to your Connect String Options:

      ;ConnectOnOpen=true


    • This option forces the connection to validate user credentials as part of the connection. There may be a performance penalty when this option is used.
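If you maintain connect strings in scripts, the workaround can be applied with a small helper. This is a sketch under assumptions: `add_connect_option` is a hypothetical name, and `Timeout=30` in the usage line is a made-up option used only to show how an existing value is preserved.

```python
def add_connect_option(connect_string_options: str,
                       option: str = "ConnectOnOpen=true") -> str:
    """Append a ';key=value' option unless the key is already present."""
    existing = [part for part in connect_string_options.split(";") if part.strip()]
    key = option.split("=", 1)[0].strip().lower()
    for part in existing:
        if part.split("=", 1)[0].strip().lower() == key:
            return connect_string_options  # already set; leave unchanged
    return connect_string_options.rstrip(";") + ";" + option


print(add_connect_option(""))             # ;ConnectOnOpen=true
print(add_connect_option(";Timeout=30"))  # ;Timeout=30;ConnectOnOpen=true
```
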

Fixes

None.

January 12, 2021

Release 7.10

Features

In-app chat: Have a question about the product? Use the new in-app chat feature to explore content or ask a question to our support staff. If you need assistance, please reach out!

...

Changes

IP address range whitelist:

...

  • NOTE: When enabled, users are listed according to their email addresses.
  • NOTE: For 
    D s product
    productgdple
     , this feature is disabled by default.
  • For more information on enabling or disabling this feature, see  Changes to Configuration.

Deprecated

None.

Known Issues

None.

Fixes

TD-53527:  When importing a dataset via API that is sourced from a BZIP file stored on S3, the columns may not be properly split when the platform is permitted to detect the structure.

December 14, 2020

Release 7.9

Features

In-app chat: Have a question about the product? Use the new in-app chat feature to explore content or ask a question to our support staff. If you need assistance, please reach out!

...

Changes

Changes to permissions: The set of required and optional permissions has changed for 

D s product
productgdppr
.

...

  • The Optimizer service optimizes query execution against data sources to minimize use of Dataprep by Trifacta resources, reduce compute costs, and improve overall job execution time.
  • No configuration is required.
  • D s ed
    editionsgdppr, gpdst
    oneLinetrue
  • You can apply optimizations for individual flows. For more information, see Flow Optimization Settings Dialog.

Deprecated

None.

Known Issues

None.

Fixes

TD-53475:  Missing associated artifact error when importing a flow.

November 17, 2020

Release 7.8

Features

Plans:

...

For more information, see API Workflow - Swap Datasets.

Changes

JDBC connection pooling disabled: The ability to create connection pools for JDBC-based connections has been disabled. It is likely to be removed in a future release.

D s ed
editionsgdppr
oneLinetrue

...

Salesforce connector disabled temporarily: In Release 7.8, the Salesforce connector has been disabled temporarily. In a future release, it will be replaced with an improved version of the Salesforce connector.

D s ed
editionsgdppr
oneLinetrue

Deprecated

None.

Known Issues

TD-55503: When you swap datasets via API, existing samples are not discarded. These samples are invalid.

  • Workaround: This issue does not occur if you swap datasets through the application. If it does occur via API, you can collect a new sample manually. See Samples Panel.

Fixes

TD-53318: Cannot publish results to relational targets when flow name or output filename or table name contains a hyphen (e.g. my - filename.csv).

October 2, 2020

Release 7.6 push 3

Features

None.

Changes

Disabled Optimizer Service: In the September release, the Optimizer service was introduced, which enabled users to apply advanced physical and logical optimizations for flow and job executions. Recently, an issue was discovered, which has caused us to disable the service temporarily.

  • This issue affected a very small number of users who were using the new feature. Now that the feature is disabled, impacted users should experience impacts only to performance of flow and job executions. Performance should be similar to pre-release of the service.
  • The Optimizer service was disabled through a configuration change that did not require any service interruptions. Users should not experience any loss of functionality or availability due to the work to resolve this issue.
  • The Engineering team is actively working to resolve the issue. Thank you for your patience. If you have further questions, please contact Support.

Deprecated

None.

Known Issues

None.

Fixes

None.

October 1, 2020

Release 7.6 push 3

Features

None.

Changes

Shared VPCs: Across all product editions, you can now run jobs through another project by specifying a full URL for the shared VPC.

  • Previously, this capability was only available for 
    D s product
    productgdppr
    . This restriction has been lifted. It is now available for 
    D s product
    productgdpst
     and 
    D s product
    productgdple
    , too.
  • For more information on applying the shared VPC to a job, see Dataflow Execution Settings.
  • For more information on applying a shared VPC to jobs in the project, see Execution Settings Page.

    NOTE: You must create new or replace output objects to use these shared VPC settings across your project.


Deprecated

None.

Known Issues

None.

Fixes

None.

September 21, 2020

Release 7.6

Features

Plans:

  • Apply overrides to recipe parameters for your plans. See Plan View Page.

    D s ed
    editionsgdppr

...

  • Additional connect string options and troubleshooting information have been included for specific relational connections.
  • D s ed
    editionsgdppr,gdpst
    oneLinetrue
  • For more information, see Connection Types.

Changes

None.

Deprecated

None.

Known Issues

None.

Fixes

TD-52559:  When publishing a single CSV file with headers using the append or overwrite publishing action, multiple instances of the header may be written in the output file.

TD-48915: Inserting special characters in an output filename results in a validation error in the application and job failures.

August 4, 2020

Release 7.5

Features

New Flow View is now available:

...

Changes

Delete Table IAM permission no longer required

...

  • Please add the following range of IP addresses to the whitelist for the services for access to relational datasources in your enterprise:

    34.68.114.64/28


  • D s ed
    editionsgdppr
    oneLinetrue
  • For more information, see Getting Started with Dataprep by Trifacta.
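When reviewing firewall or whitelist rules, it can help to confirm which addresses the 34.68.114.64/28 range covers. The sketch below uses Python's standard `ipaddress` module; the helper name `is_service_address` is hypothetical.

```python
import ipaddress

# Range published in this release note.
SERVICE_RANGE = ipaddress.ip_network("34.68.114.64/28")


def is_service_address(address: str) -> bool:
    """True if the address falls inside the published range."""
    return ipaddress.ip_address(address) in SERVICE_RANGE


print(SERVICE_RANGE[0], SERVICE_RANGE[-1], SERVICE_RANGE.num_addresses)
# 34.68.114.64 34.68.114.79 16
```
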

Deprecated

None.

Known Issues

TD-50942: If a flow is unshared with you, you cannot see or access the datasources for any jobs that you have already run on the flow. You can still access the job results.

Fixes

TD-49559:  Cannot select and apply custom data types through column Type menu.

...

TD-34840: Platform fails to provide suggestions for transformations when selecting keys from an object with many of them.

July 7, 2020

Release 7.1

Features

Introducing

D s product
productgdppr
and
D s product
productgdpst
:
You can now upgrade your existing
D s product
productgdp
projects to unlock advanced features, such as broader API access and relational connectivity. To see the full set of new capabilities and use cases, see https://www.trifacta.com/products/pricing/cloud-dataprep/.

...

Dataflow execution in non-local VPC: You can now execute your Dataflow jobs on a non-local or shared virtual private cloud (VPC).

  • NOTE: To accommodate a wider range of shared VPC configurations, subnetworks must be specified by full URL. See Changes below.
  • Project owners can set these execution options for the entire project. See Execution Settings Page.

Changes

Subnetwork specified by URL: When you specify the subnetwork in which to execute your Dataflow jobs, you must now specify the subnetwork using a URL.

  • Tip: This feature can be used when Dataprep by Trifacta is configured to run Dataflow jobs within a shared VPC hosted in a project other than the current project.
  • Previously, you could specify the subnetwork by name. However, non-local subnetwork values could not be specified in this manner.
  • For more information, see Dataflow Execution Settings.
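As an illustration of a subnetwork given by full URL, the sketch below builds the Compute Engine resource URL for a subnetwork in a host project. The helper name and the project, region, and subnetwork values are hypothetical; confirm the exact form that the product expects in Dataflow Execution Settings.

```python
def subnetwork_url(host_project: str, region: str, subnetwork: str) -> str:
    """Build the Compute Engine resource URL for a subnetwork."""
    return (
        "https://www.googleapis.com/compute/v1/"
        f"projects/{host_project}/regions/{region}/subnetworks/{subnetwork}"
    )


# Hypothetical values for a shared VPC hosted in another project.
print(subnetwork_url("my-host-project", "us-central1", "shared-subnet"))
```

A name alone, such as `shared-subnet`, cannot identify the host project, which is why a non-local subnetwork requires the longer form.
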

Deprecated

None.

Known Issues

Soft validation errors during job execution for imported flow: If you import a flow from

D s product
productgdppr
into
D s product
productgdpst
, you may encounter soft validation errors during job execution. 

  • If the imported flow uses custom VPC mode, then the job execution may fail, since the network may be inaccessible.
  • Workaround: 1) set the VPC mode to Auto or 2) set accessible VPC options before you run your job. See Dataflow Execution Settings.

Fixes

None.

June 4, 2020

Release 7.1

Features

Flow parameters: Create flow parameters that you can reference in the recipes of your flow. 

...

Changes

Parameter overrides: If you have upgraded to Release 7.1 or later, any parameter overrides that you have specified in your flows must be re-applied. For more information, see Manage Parameters Dialog.

...

  • Workflow documentation is still available with the product documentation. For more information, see API Reference.

Deprecated

Send a Copy: You can no longer send a copy of a flow to another user.

...

  • New method: Please use the /v4/jobGroups endpoint to run and re-run jobs.
  • For more information, see API Reference.
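A request to the /v4/jobGroups endpoint might be assembled as sketched below. Several details are assumptions to verify against the API Reference: the base URL is a placeholder, the bearer-token header is one common form of token-based authentication, and the request body shape (a `wrangledDataset` object with an `id`) is not specified in these notes. The sketch only builds the request; it does not send it.

```python
import json
import urllib.request


def build_run_job_request(base_url: str, access_token: str,
                          wrangled_dataset_id: int) -> urllib.request.Request:
    """Build (but do not send) a POST request to the jobGroups endpoint."""
    # Assumed body shape; check the API Reference for the actual schema.
    body = json.dumps({"wrangledDataset": {"id": wrangled_dataset_id}})
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v4/jobGroups",
        data=body.encode("utf-8"),
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {access_token}",  # assumed auth scheme
        },
    )


# Placeholder host, token, and dataset identifier.
request = build_run_job_request("https://example.com", "MY_TOKEN", 7)
print(request.get_method(), request.full_url)
# POST https://example.com/v4/jobGroups
```

To submit the job, pass the request to `urllib.request.urlopen` and read the JSON response.
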

Known Issues

TD-49559: Cannot select and apply custom data types through column Type menu.

...

  • Workaround: Remove the space in the filename and upload again.

Fixes

None.

April 16, 2020

Release 6.8.2

Features

Advanced Dataflow Settings:



Changes

None.

Deprecated

None.

Fixes

TD-47149: Cannot edit settings when importing Google Sheets.

Known Issues

None.

February 14, 2020

Release 6.8.1-push2

This is the initial release of 

D s product
productgdppr
rtrue
.

Features


Changes

None.

Deprecated

None.

Fixes

TD-40348: When loading a recipe in an imported flow that references an imported Excel dataset, Transformer page displays Input validation failed: (Cannot read property 'filter' of undefined) error, and the screen is blank.

Known Issues

None.

February 12, 2020

Release 6.8.1

Features

Macros:

...

New functions:

Changes

Browser Support Policy:

  • For supported browsers, at the time of release, the latest stable version and the two previous stable versions are supported.

  • NOTE: Stable browser versions released after a given release of Dataprep by Trifacta will NOT be supported for any prior version of Dataprep by Trifacta. A best effort will be made to support newer versions released during the support lifecycle of the release.

...

Import/Export: Flows can now be exported and imported across products and versions of products.

Deprecated

Re-run jobs using Dataflow templates:

  • In prior releases, you could re-run a Dataprep by Trifacta job by configuring the Dataflow template with input and output parameters for the job.
  • As of this release, Dataprep by Trifacta will continue to generate Dataflow templates, but they are no longer recommended for use in programmatic execution of Dataflow jobs.
  • Instead, you can now run jobs and monitor them through exposed API endpoints. For more information, see API Reference.
  • Support for Dataflow templates will be decommissioned in March 2020 (previously planned for December 2019).

Known Issues

TD-47263: Importing an exported flow that references a Google Sheets or Excel source breaks connection to input source.

  • Workaround: If the importing user has access to the source, the user can re-import the dataset and then swap the source for the broken recipe.

...

TD-45122: API: re-running job using only the wrangleDataset identifier fails even if the original job succeeds when writeSettings were specified.

  • Workaround: Use a full jobGroups job specification each time that you run a job. See operation/runJobGroup in the API Reference.


Fixes

TD-44548: RANGE function returns null values if more than 1000 values in output.

...

TD-42080: Cannot run flow that contains more than 10 recipe jobs

September 16, 2019

Release 6.4.1

Features

Introducing APIs: Manage job execution via API.

  • Dataprep by Trifacta now supports API endpoints for programmatic execution and monitoring of jobs. Beginning in this release, you can use token-based security to manage the launching and execution of Dataprep by Trifacta jobs. For more information, see API Reference.
  • This API should be used as a replacement for Dataflow templates for programmatic invocation of jobs. In addition, this feature includes support for dynamic functions and input & output destinations.
  • NOTE: Dataflow templates generated by Dataprep by Trifacta are still supported but are no longer recommended for use.

Changes

None.

Deprecated

Re-run jobs using Dataflow templates:

  • In prior releases, you could re-run a Dataprep by Trifacta job by configuring the Dataflow template with input and output parameters for the job.
  • As of this release, Dataprep by Trifacta will continue to generate Dataflow templates, but they are no longer recommended for use in programmatic execution of Dataflow jobs.
  • Instead, you can now run jobs and monitor them through exposed API endpoints. For more information, see API Reference.
  • Support for Dataflow templates will be decommissioned in March 2020 (previously planned for December 2019).

Known Issues

TD-43284: When running a job via API, you cannot apply setting overrides, parameter values, or other execution settings as part of the job definition in the API. The job will be executed with the settings, parameter values and execution settings defined in the UI for that job.

  • You can run the job using default settings only through the API. Please use the Run Job UI in order to change the default settings.
  • For more information, see operation/runJobGroup in the API Reference.


Fixes

None.

September 11, 2019

Release 6.4.1

Features

Introducing recipe macros: User-defined macros enable saving and reusing sequences of steps. For more information, see Overview of Macros.

...

Broader support for metadata references: For Excel files, $filepath references now return the location of the source Excel file. Sheet names are appended to the end of the reference. See Source Metadata References.


Changes

PNaCl browser extension no longer supported: Please verify that all users of Dataprep by Trifacta are using a supported version of Google Chrome, which automatically enables use of WebAssembly. For more information, see Browser Requirements.

...

Documentation errata: In prior releases, the documentation listed UTF32-BE and UTF32-LE as supported file encodings. These encodings are not supported. Documentation has been updated to correct this error. See Supported File Encoding Types.

Deprecated

None.

Known Issues

None.

Fixes

TD-40424: UTF-32BE and UTF-32LE are available as supported file encoding options. They do not work.

  • NOTE:  Although these options are available in the application, they have never been supported in the underlying platform. They have been removed from the interface.

...