Page tree

Trifacta Dataprep



Contents:

If you licensed Dataprep by Trifacta before Oct. 14, 2020, you are using the Dataprep by Trifacta Legacy product edition. On October 14, 2022, this product edition will be decommissioned by Google and will be no longer available for use. Current customers of this product edition are encouraged to transition to one of the product editions hosted by Trifacta. See Product Editions.

   

Contents:


This section covers key known limitations of Dataprep by Trifacta®

NOTE: This list of limitations should not be considered complete.

General Limitations

Data Volume

The Trifacta application applies no fixed limits to the number of columns or rows that can be handled during transformation.

NOTE: During transformation, Dataprep by Trifacta is designed to process data volumes of any size.

However, some important considerations:

Soft row limits

  • The number of rows that you see within the Trifacta application in the currently selected sample is determined by:
    • Maximum permitted sample size stored on the base storage layer
    • Currently configured sample size for the current recipe

See Sampling below.

Soft column limits

  • Soft row limits do not affect the number of columns that are displayed. All available and visible columns are displayed. The number of rows may be affected by the number of columns. 

    Tip: Avoid creating and working with datasets that are wider than 1000 columns. Datasets that are wider than this recommendation may result in performance impacts in the Trifacta application.

  • The number of columns may be limited by:
    • Number of columns permitted in the source datastore. 
    • For SQL-based datastores, limits may be placed on the length of individual queries.

Sampling

  • All values displayed or generated in the application are based on the currently displayed sample. 
    • Transforms that generate new data may not factor values that are not present in the current sample.
    • When the job is executed, transforms are applied across all rows and values in the source data.
    • Transforms that make changes based on data values, such as header and valuestocols, will still be configured according to sample data at the time of that the step was added, instead at execution time. For example, all of the values detected in the sample are used to determine the columns of a valuestocols transform step based on the selected sample when the step was added.
  • Random samples are derived from up to the first 1 GB of the source file. 
    • Data from later parts of a multi-part file may not be included in the sample.

Internationalization

  • The product supports a variety of global file encoding types for import. 

  • Within the application, UTF-8 encodings are displayed. 
    • Limited set of characters allowed in column names.
    • Header does not support all UTF-8 characters.
    • Emoji are not supported in data wrangling operations.
    • Umlauts and other international characters are not supported when filtering datasets in browsers of external datastores.
  • States and Zip Code Column Types and the corresponding maps in visual profiling apply only to the United States.
  • UTF-8 is generated in output.
  • UTF-32 encoding is not supported

NOTE: Some functions do not correctly account for multi-byte characters. Multi-byte metadata values may not be consistently managed.

Size Limits

Upload File Size Limits

  • Maximum upload size for a file is 1 GB.


Limitations for Dataprep by Trifacta Enterprise Edition

  • Sort transform is not supported.
  • User-defined functions are not supported.
  • Custom dictionaries and custom data types are not supported.
  • Job cancellation is not supported.
  • Collaborative suggestions are not supported.

Limitations for Dataprep by Trifacta Professional Edition

All of the limitations for Dataprep by Trifacta Enterprise Edition, plus the following:

  • Use of Companion Service Accounts is not supported.
  • Premium-level support is not available.

Limitations for Dataprep by Trifacta Starter Edition

All of the limitations for Dataprep by Trifacta Professional Edition, plus the following:

  • Data quality rules are not available.
  • Breadth of relational connectivity is limited.
  • Scheduling and plan management are not supported.
  • Email alerts and webhook monitoring are not supported.
  • Limited access to APIs
  • Support is limited to Community-based support.

Limitations for Dataprep by Trifacta Premium

  • Sort transform is not supported.
  • User-defined functions are not supported.
  • Custom dictionaries and custom data types are not supported.
  • Except for BigQuery, global connections available to all users is not supported.

Limitations for Dataprep by Trifacta Standard

  • Sort transform is not supported.
  • User-defined functions are not supported.
  • Custom dictionaries and custom data types are not supported.

Limitations for Dataprep by Trifacta

  • Sort transform is not supported.
  • User-defined functions are not supported.
  • Custom dictionaries and custom data types are not supported.
  • Integrations with datastores other than BigQuery, Cloud Storage, and the local filesystem are not supported.
  • User access to administrator functions is not supported. Configuration of features, including enablement of them, is not supported. Features are either available or not in this product edition.
  • Sharing is not supported.







Other Limitations

  • File Formats: Limitations may apply to individual file formats. See Supported File Formats.
  • Data Type Conversions: There are some limitations on how data types are converted during import or export/publication. See Type Conversions.

This page has no comments.