Skip to main content

Transformation Reference

This section contains reference information on the transformations available in Dataprep by Trifacta.

Tip

Use the values in the Title column as search strings in the Search panel to begin specifying these transformations.

Name

Title

Description

scaleminmax

Scale to min max

Scale a column to a specific min max range. See Prepare Data for Machine Processing.

onehotencode

One hot encode

Create a column for each unique value indicating its presence or absence. See Prepare Data for Machine Processing.

scalestandardize

Scale to mean

Scale a column to zero mean and unit variance. See Prepare Data for Machine Processing.

bincolumn

Bin column

Bin values into ranges of equal or custom size. See Prepare Data for Machine Processing.

changetype

Change column type

Changes the data type of a column [settype]. See Change Column Data Type.

comment

Comment

Adds a comment to your recipe [comment]. See Add Comments to Your Recipe.

conditions

Conditional column

Returns values based on conditions such as if-then-else or case statements. See Apply Conditional Transformations.

convertpattern

Convert patterns

Finds one or more patterns or text literals and replaces them with specified pattern values. See Standardize Using Patterns.

countmatches

Count matches

Counts the number of matches [countpattern]. See Compute Counts.

countmatchesbetween

Count matches between delimiters

Counts the number of matches [countpattern]. See Compute Counts.

deduplicate

Remove duplicate rows

Removes duplicate rows where values in every column are the same. See Deduplicate Data.

derive

New formula

Creates a new column with the result of a formula.

drop

Delete columns

Delete one or more columns. See Remove Data.

extractbetweendelimiters

Extract between delimiters

Extracts text found between two patterns. See Extract Values.

extractcustom

Extract text or pattern

Extracts text found between two patterns. Variant: Custom text or pattern. See Extract Values.

extractfirstcharacters

Extract first

Extracts text according to its position. Variant: Extract the first n characters. See Extract Values.

extractkv

Convert key/value to Object

Extracts key-value pairs into an Object [extractkv]. See Extract Values.

extractlastcharacters

Extract last

Extracts key-value pairs into an Object [extractkv]. Variant: Extract the last n characters. See Extract Values.

extractlist

Extract matches to Array

Extracts a list into an Array [extractlist]. See Extract Values.

extractrangeofcharacters

Extract between positions

Extracts text according to its position. Variant: Extract the last n characters. See Extract Values.

extractmismatched

Extract mismatched

Extracts a list into an Array [extractlist]. Variant: The data type to match against. See Extract Values.

extractnumbers

Extract numbers

Extracts a list into an Array [extractlist]. Variant: Extract numbers from a text. See Extract Values.

extractquerystrings

Extract query strings

Extracts a list into an Array [extractlist]. Variant: Extract fields from an URL query string. See Extract Values.

filtercontains

Filter contains

Filter rows that satisfy a condition. Variant: Filter rows that contain a specified value or pattern. See Filter Data.

filtercustom

Filter custom formula

Filter rows that satisfy a condition. Variant: Filter rows that satisfy an arbitrary formula. See Filter Data.

filterendswith

Filter ends with

Filter rows that satisfy a condition. Variant: Filter rows that ends with a specified value or pattern. See Filter Data.

filterexactly

Filter exact

Filter rows that satisfy a condition. Variant: Filter rows that match exactly a specified value. See Filter Data.

filternot

Filter not equals

Filters rows that do not satisfy a condition. See Filter Data.

filterfromtop

Filter from top

Filter rows by their position. Variant: Filter rows from the top. See Filter Data.

filtergreaterthan

Filter greater than

Filter rows that satisfy a condition. Variant: Filter rows with values greater than (or equal to) a specified value. See Filter Data.

filterinterval

Filter at interval

Filter rows by their position. Variant: . Variant: The size of the interval to filter rows at. See Filter Data.

filterlessthan

Filter less than

Filter rows that satisfy a condition. Variant: Filter rows with values less than (or equal to) a specified value. See Filter Data.

filtermissing

Filter missing

Filter rows that satisfy a condition. Variant: Filter rows with missing values. See Remove Data.

filtermismatched

Filter mismatched

Filter rows that satisfy a condition. Variant: Filter rows with mismatched values. See Filter Data.

filteroneof

Filter in

Filter rows that satisfy a condition. Variant: Filter rows that match any of the specified values. See Filter Data.

filterrange

Filter range

Filter rows by their position. Variant: Filter rows within a range. See Filter Data.

filterstartswith

Filter starts with

Filter rows that satisfy a condition. Variant: Filter rows that starts with a specified value or pattern. See Filter Data.

flatten

Expand Array to rows

Converts each element in an Array into a new row. See Working with Arrays.

groupby

Group by

Group data and perform aggregated calculations on it. See Create Aggregations.

join

Join datasets

Adds additional columns from other data sources [join]. See Join Window.

locktype

Lock type

Lock column to current type.

lowercase

Lowercase text

Format text in columns. Variant: Convert text in column to lowercase. See Modify String Values.

leftpad

Pad with leading

Format text in columns. Variant: Add the necessary number of characters to each value to make them of the same length. See Modify String Values.

merge

Merge columns

Concatenates the values from two or more columns into a new column [merge]. See Add Two Columns.

move

Move columns

Moves one or more columns before or after another column [move]. See Move Columns.

nest

Nest columns

Converts columns into an Object or Array [nest]. See Working with Arrays.

pivot

Pivot

Creates a new column for each unique value in a column [pivot]. See Pivot Data.

prefix

Prefix text

Format text in columns. Variant: Specify a prefix to be added at the beginning of each selected column name. See Modify String Values.

propercase

Propercase text

Format text in columns. Variant: Convert text in column to ProperCase. See Modify String Values.

removesymbols

Remove symbols in text

Format text in columns. Variant: Remove all non-alphanumerical characters from the text. See Remove Data.

removewhitespace

Remove whitespace in text

Format text in columns. Variant: Remove all whitespace found in the text. See Remove Data.

removeaccents

Remove accents in text

Remove accent marks from text. See Modify String Values.

rename

Rename columns

Renames one or more columns [rename]. See Rename Columns.

renamepattern

Rename with pattern

Renames one or more columns [rename]. See Rename Columns.

renameprefix

Rename with prefix

Renames one or more columns [rename]. See Rename Columns.

renameheader

Rename with row(s

Renames one or more columns [rename]. See Rename Columns.

renamesuffix

Rename with suffix

Renames one or more columns [rename]. See Rename Columns.

renameupper

Rename to UPPERCASE

Renames one or more columns [rename]. See Rename Columns.

renamelower

Rename to lowercase

Renames one or more columns [rename]. See Rename Columns.

renamekeepleft

Rename from beginning

Renames one or more columns [rename]. See Rename Columns.

renamekeepright

Rename from end

Renames one or more columns [rename]. See Rename Columns.

renamesanitize

Rename by removing special characters

Renames one or more columns [rename]. See Rename Columns.

replacecell

Replace cells

Renames one or more columns [rename]. See Rename Columns.

replacepattern

Replace text or pattern

Replace text matching a pattern. See Replace Cell Values.

replacebetweenpatterns

Replace between delimiters

Replace text between delimiters. Variant: Replace text between delimiters. See Replace Cell Values.

replacebetweenpositions

Replace between positions

Replace text between delimiters. Variant: Replaces text based on position. See Replace Cell Values.

replacemismatched

Replace mismatched

Replace mismatched values. See Replace Cell Values.

replacemissing

Replace missing

Replace missing values. See Replace Cell Values.

select

Select

Create a new table of columns Selectd from your current dataset. See Select.

set

Edit with formula

Sets the values of one or more columns to the result of a formula [set].

sort

Sort rows

Sorts the rows based on the values in one or more columns.

splitondelimiter

Split on text or pattern

Split by delimiter. Variant: Text or pattern. See Split Column.

splitbetweendelimiters

Split between delimiters

Split by delimiter. Variant: Between two delimiters. See Split Column.

splitmultipledelimiters

Split with multiple delimiters

Split by delimiter. Variant: By multiple delimiters. See Split Column.

splitpositions

Split at positions

Split by character position. Variant: By positions. See Split Column.

splitevery

Split at interval

Split by character position. Variant: At regular interval. See Split Column.

splitbetweenpositions

Split between positions

Split by character position. Variant: Between two positions. See Split Column.

splitrows

Split into rows

Splits raw data into rows [splitrows]. See Split Column.

suffix

Suffix text

Format text in columns. Variant: Specify a suffix to be added to the end of each selected column name. See Modify String Values.

trimwhitespace

Trim whitespace

Format text in columns. Variant: Remove all whitespaces found at the beginning and end of the text. See Modify String Values.

trimquotes

Trim quotes

Format text in columns. Variant: Remove quotes found at the beginning and end of the text. See Modify String Values.

udf

Invoke external function

Creates a new column with the result of an external function.

Note

This feature may not be available in all product editions. For more information on available features, see Compare Editions.

Note

This transformation requires additional configuration.

uppercase

Uppercase text

Format text in columns. Variant: Convert text in column to UPPERCASE. See Modify String Values.

dateformat

Date format

Change format for Datetime columns. See Format Dates.

union

Union datasets

Adds additional rows from other data source [union]. See Union Page.

standardize

Standardize column

Single-column standardization for standardizing column values. See Standardize Page.

columnbyexample

Create column from examples

Create a new column by providing example values. See Create Column by Example.

unlocktype

Unlock type

Unlock column type.

unnest

Unnest elements

Extracts elements from an Object or Array into columns. See Working with Arrays.

unpivot

Unpivot

Turns columns into rows. Produces a key column with unnested values. See Pivot Data.

valuestocols

Convert values to columns

Creates a new column for each unique value in a column [valuestocols]. See Pivot Data.

window

Window

Performs row-based calculations across multiple ordered rows [window]. See Window Functions.

sourcerownumber

sourcerownumber

Generate a new column containing the row number for each row from the source, if available. See Source Metadata References.

filepath

filepath

Generate a new column containing the path to the source file, if available. See Source Metadata References.