Release 7.6.2


Extracts the ranked unique Datetime value from the values in a column, where k=1 returns the maximum value. The value for k must be between 1 and 1000, inclusive. Inputs must be Datetime.

For purposes of this calculation, two instances of the same value are treated as the same value of k. For example, if your dataset contains four rows with column values 2020-02-15, 2020-02-14, 2020-02-14, and 2020-02-13, then KTHLARGESTUNIQUEDATE returns 2020-02-14 for k=2 and 2020-02-13 for k=3.
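The ranking rule can be illustrated outside of Wrangle. The following Python snippet is a minimal sketch of the behavior described above, not the product's implementation; the helper name kth_largest_unique_date is hypothetical, and the sample values are four dates in which 2020-02-14 appears twice.

```python
from datetime import date

def kth_largest_unique_date(values, k):
    """Return the k-th largest distinct date, where k=1 is the maximum.

    Hypothetical helper that mirrors the ranking described in the text;
    it is not part of Wrangle.
    """
    # Collapse duplicates first, then rank from latest to earliest.
    ranked = sorted(set(values), reverse=True)
    return ranked[k - 1]

rows = [date(2020, 2, 15), date(2020, 2, 14), date(2020, 2, 14), date(2020, 2, 13)]
second = kth_largest_unique_date(rows, 2)
third = kth_largest_unique_date(rows, 3)
print(second, third)  # 2020-02-14 2020-02-13
```

Because the duplicate 2020-02-14 occupies a single rank, k=3 reaches 2020-02-13 instead of returning 2020-02-14 again.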

When used in a pivot transform, the function is computed for each instance of the value specified in the group parameter. See Pivot Transform.
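As a rough illustration of per-group evaluation, the Python sketch below computes the ranked unique date separately for each group value. The function name and sample rows are made up for this example, and applying the minimum-value fallback within each group is an assumption, not documented pivot behavior.

```python
from collections import defaultdict
from datetime import date

def kth_largest_unique_date_by_group(rows, k):
    """rows: iterable of (group_value, date) pairs. Returns {group_value: date}."""
    by_group = defaultdict(set)
    for group_value, d in rows:
        by_group[group_value].add(d)  # a set drops duplicate dates within a group
    result = {}
    for group_value, dates in by_group.items():
        ranked = sorted(dates, reverse=True)
        # Assumption: a group with fewer than k distinct dates yields its minimum.
        result[group_value] = ranked[min(k, len(ranked)) - 1]
    return result

sample = [
    ("A", date(2020, 3, 28)),
    ("A", date(2020, 3, 28)),
    ("A", date(2020, 3, 23)),
    ("B", date(2020, 3, 20)),
    ("B", date(2020, 3, 10)),
]
per_group = kth_largest_unique_date_by_group(sample, 2)
print(per_group)
```

Each group is ranked independently, so group A's repeated 2020-03-28 does not affect the result for group B.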

The input column must be of Datetime type. Values of other types in the column are ignored. If a row contains a missing or null value, it is not factored into the calculation.

For a version of this function that applies to non-Datetime values, see KTHLARGESTUNIQUE Function.

Wrangle vs. SQL: This function is part of Wrangle, a proprietary data transformation language. Wrangle is not SQL. For more information, see Wrangle Language.


Basic Usage

kthlargestuniquedate(myDate, 3)

Output: Returns the third highest unique value from the myDate column.

Syntax and Arguments

kthlargestuniquedate(function_col_ref, k_integer) [group:group_col_ref] [limit:limit_count]


Argument          Required?  Data Type           Description
function_col_ref  Y          string              Name of column to which to apply the function
k_integer         Y          integer (positive)  The ranking of the unique value to extract from the source column

For more information on the group and limit parameters, see Pivot Transform.

For more information on syntax standards, see Language Documentation Syntax Notes.

function_col_ref

Name of the column from which to extract the ranked unique value. Inputs must be Datetime values.

  • Literal values are not supported as inputs.
  • Multiple columns and wildcards are not supported.

Usage Notes:

Required?  Data Type                  Example Value
Yes        String (column reference)  transactionDate

k_integer

Integer representing the ranking of the unique value to extract from the source column. Duplicate values are treated as a single value for purposes of this function's calculation.

NOTE: The value for k must be an integer between 1 and 1,000 inclusive.

  • k=1 represents the maximum value in the column.
  • If k is greater than or equal to the number of values in the column, the minimum value is returned.
  • Missing and null values are not factored into the ranking of k.
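The rules in this list can be sketched in Python as follows. This is an illustration under stated assumptions rather than the product's implementation: the function name is hypothetical, None stands in for a missing value, raising ValueError for an out-of-range k is an illustrative choice, and returning None for a column with no usable values is an assumption.

```python
from datetime import date

def kth_largest_unique_date(values, k):
    """Hypothetical helper mirroring the k_integer rules listed above."""
    if not isinstance(k, int) or not 1 <= k <= 1000:
        raise ValueError("k must be an integer between 1 and 1000, inclusive")
    # Missing values (None here) and non-date values are left out of the ranking.
    ranked = sorted({v for v in values if isinstance(v, date)}, reverse=True)
    if not ranked:
        return None  # assumption: nothing to rank
    # A k beyond the number of distinct values falls back to the minimum.
    return ranked[min(k, len(ranked)) - 1]

column = [date(2020, 3, 28), None, date(2020, 3, 24),
          date(2020, 3, 28), "n/a", date(2020, 3, 23)]
print(kth_largest_unique_date(column, 1))   # 2020-03-28
print(kth_largest_unique_date(column, 2))   # 2020-03-24
print(kth_largest_unique_date(column, 50))  # 2020-03-23
```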

Usage Notes:

Required?  Data Type           Example Value
Yes        Integer (positive)  4


Examples


Tip: For additional examples, see Common Tasks.

Example - KTHLARGESTDATE functions

This example illustrates how you can use the ranked date functions, with and without uniqueness and conditions, to extract date values from a column:
  • KTHLARGESTDATE - Extracts the ranked Datetime value from the values in a column, where k=1 returns the maximum value. See KTHLARGESTDATE Function.
  • KTHLARGESTUNIQUEDATE - Extracts the unique ranked Datetime value from the values in a column, where k=1 returns the maximum value. See KTHLARGESTUNIQUEDATE Function.
  • KTHLARGESTDATEIF - Extracts the ranked Datetime value from the values in a column that meet a specified condition. See KTHLARGESTDATEIF Function.
  • KTHLARGESTUNIQUEDATEIF - Extracts the ranked unique Datetime value from the values in a column that meet a specified condition. See KTHLARGESTUNIQUEDATEIF Function.

Source:

Here is some example transaction data:

Date       Product   Units  UnitCost  OrderValue
3/28/2020  ProductA  4      10.00     40.00
3/8/2020   ProductB  4      20.00     80.00
3/12/2020  ProductC  2      30.00     60.00
3/23/2020  ProductA  1      10.00     10.00
3/20/2020  ProductB  2      20.00     40.00
3/12/2020  ProductC  9      30.00     270.00
3/28/2020  ProductA  5      10.00     50.00
3/23/2020  ProductB  8      20.00     160.00
3/16/2020  ProductC  9      30.00     270.00
3/8/2020   ProductA  5      10.00     50.00
3/10/2020  ProductB  3      20.00     60.00
3/13/2020  ProductC  1      30.00     30.00
3/12/2020  ProductA  7      10.00     70.00
3/10/2020  ProductB  7      20.00     140.00
3/24/2020  ProductC  9      30.00     270.00
3/15/2020  ProductA  8      10.00     80.00
3/10/2020  ProductB  5      20.00     100.00
3/10/2020  ProductC  4      30.00     120.00


Transformation:

The following transformation computes the third highest date in the Date column:

Transformation Name         New formula
Parameter: Formula type     Single row formula
Parameter: Formula          kthlargestdate(Date, 3)
Parameter: New column name  'kthlargestdate'

This transformation computes the third highest unique value in the Date column:

Transformation Name         New formula
Parameter: Formula type     Single row formula
Parameter: Formula          kthlargestuniquedate(Date, 3)
Parameter: New column name  'kthlargestuniquedate'

The following transformation calculates the third highest date value among rows where OrderValue > 200:

Transformation Name         New formula
Parameter: Formula type     Single row formula
Parameter: Formula          kthlargestdateif(Date, 3, OrderValue > 200)
Parameter: New column name  'kthlargestdateif'

The following transformation calculates the third highest unique date value among rows where OrderValue > 200:

Transformation Name         New formula
Parameter: Formula type     Single row formula
Parameter: Formula          kthlargestuniquedateif(Date, 3, OrderValue > 200)
Parameter: New column name  'kthlargestuniquedateif'

Results: 

Date       Product   Units  UnitCost  OrderValue  kthlargestdate  kthlargestuniquedate  kthlargestdateif  kthlargestuniquedateif
3/28/2020  ProductA  4      10.00     40.00       03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/8/2020   ProductB  4      20.00     80.00       03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/12/2020  ProductC  2      30.00     60.00       03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/23/2020  ProductA  1      10.00     10.00       03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/20/2020  ProductB  2      20.00     40.00       03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/12/2020  ProductC  9      30.00     270.00      03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/28/2020  ProductA  5      10.00     50.00       03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/23/2020  ProductB  8      20.00     160.00      03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/16/2020  ProductC  9      30.00     270.00      03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/8/2020   ProductA  5      10.00     50.00       03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/10/2020  ProductB  3      20.00     60.00       03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/13/2020  ProductC  1      30.00     30.00       03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/12/2020  ProductA  7      10.00     70.00       03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/10/2020  ProductB  7      20.00     140.00      03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/24/2020  ProductC  9      30.00     270.00      03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/15/2020  ProductA  8      10.00     80.00       03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/10/2020  ProductB  5      20.00     100.00      03-24-2020      03-23-2020            03-23-2020        03-23-2020
3/10/2020  ProductC  4      30.00     120.00      03-24-2020      03-23-2020            03-23-2020        03-23-2020
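The difference between the kthlargestdate and kthlargestuniquedate columns can be checked with a short Python sketch over the Date values from the source table. This only illustrates the ranking logic under the semantics described on this page; the helper names are hypothetical and the code is not Wrangle.

```python
from datetime import date

# Day-of-month values from the Date column of the source table (all March 2020).
days = [28, 8, 12, 23, 20, 12, 28, 23, 16, 8, 10, 13, 12, 10, 24, 15, 10, 10]
dates = [date(2020, 3, d) for d in days]

def kth_largest(values, k):
    # Duplicates each occupy their own rank.
    return sorted(values, reverse=True)[k - 1]

def kth_largest_unique(values, k):
    # Duplicates share a single rank.
    return sorted(set(values), reverse=True)[k - 1]

print(kth_largest(dates, 3))         # 2020-03-24
print(kth_largest_unique(dates, 3))  # 2020-03-23
```

3/28/2020 appears twice, so it fills ranks 1 and 2 in the non-unique ranking and 3/24/2020 is third. In the unique ranking, 3/28/2020 fills only rank 1, which moves 3/23/2020 into third place.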

