Page tree

Release 9.2


Contents:

   

This example demonstrates you to extract values from one column of an array into a new column.

Functions:

ItemDescription
LIST Function Extracts the set of values from a column into an array stored in a new column. This function is typically part of an aggregation. 
UNIQUE Function Extracts the set of unique values from a column into an array stored in a new column. This function is typically part of an aggregation. 
DATEFORMAT Function Formats a specified Datetime set of values according to the specified date format. Source values can be a reference to a column containing Datetime values.


You have the following set of orders for two months, and you are interested in identifying the set of colors that have been sold for each product for each month and the total quantity of product sold for each month. 

Source:

OrderIdDateItemQtyColor
10011/15/15Pants1red
10021/15/15Shirt2green
10031/15/15Hat3blue
10041/16/15Shirt4yellow
10051/16/15Hat5red
10061/20/15Pants6green
10071/15/15Hat7blue
10084/15/15Shirt8yellow
10094/15/15Shoes9brown
10104/16/15Pants1red
10114/16/15Hat2green
10124/16/15Shirt3blue
10134/20/15Shoes4black
10144/20/15Hat5blue
10154/20/15Pants6black

Transformation:

To track by month, you need a column containing the month value extracted from the date:

Transformation Name Edit column with formula
Parameter: Columns Date
Parameter: Formula DATEFORMAT(Date, 'MMM yyyy')

You can use the following transform to check the list of unique values among the colors:

Transformation Name Pivot columns
Parameter: Row labels Date
Parameter: Values unique(Color, 1000)
Parameter: Max number of columns to create 10

Dateunique_Color
Jan 2015["green","blue","red","yellow"]
Apr 2015
["brown","blue","red","yellow","black","green"]

Delete the above transform.

You can aggregate the data in your dataset, grouped by the reformatted Date values, and apply the LIST function to the Color column. In the same aggregation, you can include a summation function for the Qty column:

Transformation Name Pivot columns
Parameter: Row labels Date
Parameter: Values list(Color, 1000),sum(Qty)
Parameter: Max number of columns to create 10

Results:

Datelist_Colorsum_Qty
Jan 2015["green","blue","blue","red","green","red","yellow"]28
Apr 2015
["brown","blue","red","yellow","black","blue","black","green"]
38

See Also for EXAMPLE - LIST and UNIQUE Function:

This page has no comments.