This example shows how you can
break out a column of nested values into separate rows and columns of data.
The following data covers magazine subscriptions for individual customers. Their subscriptions are stored in an array of values. You are interested in who is subscribing to each magazine.
|Anne Aimes||["Little House and Garden","Sporty Pants","Life on the Range"]|
|Barry Barnes||["Sporty Pants","Investing Smart"]|
|Cindy Compton||["Cakes n Pies","Powerlifting Plus","Running Days"]|
|Darryl Diaz||["Investing Smart","Cakes n Pies"]|
When this data is loaded into the Transformer, you might need to apply a
header to it. If it is in CSV format, you might need to apply some
replace transforms transformations to clean up the
Subscriptions column so it looks like the above.
Subscriptions column contains cleanly formatted arrays, the column is re-typed as Array type. You can then apply the
flatten transformthe following transformation:
Subscription combination is now written to a separate row. You can use this new data structure to break out instances of magazine subscriptions. Using the following transformtransformation, you can add the corresponding
CustId value to the column:
Delete the two source columns:
|D s also|