Page tree

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 37 Next »


Contents:

Our documentation site is moving!

For up-to-date documentation of Designer Cloud on AWS, please visit us at https://help.alteryx.com/AWS/.

   

This example demonstrates how to use quote parameter for more sophisticated splitting of columns of data using the split transform.

Source:

In this example, the following CSV data, which contains contact information, is imported into the application:

LastName,FirstName,Role,Company,Address,Status
Wagner,Melody,VP of Engineering,Example.com,"123 Main Street, Oakland, CA 94601",Prospect
Gruber,Hans,"Director, IT",Example.com,"456 Broadway, Burlingame, CA, 94401",Customer
Franks,Mandy,"Sr. Manager, Analytics",Tricorp,"789 Market Street, San Francisco, CA, 94105",Customer

Transformationn:

When this data is pulled into the application, some initial parsing is performed for you:

column2column3column4column5column6column7
LastNameFirstNameRoleCompanyAddressStatus
WagnerMelodyVP of EngineeringExample.com"123 Main Street, Oakland, CA 94601"Prospect
GruberHans"Director, IT"Example.com"456 Broadway, Burlingame, CA, 94401"Customer
FranksMandy"Sr. Manager, Analytics"Tricorp"789 Market Street, San Francisco, CA, 94105"Customer

When you open the Recipe Panel, you should see the following transforms:

Transformation Name Split into rows
Parameter: Column column1
Parameter: Split on \n
Parameter: Ignore matches between \"
Parameter: Quote escape character \"

Transformation Name Split column
Parameter: Column column1
Parameter: Option On pattern
Parameter: Match pattern ','
Parameter: Number of Matches 5
Parameter: Ignore matches between \"

The first transform splits the raw source data into separate rows in the carriage return character (\r), ignoring all values between the double-quote characters. Note that this value must be escaped. The double-quote character does not require escaping. While there are no carriage returns within the actual data, the application recognizes that these double-quotes are identifying single values and adds the quote value.

The second transform splits each row of data into separate columns. Since it is comma-separated data, the application recognizes that this value is the column delimiter, so the on value is set to the comma character (,). In this case, the quoting is necessary, as there are commas in the values in column4 and column6, which are easy to clean up.

To finish clean up of the dataset, you can promote the first row to be your column headers:

Transformation Name Rename column with row(s)
Parameter: Option Use row(s) as column names
Parameter: Type Use a single row to name columns
Parameter: Row number 1

You can remove the quotes now. Note that the following applies to two columns:

Transformation Name Replace text or patterns
Parameter: Column Address,Role
Parameter: Find '\"'
Parameter: Replace ''
Parameter: Match all occurrences true

Now, you can split up the Address column. You can highlight one of the commas and the space after it in the column, but make sure that your final statement looks like the following:

Transformation Name Split column
Parameter: Column column1
Parameter: Option On pattern
Parameter: Match pattern ','
Parameter: Number of Matches 2

Notice that there is some dirtiness to the resulting Address3 column:

Address3
CA 94601
CA, 94401
CA, 94105

Use the following to remove the comma. In this case, it's important to leave the space between the two values in the column, so the on value should only be a comma. Below, the width value is two single quotes:

Transformation Name Replace text or patterns
Parameter: Column Address3
Parameter: Find ','
Parameter: Replace ''
Parameter: Match all occurrences true

You can now split the Address3 column on the space delimiter:

Transformation Name Split by delimiter
Parameter: Column Address3
Parameter: Option by delimiter
Parameter: Delimiter ' '
Parameter: Number of columns to create 2

Results:

After you rename the columns, you should see the following:

LastNameFirstNameRoleCompanyAddressCityStateZipcodeStatus
WagnerMelodyVP of EngineeringExample.com123 Main StreetOaklandCA94601Prospect
GruberHansDirector, ITExample.com456 BroadwayBurlingameCA94401Customer
FranksMandySr. Manager, AnalyticsTricorp789 Market StreetSan FranciscoCA94105Customer

See Also for EXAMPLE - Quote Parameter:

Error rendering macro 'contentbylabel'

parameters should not be empty

  • No labels

This page has no comments.