Trifacta Dataprep




If you licensed Dataprep by Trifacta before Oct. 14, 2020, you are using the Dataprep by Trifacta Legacy product edition. On October 14, 2022, this product edition will be decommissioned by Google and will no longer be available for use. Current customers of this product edition are encouraged to transition to one of the product editions hosted by Trifacta. See Product Editions.

   

This example shows how the CHAR function converts numeric index values to Unicode characters and how the UNICODE function converts characters back to numeric values.

Functions:

| Item | Description |
| --- | --- |
| CHAR Function | Generates the Unicode character corresponding to an inputted Integer value. |
| UNICODE Function | Generates the Unicode index value for the first character of the input string. |
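
For readers who know a general-purpose language better, the two functions behave roughly like Python's built-in `chr` and `ord`. The sketch below is only an analogy, not how Dataprep implements them; the helper names `char_fn` and `unicode_fn` are hypothetical.

```python
def char_fn(index: int) -> str:
    """Rough analog of CHAR: Unicode code point -> one-character string."""
    return chr(index)


def unicode_fn(text: str) -> int:
    """Rough analog of UNICODE: code point of the first character only."""
    return ord(text[0])


print(char_fn(65))          # A
print(unicode_fn("Apple"))  # 65 (only the leading "A" is considered)
```

Only the first character of the string contributes to the result, which matches the description of UNICODE above.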

Source:

The following column contains some source index values:

index
1
33
33.5
34
48
57
65
90
97
122
254
255
256
257
9998
9999

Transformation:

When the above values are imported to the Transformer page, the column is typed as Integer, with a single mismatched value (33.5). To see the Unicode characters that correspond to these index values, enter the following transformation:

| Transformation Name | New formula |
| --- | --- |
| Parameter: Formula type | Single row formula |
| Parameter: Formula | CHAR(index) |
| Parameter: New column name | 'char_index' |

To see how these characters map back to the index values, now add the following transformation:

| Transformation Name | New formula |
| --- | --- |
| Parameter: Formula type | Single row formula |
| Parameter: Formula | UNICODE(char_index) |
| Parameter: New column name | 'unicode_char_index' |

Results:

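| index | char_index | unicode_char_index |
| --- | --- | --- |
| 1 | (non-printing control character) | 1 |
| 33 | ! | 33 |
| 33.5 | | |
| 34 | " | 34 |
| 48 | 0 | 48 |
| 57 | 9 | 57 |
| 65 | A | 65 |
| 90 | Z | 90 |
| 97 | a | 97 |
| 122 | z | 122 |
| 254 | þ | 254 |
| 255 | ÿ | 255 |
| 256 | Ā | 256 |
| 257 | ā | 257 |
| 9998 | ✎ | 9998 |
| 9999 | ✏ | 9999 |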

Note that the floating-point input value (33.5) was not processed: both output columns are empty for that row.
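
The round trip in the results can be approximated outside Dataprep with a short Python sketch. It assumes Python's `chr` and `ord` as stand-ins for CHAR and UNICODE, and it assumes that a non-integer input yields empty outputs, as in the 33.5 row; the function name `round_trip` is hypothetical.

```python
from typing import Optional, Tuple, Union

Number = Union[int, float]


def round_trip(value: Number) -> Tuple[Optional[str], Optional[int]]:
    """Return (character, code point), or (None, None) for non-integer input."""
    # Assumption: mismatched (non-integer) values are skipped, mirroring 33.5.
    if isinstance(value, bool) or not isinstance(value, int):
        return None, None
    ch = chr(value)      # analog of CHAR(index)
    return ch, ord(ch)   # analog of UNICODE(char_index)


# Index values as they appear in the results.
source = [1, 33, 33.5, 34, 48, 57, 65, 90, 97, 122,
          254, 255, 256, 257, 9998, 9999]

rows = [(v, *round_trip(v)) for v in source]
for index, ch, code in rows:
    print(index, repr(ch), code)
```

For every integer input, the third value equals the first, which is the property the results illustrate.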
