Page tree


Contents:

   

Removes all characters from a string that are not letters, numbers, accented Latin characters, or whitespace.

NOTE: Non-Latin letters are also removed.

Tip: This function also removes common punctuation, such as the following:

. , ! & ?

To preserve these characters, you might replace them with an alphanumeric text string. For example, the question mark might be replaced by:

zzQUESTIONMARKzz

After the function has been applied, you can replace these strings with the original values.

Wrangle vs. SQL: This function is part of Wrangle, a proprietary data transformation language. Wrangle is not SQL. For more information, see Wrangle Language.

 

Basic Usage

Column reference example:

removesymbols(MyName)

Output: Returns the value in the MyName column value with all non-alphanumeric characters removed. 

String literal example:

removesymbols('Héllõ, Wõrlds!?!?')

Output: Returns the string: Héllõ Wõrlds.

Wildcard example:

removesymbols($col)

Output: Strips all non-alphanumeric or space characters from all columns in the dataset.

Syntax and Arguments

removesymbols(column_string)


ArgumentRequired?Data TypeDescription
column_stringYstringName of the column or string literal to be applied to the function

For more information on syntax standards, see Language Documentation Syntax Notes.

column_string

Name of the column or string constant to be trimmed of symbols.

  • Missing string or column values generate missing string results.
  • String constants must be quoted ('Hello, World').
  • Multiple columns and wildcards are not supported.

Usage Notes:

Required?Data TypeExample Value
YesString literal or column referencemyColumn

Examples


Tip: For additional examples, see Common Tasks.

Example - String cleanup functions together

This example demonstrates functions that can be used to clean up strings.

Functions:

ItemDescription
TRIM Function Removes leading and trailing whitespace from a string. Spacing between words is not removed.
REMOVEWHITESPACE Function Removes all whitespace from a string, including leading and trailing whitespace and all whitespace within the string.
REMOVESYMBOLS Function Removes all characters from a string that are not letters, numbers, accented Latin characters, or whitespace.

Source:

In the following (space) and (tab) indicate space keys and tabs, respectively. Carriage return and newline characters are also supported by whitespace functions.

Stringssource
String01this source(space)(space)
String02(tab)(tab)this source
String03(tab)(tab)this source(space)(space)
String04this source's?
String05Why, you @#$%^&*()!
String06this söurce
String07(space)this söurce
String08à mañana

Transformation:

The following transformation steps generate new columns using each of the string cleanup functions:

Transformation Name New formula
Parameter: Formula type Single row formula
Parameter: Formula TRIM(source)
Parameter: New column name 'trim_source'

Transformation Name New formula
Parameter: Formula type Single row formula
Parameter: Formula REMOVEWHITESPACE(source)
Parameter: New column name 'removewhitespace_source'

Transformation Name New formula
Parameter: Formula type Single row formula
Parameter: Formula REMOVESYMBOLS(source)
Parameter: New column name 'removesymbols_source'

Results:

Stringssourceremovesymbols_sourceremovewhitespace_sourcetrim_source
String01this source(space)(space)  this source(space)(space)  thissourcethis source
String02(tab)(tab)this source(tab)(tab)this sourcethissourcethis source
String03(tab)(tab)this source(space)(space)(tab)(tab)this source(space)(space)thissourcethis source
String04this source's?this sourcesthissource's?this source's?
String05"Why, you @#$%^&*()!"Why you Why,you@#$%^&*()!Why, you @#$%^&*()!
String06this söurcethis söurcethissöurcethis söurce
String07(space)this söurce(space)this söurcethissöurcethis söurce
String08à mañanaà mañanaàmañanaà  mañana

See Also for EXAMPLE - String Cleanup Functions:

This page has no comments.