Contents:
Basic Usage
You can filter your dataset based on the following condition types:
Example - is missing
filter missing: qty action: Drop
qty
column is missing.Example - is mismatched
filter col: CoName mismatched: 'String' action: Drop
CoName
column does not match the String data type.Example - is exactly
filter col: basic exactly: find(basic, '545', true, 1) == 8 action: Keep
545
appears at the eighth character in the basic
column.Example - is one of
filter col: zipCode oneOf: '94104','94105' action: Keep
zipCode
column is either 94104
or 94105
and deletes all other rows in the dataset.Example - Less than (or equal to)
filter col: row_number lessThanEqual: 5 action: Keep
row_number
column is less than or equal to 5
. All other rows are deleted.Example - Greater than (or equal to)
filter col: row_number greaterThanEqual: 10 action: Drop
row_number
is greater than or equal to 10
.Example - Is Between
filter col: row_number greaterThan: 5 lessThanEqual: 15 action: Keep
row_number
value is greater than 5
or less than or equal to 15
. All other rows are deleted.Example - Contains
filter col: phoneNum contains: `\({digit}{3}\)` action: Keep
phoneNum
value contains a three-digit pattern surrounded by parentheses (XXX)
. All other rows are deleted.Example - Starts with
filter col: phoneNum startsWith: '(981)' action: Keep
phoneNum
value begins with (981)
. All other rows are deleted.Example - Ends with
filter col: zipCode endsWith: `\-{digit}{4}` action: Drop
zipCode
value ends with a four-digit extension.Example - custom formula
filter row: (row_number >= 25 && firstName == 'Steve') action: Keep
row_number
value is greater than or equal to 25 and the firstName
value is Steve
. All other rows are deleted.
Syntax and Parameters
filter col:column_ref [missing: column_ref] [exactly: expression_ref] [mismatched: 'data_type_str'] [exactly: expression] [oneOf: 'string_1','string_2'] [lessthan | lessThanEqual: numVal] [greaterthan | greaterThanEqual: numVal] [contains: string_or_pattern] [startsWith|endsWith: string_or_pattern] action: [Drop|Keep]
Token | Required? | Data Type | Description |
---|---|---|---|
filter | Y | transform | Name of the transform |
row | N | string | Expression identifying the row or rows to filter. If expression evaluates to true for a row, the row is either kept or deleted. |
col | N | string | Name of the column or expression for columns to drop |
missing | N | string | Name of column to evaluate for missing values. |
mismatched | N | string | String literal for the data type to check for mismatches. |
exactly | N | string | String literal, Alteryx pattern, or regular expression that evaluates to an exact match for a row value in the specified column. |
oneOf | N | string | List of string literals, any of which can be matched. |
lessThan or lessThanEqual | N | integer, decimal, or expression | Integer or decimal literal or expression evaluating to numeric value below which results in a match. Can also match on the specified expression exactly. Parameter is also used for the Between condition type. |
greaterThan or greaterThanEqual | N | integer, decimal, or expression | Integer or decimal literal or expression evaluating to numeric value above which results in a match. Can also match on the specified expression exactly. Parameter is also used for the Between condition type. |
contains | N | string | String literal, Alteryx pattern, or regular expression to be matches somewhere within the specified column values. |
startsWith | N | string | String literal, Alteryx pattern, or regular expression to match the beginnings of the values in the specified column. |
endsWith | N | string | String literal, Alteryx pattern, or regular expression to match the endings of the values in the specified column. |
action | Y | string | Drop or Keep the listed columns |
For more information on syntax standards, see Language Documentation Syntax Notes.
row
true
or false
.Examples:
Expression | Description |
---|---|
Score >= 50 | true if the value in the Score column is greater than 50. |
LEN(LastName) > 8 | true if the length of the value in the LastName column is greater than 8. |
ISMISSING([Title]) | true if the row value in the Title column is missing. |
ISMISMATCHED(Score,['Integer']) | true if the row value in the Score column is mismatched against the Integer data type. |
Example:
delete row: (lastContactDate < 01/01/2010 || status == 'Inactive')
Output: Deletes any row in the dataset where the lastContactDate
is before January 1, 2010 or the status is Inactive
.
Usage Notes:
Required? | Data Type |
---|---|
Yes | Expression that evaluates to true or false |
col
Identifies the column or columns to which to apply the transform. You can specify one column or more columns.
Usage Notes:
Required? | Data Type |
---|---|
Yes | String (column name) |
missing
For the Is Missing condition type, this value specifies the column to check for missing values.
Usage Notes:
Required? | Data Type |
---|---|
Required for Is Missing condition type only | String (column name) |
mismatched
For the Mismatched condition type, this value specifies string for the data type identifier value to check for mismatches. For more information, see Valid Data Type Strings.
The col
parameter is also required.
Usage Notes:
Required? | Data Type |
---|---|
Required for Mismatched condition type only | String (data type identifier) |
exactly
For the Exactly condition type, this value is a String literal, Alteryx pattern, or regular expression that exactly matches row values in the specified column.
The col
parameter is also required.
Usage Notes:
Required? | Data Type |
---|---|
Required for Exactly condition type only | String (expression) |
oneOf
For the One Of condition type, this value is a list of string literals, Alteryx patterns, or regular expressions. If a row value for the specified column matches one of these expressions, the row is either deleted or kept.
The col
parameter is also required.
Usage Notes:
Required? | Data Type |
---|---|
Required for One Of condition type only | List of string literals, Alteryx patterns, or regular expressions |
lessThan or lessThanEqual
For the Less Than conditional types, this value is an Integer or Decimal literal or an expression that evaluates to an Integer or Decimal literal. If the value in the specified column is less than (or optionally equal to) this value, then the row is either deleted or kept.
The col
parameter is also required.
Usage Notes:
Required? | Data Type |
---|---|
Required for Less than (or equal to) condition type only | Integer or Decimal literal or expression evaluating to one of these data types |
greaterThan or greaterThanEqual
For the Less Than conditional types, this value is an Integer or Decimal literal or an expression that evaluates to an Integer or Decimal literal. If the value in the specified column is greater than (or optionally equal to) this value, then the row is either deleted or kept.
The col
parameter is also required.
Usage Notes:
Required? | Data Type |
---|---|
Required for Greater than (or equal to) condition type only | Integer or Decimal type or expression evaluating to one of these data types |
contains
For the Contains condition type, this value identifies a String literal, Alteryx pattern, or regular expression, which is used to evaluate partial or full matches to row values in the specified column.
The col
parameter is also required.
Usage Notes:
Required? | Data Type |
---|---|
Required for Contains condition type only | String literal, Alteryx pattern, or regular expression |
startsWith
For the Starts With condition type, this value identifies the String literal, Alteryx pattern, or regular expression with which a value must start in the specified column to match.
The col
parameter is also required.
Usage Notes:
Required? | Data Type |
---|---|
Required for Starts with condition type only | String literal, Alteryx pattern, or regular expression |
endsWith
For the Ends With condition type, this value identifies the String literal, Alteryx pattern, or regular expression with which a value must end in the specified column to match.
The col
parameter is also required.
Usage Notes:
Required? | Data Type |
---|---|
Required for Ends with condition type only | String literal, Alteryx pattern, or regular expression |
action
Identifies whether the action performed by the transform:
Drop
- Listed columns are dropped from the dataset.Keep
- Listed columns are retained in the dataset, and all other columns are dropped.
Usage Notes:
Required? | Data Type |
---|---|
Yes | String (Drop or Keep ) |
Tip: For additional examples, see Common Tasks.
Examples
See above.
This page has no comments.