In general terms, a null value is a definition that points to nothing. A container for a value, such as a row-column combination or a variable, exists, but the container points to no actual value.

Important notes on null values

NOTE: In the platform, null values are a subset of the category identifying missing values. For technical reasons, however, displays null values as missing values and visually treats them as the same. Internally, they are understood to be different values.

Implications:

NOTE: When a recipe containing a user-defined function is applied to text data, any null characters cause records to be truncated by the running environment during Photon job execution. In these cases, please execute the job in the Spark running environment.

Locate null values

Null values are displayed with missing values in the Missing values category of the data quality bar (in black).

You can use the following transform to distinguish between null and missing values. This transform generates a new column of values, which are set to true if the value in isActive is a null value:

Tip: You can use this transform and a subsequent sorting step on the generated column to filter for null values.

High percentage of nulls

On import, if a column has a high enough percentage of null values, the platform may retype the column as a String column, which may yield mismatched values in addition to the missing values that were imported from null values.

Fix null values

See Find Missing Data.

Null values in transformations

Functions:

Transforms:

Write null values

If needed, you can write a null value to a set of data. In the following example, all missing values in a column are replaced by nulls, using the NULL function.

NOTE: The NULL function is typically used to pass null values into functions that have been designed to specifically address them.

The following example tests all columns in the range between column1 and column255 for whether a missing value is detected. If so, a null value is written. Otherwise, the column value is written back to the column: 

The above transform writes null values, but these values are converted to missing values on export.