This page describes the statistical information available for individual columns of data.
Below, you can review general statistics maintained for each data type, followed by breakdowns of statistics for each specific type of data.
For any selection of values in a column, the following counts are generally available.
|Valid Values||Count of values that are valid for the column's data type|
|Unique Values||Count of unique values. Duplicate values are not counted.|
Count of values that qualify as outliers. An outlier value is either:
|Mismatched Values||Count of values that do not confirm to the column's data type. For example, an Integer column with a value of "MISSING" results in a mismatched value.|
|Missing Values||Count of values that are not populated|
These statistics are available for most types of data through the Column Browser.
|Minimum||Lowest value in the column|
|Lower Quartile||The median of the lower half of values (25th percentile)|
The middle value of the selected set. For example, in a set of 21 values, the median value is the 11th value in ascending order.
|Upper Quartile||The median of the upper half of values (75th percentile)|
|Maximum||Highest value in the column|
|Average||Average value in the column|
|Standard Deviation||The computed standard deviation for the selected values.|
The following functions can be applied to a set of columnar data taken from one or more columns in your dataset. Unless otherwise noted, these functions apply to numeric data. For more information, see Aggregate Functions.