The bar chart at the top of each column in the Transformer page, called a histogram, characterizes the data in that column. Each column histogram displays the count of each detected value in the column (for string data) or the count of values within a numeric range (for number data). |
You can use this histogram to identify unusual values or outlier values, which should be removed or corrected.
NOTE: Counts in a column histogram reflect only the data in the sample in the data grid. Counts in the entire dataset may differ. |
Column Histogram |
Tip: When you resize the width of a column, the number of bars displayed in the column histogram changes accordingly. You can use this dynamic resizing to change the granularity displayed in histograms. |
The contents of the column histogram vary depending on the data type for the column. For example:
Tip: If you hover over a bar in the histogram, you can review specific values, the count of that value, and the percentage that value represents of the total count of values in the column. |
When you select values:
CTRL
- click to select multiple discrete values.