Data Health

Use Data Health to check on the health of your data. You can use the tool with the feature-engineering tools, like Build Features and Feature Types, to improve the health of your data. To determine how healthy your data is, the tool analyzes missing values, outliers, and sparsity.
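To make the three measures concrete, here is a minimal Python sketch of how missing values, outliers, and sparsity could be quantified for a single numeric column. It is not the tool's scoring method, which this page does not describe. The function name `column_health_metrics`, the 1.5 × IQR outlier rule, and the definition of sparsity as the share of zeros are all assumptions chosen for illustration.

```python
from statistics import quantiles


def column_health_metrics(values):
    """Return missing, outlier, and sparsity fractions for one numeric column.

    Illustrative assumptions (not the Data Health tool's definitions):
    - None marks a missing value.
    - Outliers are values outside 1.5 * IQR from the quartiles.
    - Sparsity is the share of zeros among the values that are present.
    """
    total = len(values)
    present = [v for v in values if v is not None]
    missing_fraction = (total - len(present)) / total if total else 0.0

    outlier_fraction = 0.0
    if len(present) >= 4:
        # quantiles(..., n=4) returns the three quartile cut points.
        q1, _, q3 = quantiles(present, n=4)
        iqr = q3 - q1
        low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
        outlier_count = sum(1 for v in present if v < low or v > high)
        outlier_fraction = outlier_count / len(present)

    sparsity = (
        sum(1 for v in present if v == 0) / len(present) if present else 0.0
    )
    return {
        "missing": missing_fraction,
        "outliers": outlier_fraction,
        "sparsity": sparsity,
    }


metrics = column_health_metrics([0, 0, 1, 2, 3, 4, 5, None, None, 1000])
print(metrics)
```

Each value is a fraction between 0 and 1, so a column with many missing entries, extreme values, or zeros produces higher numbers on the corresponding measure.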

Alteryx Intelligence Suite Required

This tool is part of Alteryx Intelligence Suite. Intelligence Suite requires a separate license and an add-on installer for Designer. After you install Designer, install Intelligence Suite and start your free trial.

Tool Components

The Data Health tool has 4 anchors.

  1. Input anchor: The input anchor connects to the data whose health you want to check.

  2. S output anchor: The S output anchor passes each column's associated data-health scores downstream.

  3. R output anchor: The R output anchor passes a comprehensive report about the data's health downstream. You can view the report using a Browse tool.

  4. O output anchor: The O output anchor passes the outliers from the data downstream.

Configure the Tool

To use the Data Health tool, configure 2 options: the scale used to score the health of the data, and whether the output includes recommendations for improving the health of the data.

Important

To use this tool, the upstream data must have at least 30 rows.

1. Scale

From the dropdown, choose whether you want to output scores on a percentage scale (0–100%) or a normalized scale (0–1). The tool generates the score by assessing missing values, outliers, and sparsity.
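The two scales express the same score in different units. As a rough sketch, the Python snippet below converts a score held on the normalized scale to whichever scale is selected. The function name `format_score` and the scale labels `"percentage"` and `"normalized"` are hypothetical and are not identifiers from the tool.

```python
def format_score(normalized_score, scale="percentage"):
    """Express a 0-1 score on the chosen scale.

    `scale` is a hypothetical label: "percentage" maps to 0-100,
    "normalized" leaves the score in the 0-1 range.
    """
    if not 0.0 <= normalized_score <= 1.0:
        raise ValueError("normalized_score must be between 0 and 1")
    if scale == "percentage":
        return normalized_score * 100.0
    if scale == "normalized":
        return normalized_score
    raise ValueError(f"unknown scale: {scale!r}")


print(format_score(0.5, "percentage"))
print(format_score(0.5, "normalized"))
```

The choice affects only how the score is displayed downstream, so pick the scale that matches the other metrics in your workflow.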

2. Output Recommendations Based On Score

Check the box if you want the tool to recommend ways to improve the health of the data, based on the score the data receives. The recommendation appears as an additional column in the dataset.