The data grid in the Transformer Page displays how your current recipe applies to the data in your currently selected sample.
Data Grid Panel
Click column headings to review a visual profile of the column's data and a set of suggestions for transformations to apply to the column.
These columns appear in the context panel on the right side of the screen.
Tip: Keep clicking columns. You can select multiple columns to prompt for another set of applicable suggestions.
Select specific values in a column for suggestions on those strings.
NOTE: Values in a cell cannot exceed 25,000 characters in length.
Tip: If you select a single value in the data grid, the suggestion cards suggest operations specific to that string. If you multi-select multiple values, the suggestions can apply any pattern shared between the values. For example, selecting "
Tip: If the contents of a cell are too large for the display, you can click the Caret ( > ) icon to the right of the cell value in the data grid to display the entire contents of the cell.
Add or Edit:
You can reorder the rows based on the values in a column. From the Column menu, select Edit Column > Sort. For more information, see Column Menus.
At the top of the data grid, you can use the toolbar to quickly build common transformations, filter the display, and other operations. See Transformer Toolbar.
Below the data grid, you can review summary information about the data in your currently selected sample.
Sample Status Bar
Click the Eye icon to open the Visible Columns panel, where you can toggle the display of individual columns. For more information, see Visible Columns Panel.
The status bar contains metrics about the current dataset sample for the currently selected recipe step.
Tip: Before you begin transforming your data, you might want to verify the columns and count of data types against the data before it was imported. If there are discrepancies, you might want to investigate the differences. Unless your sample includes the entire dataset, row counts should differ.
NOTE: In the running environment, results can differ between executions of the same recipe due to its parallel execution and data limiting within the Transformer page. In particular, joins with multiple matches per key can sometimes cause a difference in the number of reported rows when the job is re-executed.
When transformation steps are previewed, you can use these checkboxes to display only the previewed changes for affected rows, columns, or both.
Tip: These options assist in narrowing the data grid display to only the steps affected by the current recipe step.
In a wide dataset, click the Find icon in the Transformer toolbar to locate the column of interest.
Find column search bar
Column header, data quality bar, and histogram
Identifies the selected data type, which can be inferred by the application based on the contents of the column. Click the icon to change the data type.
See Supported Data Types.
|Column name||To change the column name, select Rename... from the column menu.|
|Column menu||Depending on the column data type, you can select from a set of predefined recipe steps in the column menu under the caret on the right side of the menu. See Column Menus.|
|Data quality bar|
The horizontal line shows valid, missing, and mismatched values in the column compared to the column's data type.
See Data Quality Bars.
For each column, you can see the range and frequency of values in the column.
See Column Histograms.
You can click and drag to select values in a column:
On the left side of the screen, you can see a column of black dots. If you hover over one of these, you can see the current row number and, if the information is still available, the row number for the row from the original source data. These values apply only to the sample in the current dataset.
Tip: To review the original row number for a row, hover over the black dot in the data grid. These values can be referenced using the
From the Filters drop-down, you can define filters to apply to columns, rows, or both in the data grid. See Filter Panel.
Before a transformation in development has been added to the recipe, a preview of the results is generated in the data grid. See Transform Preview.
When a target has been assigned to your recipe, you can review the column names and data types that are expected for the target in the Target Matching bar above the column histograms.
Target Matching Bar
In the Target Matching bar, you can review how the target above matches the current recipe below. For each column, matching assesses:
Tip: Two solid green schema tags indicate a perfect match based on the above conditions.
For more information on the schema tags, see Column Browser Panel.