Comment: Published by Scroll Versions from space DEV and version next

...

When a dataset sample is first loaded into the Transformer page, the product attempts to split the unstructured data into regular, tabular data. If your data appears to contain a header row, that row can be used for the column titles.

Figure: Transformer page
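The splitting behavior described above can be illustrated with a minimal sketch. This is not the product's actual logic: the function names, the comma delimiter, and the header heuristic (a first row with no numeric cells) are all assumptions made for illustration.

```python
import csv
import io


def _is_number(text):
    """Return True if the cell text parses as a number."""
    try:
        float(text)
        return True
    except ValueError:
        return False


def split_into_table(raw_text, delimiter=","):
    """Split raw delimited text into (column titles, data rows).

    Hypothetical heuristic: the first row is treated as a header
    when none of its cells parse as a number.
    """
    rows = [row for row in csv.reader(io.StringIO(raw_text), delimiter=delimiter) if row]
    if not rows:
        return [], []
    first = rows[0]
    if all(not _is_number(cell) for cell in first):
        # The first row looks like a header: use it for the column titles.
        return first, rows[1:]
    # No header detected: fall back to generated titles (column1, column2, ...).
    titles = [f"column{i + 1}" for i in range(len(first))]
    return titles, rows
```

With the sample text `"ZIP,CITY\n21201,Baltimore\n"`, the first row becomes the column titles. If the first row contains a numeric cell, generated titles are used instead and every row is kept as data.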

...

After you have removed unused data, you can examine the quality of data within each column just below the column title. 

Figure: Column header with data quality bar

...

Color    Description
green    These values are valid for the column data type.
red      These values do not match those of the column type.
gray     There are no values for the column in these rows.
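The three categories in the data quality bar can be sketched as a simple classification. The following is an illustrative sketch only, assuming an integer column. The function names and the sample values are hypothetical and are not taken from the product.

```python
from collections import Counter


def is_integer(text):
    """Hypothetical validity check for an integer column."""
    try:
        int(text)
        return True
    except ValueError:
        return False


def classify_value(value, is_valid):
    """Classify one cell as 'missing', 'valid', or 'mismatched'."""
    if value is None or str(value).strip() == "":
        return "missing"
    return "valid" if is_valid(str(value).strip()) else "mismatched"


def quality_summary(values, is_valid):
    """Count how many values fall into each data quality category."""
    return Counter(classify_value(v, is_valid) for v in values)


# Hypothetical sample values for an age column.
ages = ["34", "", "forty", "51", None]
summary = quality_summary(ages, is_integer)
```

The relative sizes of the counts in `summary` correspond to the proportions shown in a data quality bar: valid values, values that do not match the column type, and rows with no value.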

...

In the following example, the missing values in the SUBSCRIBER_AGE column have been selected, and a set of suggestion cards is displayed. 

Figure: Selecting missing values

...

Just below a column's data quality bar, you can review a histogram of the values found in the column. In the following example, the data histogram on the left applies to the ZIP column, while the one on the right applies to the WEB_CHAT_ID column.

Figure: Column data histogram
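As a rough illustration of what a column histogram summarizes, the sketch below counts how often each value occurs and renders a text bar per value. It is an assumption-laden example, not the product's implementation. The function name, the `top` and `width` parameters, and the sample ZIP values are hypothetical.

```python
from collections import Counter


def value_histogram(values, top=5, width=20):
    """Return text bars for the most frequent values in a column."""
    most_common = Counter(values).most_common(top)
    if not most_common:
        return []
    peak = most_common[0][1]  # count of the most frequent value
    lines = []
    for value, count in most_common:
        # Scale each bar relative to the most frequent value.
        bar = "#" * max(1, round(width * count / peak))
        lines.append(f"{value:<10} {bar} {count}")
    return lines


# Hypothetical sample values for a ZIP column.
for line in value_histogram(["21201", "21201", "21202"], width=4):
    print(line)
```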

...

In the following example, the improperly capitalized word BALTIMORE has been selected so that you can change it to its proper-case spelling (Baltimore). The matching rows are highlighted in the row data, and a set of suggestions for fixing the value is provided in the cards at the bottom of the screen. See Selection Details Panel.

Figure: Selecting values to modify
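The effect of such a fix can be sketched outside the product as follows. This is a minimal sketch and does not use the product's transform syntax. The function name and the CITY column name are hypothetical.

```python
def fix_capitalization(rows, column, target):
    """Return new rows in which cells equal to `target` are converted to proper case."""
    fixed = []
    for row in rows:
        new_row = dict(row)  # copy so the input rows are left unchanged
        if new_row.get(column) == target:
            new_row[column] = target.title()  # "BALTIMORE" -> "Baltimore"
        fixed.append(new_row)
    return fixed


# Hypothetical rows; CITY is an assumed column name.
rows = [{"CITY": "BALTIMORE"}, {"CITY": "Towson"}]
cleaned = fix_capitalization(rows, "CITY", "BALTIMORE")
```

Only the cells that exactly match the selected value are changed. Other values in the column are left as they are.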

...

When you modify a transform step, you can make changes in the Transform Builder, which is a simple, menu-driven interface for modifying your transformations:

Figure: Modifying steps in the Transform Builder

...

  • If your dataset is small enough, the sample is the entire dataset. 
  • For larger datasets, the product auto-generates an initial data sample from the first rows of your dataset.
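The sampling behavior in the list above can be sketched as taking the first rows of a dataset up to a limit. The `max_rows` value here is a hypothetical parameter; the source does not state the actual sample size the product uses.

```python
from itertools import islice


def initial_sample(rows, max_rows=1000):
    """Return the first `max_rows` rows; a small dataset is returned in full."""
    return list(islice(rows, max_rows))


small = initial_sample(range(5), max_rows=10)         # whole dataset fits in the sample
large = initial_sample(iter(range(100)), max_rows=3)  # only the first rows are kept
```

Because `islice` stops reading after `max_rows` items, the sketch never loads the rest of a large dataset.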

...