
You may want to generate a profile of the data that you imported from the source. As soon as you create a recipe from a source, you can run a job to profile the dataset.

By profiling the data as soon as you load it into the Transformer page, you can do the following:

  • Identify problems in the source and potentially correct them in the source system.
  • Create a baseline to evaluate the data wrangling work you do in the product.
  • Identify mismatched or missing values.
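To make the idea of missing and mismatched values concrete, here is a minimal Python sketch that counts them per column in a small CSV sample. It is not how the product computes its profile; the function names (`profile_column`, `profile_csv`, `is_integer`), the sample data, and the rule that empty cells count as missing are all assumptions made for illustration.

```python
import csv
import io


def is_integer(text):
    """Return True if the text parses as an integer (hypothetical type check)."""
    try:
        int(text)
        return True
    except ValueError:
        return False


def profile_column(values, is_valid):
    """Count valid, mismatched, and missing values in one column."""
    counts = {"valid": 0, "mismatched": 0, "missing": 0}
    for value in values:
        # Assumption: an absent or empty cell counts as missing.
        if value is None or value.strip() == "":
            counts["missing"] += 1
        elif is_valid(value):
            counts["valid"] += 1
        else:
            counts["mismatched"] += 1
    return counts


def profile_csv(text, validators):
    """Profile each column named in `validators` using its check function."""
    rows = list(csv.DictReader(io.StringIO(text)))
    return {
        column: profile_column([row.get(column) for row in rows], check)
        for column, check in validators.items()
    }


# Illustrative sample only: one empty cell and one non-numeric cell in "qty".
SAMPLE = "id,qty\n1,10\n2,\n3,abc\n"
profile = profile_csv(SAMPLE, {"id": is_integer, "qty": is_integer})
print(profile)
```

In this sample, the `qty` column has one value in each category, which is the kind of summary a profile lets you check before you start wrangling.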
Tip: You can also use this technique to generate an output of your source data, which is useful if you do not have read access to the source outside of the product.

Steps:

  1. Create an imported dataset from your source. Add it to a flow. See Import Data Page.
    1. Depending on how your data is structured, you may choose to disable Detect Structure. For more information, see Initial Parsing Steps.
  2. In Flow view, create a recipe for your imported dataset. See Flow View Page.
  3. In Flow view, edit the newly created recipe. It is opened in the Transformer page. See Transformer Page.
  4. If needed, add a header step to your dataset. 
  5. Click Run Job. The Run Job page opens.
  6. In the Run Job page, select the following options:
    1. If you have the option of selecting a running environment, select the default one. This option may not be available in your product.
    2. Select CSV as the output format. You need at least one output format to generate your dataset's profile.
    3. Select the option to profile results.
  7. Click Run Job.
  8. When the results are generated, click View Results.
  9. A profile of your dataset is displayed.

...

You can also download the dataset for recordkeeping. See Job Results Details Page.
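If you keep the source profile as a baseline, you can later compare it with a profile taken after wrangling. The sketch below shows one way to do that outside the product, assuming each profile has been reduced to per-column counts. The `compare_profiles` function, the dictionary layout, and the numbers are hypothetical and do not correspond to any product export format.

```python
def compare_profiles(baseline, current):
    """Return the per-column change in each count (current minus baseline)."""
    changes = {}
    for column, base_counts in baseline.items():
        current_counts = current.get(column, {})
        changes[column] = {
            category: current_counts.get(category, 0) - base_count
            for category, base_count in base_counts.items()
        }
    return changes


# Hypothetical counts: the source profile versus a profile after wrangling.
baseline = {"qty": {"valid": 1, "mismatched": 1, "missing": 1}}
current = {"qty": {"valid": 3, "mismatched": 0, "missing": 0}}

changes = compare_profiles(baseline, current)
print(changes)
```

A negative change in the mismatched or missing counts suggests that the wrangling steps resolved issues that were present in the source.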

Preserve Source Visual Profile

...