Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Published by Scroll Versions from space DEV and version next

...

If you know that you are dropping some rows and columns from your dataset, add these transform transformation steps early in your recipe. This reduction simplifies working with the content through the application and, at execution, speeds the processing of the remaining valid data. Since you may be executing your job multiple times before it is finalized, it should also speed your development process.

  • To drop columns:
    • Select Drop from the column drop-down for individual columns. See Column Menus.
    • Use the drop transform to Delete Columns transformation to remove multiple discrete columns or ranges of columns. See Drop Transform. 
  • To delete rows: The following example removes all rows that lack a value for the id column:

    D trans
    p03Valuedelete matching rows
    Typestep
    p01NameCondition
    p01ValueIs missing
    p02NameColumn
    p02Valueid
    p03NameAction
    SearchTermFilter rows

  • To keep rows: The following example keeps all rows that lack a value in the id column:
    D trans
    p03Valuekeep matching rows
    Typestep
    p01NameCondition
    p01ValueIs missing
    p02NameColumn
    p02Valueid
    p03NameAction
    SearchTermFilter rows
  • See Filter Data.

Perform joins early

After you have filtered out unneeded rows and columns, join operations should be performed in your recipe.These steps bring together your data into a single consistent dataset. By doing them early in the process, you reduce the chance of having changes to your join keys impacting the results of your join operations. See Join Panel.

...