Registered users of this product or Trifacta Wrangler Enterprise should login to Product Docs through the application.
Through the Flow View page, you can access and manage all objects in your flow. For each imported dataset, recipe, or wrangled dataset in your flow, you can perform a variety of actions to effectively manage flow development and job execution through a single page.
The imported datasets in the flow or wrangled datasets added to the flow are listed on the left side of the screen. Associated with each imported dataset can be one or more recipes, which are used to transform the source data into wrangled datasets.
- These objects are connected together by lines flowing between them, which show the relationships between the objects of the flow.
For any dataset, any objects on which it depends are displayed to the left of the object on one of the flowing lines leading from the dataset. In the above example, the
POS-01dataset is dependent on all of the objects in the flow, while the
REF_CALdataset is only dependent on its recipe and the
Tip: When you run a job from a wrangled dataset, all of the recipes steps for the preceding datasets are executed as part of the job, and only the results of the terminal wrangled dataset are generated.
- For more information on these objects, see Object Overview.
Select an object from your flow to open an object-specific panel on the right side of the screen.
|Add Datasets||Add new datasets to the flow. See Add Datasets to Flow below.|
|Make a copy||Create a copy of the flow. The copied flow is owned by the user who copied it.|
|Edit name and description...||Change the name and description of the flow.|
Delete the flow.
Deleting a flow removes all wrangled datasets that are contained in the flow. If copies of these datasets exist in other flows, they are not touched. Imported datasets are not deleted by this action.
Add Datasets to Flow
From the Flow View page, you can add imported or wrangled datasets to your flow. These datasets are added as independent objects in the flow and can be joined, unioned, or referenced by other datasets in the flow.
- Search for or select the dataset to add.
- Use the page view controls to browse for other datasets, or select the appropriate tab to filter the list to Wrangled or Imported datasets.
- To import new datasets from external sources, click Import Datasets. See Import Dataset Page.
- When you have made your selections, click Add.
The dataset is added as a new object in flow view.
NOTE: For imported datasets that do not have a published schema, such as (CSV, TXT, LOG, or JSON files), a recipe, including steps for inferring structure, and a wrangled dataset are automatically created as part of the process.
View for Imported Datasets
When you select an imported dataset, you can preview the data contained in it, swap the source object, and more from the right-side panel.
In the Data Preview window, you can see a small section of the data that is contained in the imported dataset. This window can be useful for verifying that you are looking at the proper data.
Tip: You can select and copy data from this preview window.
|Type||Indicates where the data is sourced or the type of file.|
|File Size||Size in KB.|
|Location||Path to the location of the file.|
|Used In||Count of flows where the imported dataset is used, including the current one.|
Swap out the current source for a new source for the imported dataset.
NOTE: If the swapped-in source does not have the same schema as the original source, recipe steps in the current flow and any flow that uses the imported dataset may be broken.
For more information, see Dataset Browser.
|Add new Recipe||Create a new recipe and wrangled dataset from the imported dataset. This recipe and dataset combination is independent of the original one.|
|Edit name and description...||Change the name and description of the imported dataset.|
Remove the imported dataset from the flow.
NOTE: Any recipe and wrangled dataset using the imported dataset are also removed. In the Remove Dataset dialog, click Details to review the imported dataset.
|More details||See Dataset Details Page.|
View for Recipes
For each recipe, you can review or edit its steps or create new recipes altogether.
|Steps Preview||Preview the first steps in the recipe.|
|Steps||Total count of the steps in the recipe.|
|Edit Recipe||Open the recipe and begin editing. See Transformer Page.|
|Make a Copy||Create a copy of the recipe and a new wrangled dataset. The copied recipe is owned by the user who copied it.|
|Move...||Move the recipe to a different flow, or create a new flow to contain it.|
Delete the recipe.
This step cannot be undone.
|See data||View the data in the wrangled dataset. See View for Wrangled Datasets below.|
View for Wrangled Datasets
In the Data Preview window, you can see a small section of the data that is contained in the wrangled dataset. This window can be useful for verifying that you are looking at the proper data.
Tip: You can select and copy data from this preview window.
|Size||Count of columns and data types in the wrangled dataset.|
|Used In||Count of flows where the dataset is used.|
|Ran||Count of jobs launched for the wrangled dataset.|
|Last Ran||Timestamp for when the job last ran.|
|Edit Recipe||Edit the recipe of the wrangled dataset. See Transformer Page.|
Launch a job for the wrangled dataset, its recipes, and all preceding datasets.
NOTE: This feature is not available in Trifacta Wrangler.
|Add new Recipe||Create a new recipe and wrangled dataset from the wrangled dataset. This recipe and dataset combination is independent of the original one.|
|Edit name and description...||Change the name and description for the wrangled dataset.|
|See recipe||View the steps of the recipe associated with the wrangled dataset. See View for Recipes above.|
|More details||Review details on the flows where the dataset is used.|
View for Referenced Datasets
A referenced dataset is a wrangled dataset that is added to a flow from another flow.
NOTE: A referenced dataset is a read-only object in the flow where it is referenced.
To add a referenced dataset, click Add Datasets from the main Flow View page and select one from a different flow. See Add Datasets to Flow above.
|Size||Number of columns and data types in the referenced dataset.|
|Source Flow||Flow that contains the dataset. Click the link to open the Flow View page for that dataset.|
|Add new Recipe||Create a new recipe and wrangled dataset from the referenced dataset. This recipe and dataset combination is independent of the original one.|
|Remove...||Remove the referenced dataset from the flow. The source dataset in the other flow is untouched.|
This page has no comments.