When you import a dataset from a file, Trifacta® Wrangler attempts to detect the structure of the file and to apply an initial set of parsing steps to the data to render it in tabular form for display in the Transformer page. For example, JSON files may be turned into a table of data as long as the structure of the data supports this structuring.
NOTE: Initial parsing steps are applied only to file-based sources of data.
These steps vary based on the file format of data that is being imported. Depending on the dataset, you may need to modify these steps or rebuild them altogether. You can use the following steps to prevent Trifacta® Wrangler from detecting the structure and automatically hiding these steps.
Tip: You should allow the product to detect the structure first. If it does not detect the structure well, you can experiment with disabling it and rebuilding the steps to meet your dataset requirements.
Tip: If you have already imported the dataset and need to change this setting, you can re-import the source and change the settings. In any flows that use the previously imported version of this dataset, you can change the input for any recipe that uses the old version to use this newly imported version in Flow View.
NOTE: When the steps are completed, the initial parsing steps are listed in any recipe that you create from the imported dataset. If you wish to remove them altogether, you can delete them from the recipe.
- After you have selected or specified the file to import in the Import Data page, click Edit Settings for the dataset card in the right panel.
- Deselect the Detect Structure checkbox. For more information, see File Import Settings.
- Continue the import process by adding the dataset to a new flow.
- When the imported dataset is added to a flow, it is listed as an unstructured dataset.
- Select the dataset and click Add new recipe.
- When you select the recipe, the initial parsing steps are listed in the right panel.
When the dataset is loaded into the Transformer page, you can modify these steps to improve the parsing or delete them altogether.
NOTE: Any step that breaks up the data into individual rows into individual rows must be the first step in the recipe. To create, enter Break into rows in the Search panel.
This page has no comments.