When you import a dataset from a file, the Dataprep by Trifacta Cloud attempts to detect the structure of the file and to apply an initial set of parsing steps to the data to render it in tabular form for display in the Transformer page. For example, JSON files may be turned into a table of data as long as the structure of the data supports this structuring.
NOTE: Initial parsing steps are applied only to file-based sources of data.
These steps vary based on the file format of data that is being imported. Depending on the dataset, you may need to modify these steps or rebuild them altogether. You can use the following steps to prevent theDataprep by Trifacta Cloud from detecting the structure and automatically hiding these steps.
Tip: You should allow the product to detect the structure first. If it does not detect the structure well, you can experiment with disabling it and rebuilding the steps to render your data in tabular format.
Tip: If you have already imported the dataset and need to change this setting, you can re-import the source and change the settings. Wherever the previously imported version of this dataset is referenced, you must change the reference to use this newly imported dataset.
NOTE: When the steps are completed, the initial parsing steps are listed in any recipe that you create from the imported dataset. If you wish to remove them altogether, you can delete them from the recipe.
Import Unstructured Dataset
- After you have selected or specified the file to import in the Import Data page, click Edit Settings for the dataset card in the right panel.
- Deselect the Detect Structure checkbox. For more information, see File Import Settings.
- Continue the import process by adding the dataset.
- When the imported dataset is added, it is listed as an unstructured dataset.
Use Unstructured Dataset
- Import the unstructured dataset in Flow View.
- Select the unstructured dataset and click Add new recipe.
- When you select the recipe, the initial parsing steps are listed in the right panel.
When the dataset is loaded into the Transformer page, you can modify these steps to improve the parsing or delete them altogether.
NOTE: Any step that breaks up the data into individual rows into individual rows must be the first step in the recipe. To create, enter Break into rows in the Search panel.
This page has no comments.