- The colon character (
:) cannot appear in a filename or a file path.
- Filenames cannot begin with special characters like dot (
.) or underscore(
NOTE: Compressed files are recognized and can be imported based on their file extensions.
|D s product|
For more information on the supported input formats, see see Supported File Formats.
2. Add datasets
When you have found your source directory or file:
You can hover over the name of a file to preview its contents.
NOTE: Preview may not be available for some sources, such as Parquet.
Click the Plus icon next to the directory or filename to add it as a dataset.
Tip: You can import multiple datasets at the same time. See below.
Excel files: Click the Plus icon next to the parent workbook to add all of the worksheets as a single dataset, or you can add individual sheets as individual datasets. See See Import Excel Data.
3. Configure selections
When a dataset has been selected, the following fields appear on the right side of the screen. Modify as needed:
This name appears in the interface.
You may add an optional description that provides additional detail about the dataset. This information is visible in some areas of the interface.
Tip: Click the Eye icon to inspect the contents of the dataset prior to importing.
You can select a single dataset or multiple datasets for import.
Import file settings
You can modify settings used during import for individual files. In the card for an individual dataset, click click Edit Settings.
NOTE: In some cases, there may be discrepancies between row counts in the previewed data versus the data grid after the dataset has been imported, due to rounding in row counts performed in the preview.
|D s product|
applies a specified encoding type on the imported the file. In some cases, the data preview panel may contain garbled data, due to a mismatch in encodings. In the Data Preview dialog, you can select a different encoding for the file. When the correct encoding is selected, the preview displays the data as expected.
NOTE: Assessing the file encoding type based on parsing an input file is not an accurate method. Instead,
NOTE: In some cases, imported files are not properly parsed due to issues with encryption types or encryption keys in the source datastore. For more information, please contact your datastore administrator.
For a list of supported encoding types, see Supported File Encoding Types.
|D s product|
attempts to interpret the structure of your data during import. This structuring attempts to apply an initial tabular structure to the dataset.
- Unless you have specific problems with the initial structure, you should leave the Detect structure setting enabled. Recipes created from these imported datasets automatically include the structuring as the first, hidden steps. These steps are not available for editing, although you can remove them through the Recipe panel. See See Recipe Panel.
- When detecting structure is disabled, imported datasets whose schema has not been detected are labeled, unstructured datasets. When recipes are created for these unstructured datasets, the structuring steps are added into the recipe and can be edited as needed.
- For more information, see see Initial Parsing Steps.
Remove special characters from column names: When selected, characters that are not alphanumeric or underscores are stripped, and space characters are converted to underscores.
For more information, see Sanitize Column Names.
4. Import selections
If you have selected a single dataset for import:
- Click Click Continue. The dataset is imported.
- A recipe is created for it, added to a new flow, and loaded in the Transformer page for wrangling. See See Transformer Page.
- To import the selected datasets, click click Continue.
The datasets are imported and added to a new flow.
NOTE: To rename this flow, click
Untitledat the top of Flow View.
Tip: To begin transforming one of these datasets, select it. From its context menu, select Add new recipe. Select the recipe. In the context panel on the right, select Edit Recipe. See Transformer Page.
For more information, see Flow View Page.
- To remove a dataset from import, click the X in the dataset card.