In addition to CSV and other formats, can directly import Microsoft® Excel® workbooks. The worksheets of a workbook can be imported as:
A dataset with parameters
NOTE: When importing as a parameterized dataset, all selected worksheets are imported into a single dataset.
Workbooks can be uploaded from your local system. If is connected to a backend file storage system, you can also import workbooks stored in readable directories.
Source Excel files with cells bracketed by single double quotes may not be properly ingested if any terminating quotes are missing.
Tip: You can check the data quality bars for mismatched values or, for strings, the data histogram bars for anomalous values to see if the above issue is present. If so, deselect Detect Structure on import. Then, use a Split rows transformation applied to the affected column to break up the column as needed.
You cannot import password-protected Excel files.
If you modify the parameters such that an Excel file is no longer the first matching file, you may see an error similar to the following:
First matched file cannot be changed from Excel or vice-versa.
Excel is a special case of first file matching. Please modify parameters so that an Excel file is the first matching file.
The schema of the dataset is determined by the first matching file. If you modify the parameters of your dataset after creation such that the first match file has changed, you can corrupt the dataset schema, as the new Excel file may have different sheets with different names.
In Flow View, you must edit the dataset settings on the imported dataset object. Select the proper sheets to use in the new first matching file.
If you do not fix this issue, results may be unexpected in the application.
In the Library page, click Import Data. Select the connection to use. See Import Data Page.
Import Excel workbook
Tip: If you experience issues uploading large XLS/XLSX files, you can convert the files to CSV files and then upload them.
By default, all worksheets in the workbook are imported as individual datasets. To change how the data is imported, click Edit in the right panel.
Import settings for Excel datasets
Selected sheets into 1 dataset: All selected sheets in the workbook are combined and imported as a single dataset.
NOTE: The schemas of each dataset must match. Columns must be listed in the same order in each dataset. The column headers are taken from the first selected dataset.
All and future sheets into 1 dataset: If the workbook is updated periodically with new sheets that you would like to add in the future, select this option. After initial selection of sheets, all sheets that are added to the workbook in the future are automatically added as part of the imported dataset.
NOTE: This option is available only if you are connected to a backend file storage system, such as HDFS, S3, or GCS.