If needed, you can clean the names of the columns in your dataset.
When column names are sanitized:
- alphanumeric characters and underscores (
_) are permitted
- spacebars are converted to underscores
- all other characters are removed
Although Dataprep by Trifacta® supports a wider range of characters, you may wish to sanitize your column names to simplify publishing to and import into downstream systems.
Sanitize during Import
The above sanitization can be applied to your column names when the dataset is imported.
Tip: If you notice issues with references to your column names in your recipes, you may be able to fix them by re-importing the dataset and choosing to sanitize during import.
- From the menubar, click Library.
- In the Library page, click Import Data.
- In the Import Data page, select the file or table to import.
- Click Edit Settings.
- In the dialog, select Remove special characters from column names.
- Complete the import of the dataset.
For more information, see File Import Settings.
Sanitize via Transformation
Through the Transform Builder, you can add a step to sanitize column names in your recipe.
Tip: If you are sanitizing your column names for downstream systems, you should add this step at the end of your recipe.
You can perform more fine-grained column renaming operations. See Rename Columns.
This page has no comments.