A column is referenced by the name of the column, which can be inferred from the first row of data in your dataset.
When a dataset is loaded, the application inserts a few transform steps automatically. If the application can identify that the first row of data is likely to contain the column headers for the dataset, this row is promoted to be used as the first version of the names of each column.
In some cases, however, this auto-generation of column headers may not work as expected, or you may have chosen at import time to not detect the structure of the dataset.
This section describes how you can generate column headers from within the application.
If the initial transforms do not promote your first row of data to be the column headers, you can use the following transform to promote the first row of data to be the column headers:
In some cases, the first row of data might not contain the headers or might not contain all of them.
For example, you may have some columns that contain nested data, and the column headers may not be immediately accessible.
Tip: After you unnest data in one or more columns, the first row might contain column headers. You can apply the
In some cases, data may be imported such that header information is stored in a row other than the first one in your dataset.
NOTE: If source row number information is not available due to changes in the dataset, this transform may not be available.
Hover your mouse over the black dot to the left of the row that contains your header information. The popup displays something similar to the following:
Row 12 Source Row 12
Add a transform step using the source row number that you found:
If for some reason your source data does not include header information, you can insert header information using the following method.
NOTE: In general, it is easiest to manually rename columns through the application. See Rename Columns.
However, if your data contains a large number of columns, manually renaming each column may be time-consuming, and each column rename adds a step to your recipe. For wide datasets, this solution may be easier to execute and to maintain.
If you are using another application, make sure that you are inserting commas between each value and putting your column names between double-quotes.
NOTE: You may need to create a dummy second row, which forces the application to treat the imported dataset as multiple columns. Otherwise, it may treat the incoming CSV as a single columnar value.
Rename this column to prepend the name with
aaa. For our
TransactionId column, the new column name would be the following:
In the append dataset, the column header row is the last one. Locate the column that contains the column header by which you want to sort.
aaaTransactionIdvalue should be at the top of the dataset.
Now, use this row to create your headers:
Rename the column header to remove the
Change the data type back to its original value, if necessary.