Formats a specified Datetime set of values according to the specified date format. Source values can be a reference to a column containing Datetime values. |
For more information on formatting numeric types, see NUMFORMAT Function.
dateformat(MyDate, 'yyyy-MM-dd') |
Output: Returns the valid date values in the MyDate
column converted to year-month-day format.
dateformat(Datetime_col, date_format_string) |
Argument | Required? | Data Type | Description |
---|---|---|---|
Datetime_col | Y | datetime | Name of column containing date values to be formatted |
date_format_string | Y | string | String literal identifying the date format to apply to the value |
Name of the column whose date data is to be formatted.
Required? | Data Type | Example Value |
---|---|---|
Yes | Datetime | myDate |
String value indicating the date format to apply to the input values.
supports Java formatting strings, with some exceptions.
NOTE: If the platform cannot recognize the date format string, the generated result is written as a string value. |
For more information on supported date formats, see Datetime Data Type.
Required? | Data Type | Example Value |
---|---|---|
Yes | String | 'MM/dd/yyyy' |
This example illustrates several ways of wrangling heterogeneous date values, including the use of the DATEFORMAT
function.
Source:
Your dataset includes the following messy date values:
MyDate |
---|
2/1/00 9:20 |
4/5/10 11:25 |
6/7/99 22:00 |
12/20/1894 15:45:00 |
13/7/1999 22:00:00 |
Transformation:
When this data is loaded into the application, it is not immediately recognized as a Datetime column, as the variation among the data complicates deciding on the proper date format. The first three rows look to be in a consistent format, but the other two are problematic.
You can try to change the column to a Datetime type with a format that matches the first three rows. You can select the appropriate format through the type drop-down. When previewed, the transform looks like the following:
NOTE: Do not add this transform at this time. It is strictly used for reviewing the effects on data quality. |
When the column is reformatted, you should notice that the last two values in the column are mismatched. In the column histogram, you can see that date ranges include the 1999 date in the third row, so the final row should work if it was a valid date.
The 1894 value looks like an outlier value and could be removed:
For the remaining 1999 row, you can delete it or use the following transforms to conform it to the other rows. Use the following transform to change the 13
month value to a 12
:
The following two transforms complete the cleanup steps:
If you apply the original formatting step, all dates are valid:
Now, your Datetime column can be formatted as needed using the dateformat
function. The following step generates a new column that contains year, month, and day information as a single numeric value:
Results:
The final dataset should look like the following:
MyDate | dateformat_MyDate |
---|---|
2/1/00 9:20 | 20000201 |
4/5/10 11:25 | 20100405 |
6/7/99 22:00 | 19990607 |
12/7/99 22:00 | 19991207 |
Source | Transformation | Results |
---|---|---|
2/15/16 13:26:58.123 3/12/99 2:45:21.456 11/21/11 23:02:18.000 | 2016-02-15 1999-03-12 2011-11-21 |
Source | Transformation | Results |
---|---|---|
2/15/16 13:26:58.123 3/12/99 2:45:21.456 11/21/11 23:02:18.000 | 2/15/16 3/12/99 11/21/11 |
Source | Transformation | Results |
---|---|---|
2/15/16 13:26:58.123 3/12/99 2:45:21.456 11/21/11 23:02:18.000 | February 15, 2016 March 12, 1999 November 21, 2011 |
Source | Transformation | Results |
---|---|---|
2/15/16 13:26:58.123 3/12/99 2:45:21.456 11/21/11 23:02:18.000 | Mon Feb 15, 2016 Fri Mar 12, 1999 Mon Nov 21, 2011 |
Source | Transformation | Results |
---|---|---|
2/15/16 13:26:58.123 3/12/99 2:45:21.456 11/21/11 23:02:18.000 | 13:26:58.123 2:45:21.456 23:02:18.000 |
Source | Transformation | Results | |
---|---|---|---|
2/15/16 13:26:58.123 3/12/99 2:45:21.456 11/21/11 23:02:18.000 |
| 1:26:58 PM 2:45:21 AM 11:02:18 PM |
For more information on supported date formats, see Datetime Data Type.