Optionally, you can insert an integer as a parameter.
derive type:single value: RAND() as:'random'
Output: For each row, generate a random number between 0 and 1 in the new
There are no arguments for this function.
In the following example, the
random column is generated by the
derive type:single value:RAND() as:'random'
RAND function is typically used to introduce randomness of some kind in your data. In the following example, it is used to perform sampling within your wider dataset.
Tip: Keep in mind that for larger datasets the application displays only a sample of them. This method of randomization is applied to the full dataset during job execution.
You want to extract a random sample of 20% of your set of orders for further study:
You can use the following transform to generate a random integer from one to 10:
derive type:single value:round(RAND() * 10) as:'random'
You can now use the following transform to keep only the rows that contain random values that are in the top 20%, where the value is
keep row:(random > 8)
NOTE: Since the results are randomized, your results might vary.