Page tree

Release 9.2


Contents:

   

This example illustrates you to apply percentile functions.

Functions:

ItemDescription
MEDIAN Function Computes the median from all row values in a column or group. Input column can be of Integer or Decimal.
PERCENTILE Function Computes a specified percentile across all row values in a column or group. Input column can be of Integer or Decimal.
QUARTILE Function Computes a specified quartile across all row values in a column or group. Input column can be of Integer or Decimal.
APPROXIMATEMEDIAN Function Computes the approximate median from all row values in a column or group. Input column can be of Integer or Decimal.
APPROXIMATEPERCENTILE Function Computes an approximation for a specified percentile across all row values in a column or group. Input column can be of Integer or Decimal.
APPROXIMATEQUARTILE Function Computes an approximation for a specified quartile across all row values in a column or group. Input column can be of Integer or Decimal.

Source:

The following table lists each student's height in inches:

StudentHeight
164
265
363
464
562
666
766
865
969
1066
1173
1269
1369
1461
1564
1661
1771
1867
1973
2066

Transformation:

Use the following transformations to calculate the median height in inches, a specified percentile and the first quartile.

  • The first function uses a precise algorithm which can be slow to execute across large datasets.
  • The second function uses an appropriate approximation algorithm, which is much faster to execute across large datasets. 
    • These approximate functions can use an error boundary parameter, which is set to 0.4 (0.4%) across all functions.

Median: This transformation calculates the median value, which corresponds to the 50th percentile.

Transformation Name New formula
Parameter: Formula type Single row formula
Parameter: Formula median(heightIn)
Parameter: New column name 'medianIn'

Transformation Name New formula
Parameter: Formula type Single row formula
Parameter: Formula approximatemedian(heightIn, 0.4)
Parameter: New column name 'approxMedianIn'

Percentile: This transformation calculates the 68th percentile.

Transformation Name New formula
Parameter: Formula type Single row formula
Parameter: Formula percentile(heightIn, 68, linear)
Parameter: New column name 'percentile68In'

Transformation Name New formula
Parameter: Formula type Single row formula
Parameter: Formula approximatepercentile(heightIn, 68, 0.4)
Parameter: New column name 'approxPercentile68In'

Quartile: This transformation calculates the first quartile, which corresponds to the 25th percentile.

Transformation Name New formula
Parameter: Formula type Single row formula
Parameter: Formula quartile(heightIn, 1, linear)
Parameter: New column name 'percentile25In'

Transformation Name New formula
Parameter: Formula type Single row formula
Parameter: Formula approximatequartile(heightIn, 1, 0.4)
Parameter: New column name 'approxPercentile25In'

Results:

studentIdheightInapproxPercentile25Inpercentile25InapproxPercentile68Inpercentile68InapproxMedianInmedianIn
164646467.166.926666
265646467.166.926666
363646467.166.926666
464646467.166.926666
562646467.166.926666
666646467.166.926666
766646467.166.926666
865646467.166.926666
969646467.166.926666
1066646467.166.926666
1173646467.166.926666
1269646467.166.926666
1369646467.166.926666
1461646467.166.926666
1564646467.166.926666
1661646467.166.926666
1771646467.166.926666
1867646467.166.926666
1973646467.166.926666
2066646467.166.926666

This page has no comments.