To prevent overwhelming the client or significantly impacting performance, the Transformer page works with a sample of your data rather than the full dataset.
How Sampling Works
When a dataset is first created, a background job begins to generate a sample using the first set of rows of the dataset.
This initial data sample is usually very quick to generate, so you can get to work right away on your transformations.
- The default sample is the initial sample.
- By default, each sample is 10 MB in size or the entire dataset if it's smaller.
- If your source of data is a directory containing multiple files, the initial sample for the combined dataset is generated from the first set of rows in the first file listed in the directory. If that file is a multi-sheet Excel file, the sample is taken from the first sheet in the file.
- If you are wrangling a dataset with parameters, the initial sample loaded in the Transformer page is taken from the first matching dataset.
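
The selection rules above can be illustrated with a short Python sketch. This is not the product's implementation. The function name `initial_sample`, the constant `DEFAULT_SAMPLE_BYTES`, the reading of "10 MB" as 10 × 1024 × 1024 bytes, and the use of sorted filename order for "first file listed" are all assumptions made for illustration. The sketch covers plain-text files only, not Excel sheets or datasets with parameters.

```python
from __future__ import annotations

from pathlib import Path

# Assumption: "10 MB" is read here as 10 * 1024 * 1024 bytes.
DEFAULT_SAMPLE_BYTES = 10 * 1024 * 1024


def initial_sample(source: Path, max_bytes: int = DEFAULT_SAMPLE_BYTES) -> list[str]:
    """Return the leading rows of a source, stopping before max_bytes is exceeded."""
    if source.is_dir():
        # Assumption: "first file listed" means first in sorted name order.
        files = sorted(p for p in source.iterdir() if p.is_file())
        if not files:
            return []
        source = files[0]

    rows: list[str] = []
    used = 0
    with source.open("rb") as handle:
        for raw in handle:
            # Stop once adding the next row would exceed the size limit.
            if used + len(raw) > max_bytes:
                break
            rows.append(raw.decode("utf-8").rstrip("\r\n"))
            used += len(raw)
    # If the file is smaller than max_bytes, every row is returned.
    return rows
```

Calling `initial_sample(Path("my_directory"))` on a hypothetical directory would return rows only from the first file, which is why an initial sample may not be representative of rows that appear later in the dataset.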