In some cases, the Initial Data sample is the entire dataset.
Tip: To load the data, the application first generates and displays the initial data sample. For a better representation of the entire dataset, create a new sample.
- In other cases, the Initial Data sample is generated from a collection of files.
- If the recipe is a child recipe, then the Initial Data sample indicates the selected sample of the parent recipe.
- For more information on this special sampling type, see Overview of Sampling.
At the top of the panel, you can review the currently loaded sample. Each user has their own active sample on a dataset.
NOTE: When a new sample is generated, any Sort transformations that have been applied previously must be re-applied. Depending on the type of output, sort order may not be preserved.
Initial Data: By default, the application loads the first N rows of the dataset as the initial data sample when the Transformer page is opened. The number of rows depends on column count, data density, and other factors. If the dataset is small enough, the full dataset is used.
NOTE: By default, samples may be up to 10 MB in size or may be limited based on the maximum number of files that can be scanned. For datasets smaller than this limit, the entire dataset is loaded. See Overview of Sampling.
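The first-rows behavior described above can be pictured with a small sketch. This is not the application's implementation: the function name `initial_sample`, the byte-counting rule, and the constant `DEFAULT_LIMIT_BYTES` are hypothetical, and the sketch ignores the file-count limit and the other factors that affect the row count. It only illustrates the idea of reading rows from the start of a dataset until a size limit is reached, and using the full dataset when it fits.

```python
# Hypothetical illustration only; names and logic are assumptions,
# not the product's actual sampling code.
DEFAULT_LIMIT_BYTES = 10 * 1024 * 1024  # 10 MB default limit cited in the note


def initial_sample(lines, limit_bytes=DEFAULT_LIMIT_BYTES):
    """Collect rows from the start of the dataset until the size limit.

    Returns (rows, is_full_dataset). is_full_dataset is True when every
    row fit under the limit, so the sample is the entire dataset.
    """
    rows = []
    used = 0
    for line in lines:
        size = len(line.encode("utf-8"))
        if used + size > limit_bytes:
            # Adding this row would exceed the limit: stop with a partial sample.
            return rows, False
        rows.append(line)
        used += size
    return rows, True


# Usage: a tiny dataset with an artificially small limit of 8 bytes.
data = ["a,b\n", "1,2\n", "3,4\n"]
sample, is_full = initial_sample(data, limit_bytes=8)
```

With the default limit, the same three rows fit entirely, which corresponds to the case where the Initial Data sample is the whole dataset.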
Click the link in the current sample card to see the list of all available samples.
Tip: To change the name of a sample, click its card in the list of all available samples. Then, click the Edit icon.