Page tree

Versions Compared


  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migration of unmigrated content due to installation of a new plugin


 can import data from a variety of flat file formats and other distributed sources.



D s product
does not modify a source. Instead, a set of metadata is associated with the source data, which enables transformation of the source. On export, a new version of the data is written to one or more specified output destinations.

For more information on the formats supported for input, see Supported File Formats.


When data is imported, a reference to it is stored by the platform as an imported dataset. The source data is not modified. In the application, you modify the recipe associated with a dataset to transform the imported data.


NOTE: Any user with a valid user account can import data from a local file.

  1. Login to the application.


    Tip: When you login for the first time, you can immediately import a dataset to begin transforming it.

  2. In the menubar, click Datasets Library. Click Import Data.
  3. To add a dataset:

    Now, click Edit Recipe to
    1. Select the connection where your source is located. For this basic workflow, select  
    2. Upload: 
      1. Select Upload to upload a file from your local desktop. You can select multiple files to upload. For this example, select only one file.
      2. Navigate and select the file or files for your source. Click Open.
    3. Backend storage, such as S3:
      1. Navigate and select the file or files for your source.
      2. To queue the dataset for uploading, click the Plus icon next to its name.
  4. To begin working with a dataset, you must first add it into a flow, which is a container for datasets. Click the Add Dataset to a Flow checkbox and enter the name for a new flow. Click Import & Add to Flow.


    Tip: If you have selected a single file, you can begin wrangling it immediately. Click Import and Wrangle. The flow is created for you and is named based on the dataset name. Your dataset is automatically added to it.

  5. After the flow has been created, you can begin working with the flow objects. See Flow View Page.
  6. Select the dataset and click Create New Recipe
      1. You can select multiple files.
    1. Select the Add to new flow checkbox. This option creates a new flow, which is a container object for your 
      D s item
      . Your imported dataset is added to it.
  7. To begin working with your dataset, click Continue

  8. The imported dataset and its containing flow are created. 
  9. You can begin working with the dataset in the Transformer page. See  For more information, see Transform Basics.

Tip: If you are interested, you can create a visual profile of your source data before you begin transforming. For more information, see Profiling Basics.


For  For more information on details about importing, see Import Tasks.

Example Dataset

You can download an example dataset from the following URL:

http above dataset is used as part of the on-boarding tour presented to new users.