Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Published by Scroll Versions from space DEV and version next

Excerpt

D s product
rtrue
 can import data from a variety of flat file formats and other distributed sources.


Info

NOTE:

D s product
does not modify a source. Instead, a set of metadata is associated with the source data, which enables transformation of the source. On export, a new version of the data is written to one or more specified output destinations.

For more information on the formats supported for input, see Supported File Formats.Steps:

When data is imported, a reference to it is stored by the platform as an imported dataset. The source data is not modified. In the application, you modify the recipe associated with a dataset to transform the imported data.

Info

NOTE: Any user with a valid user account can import data from a local file.

  1. Login to the application.

    Tip

    Tip: When you login for the first time, you can immediately import a dataset to begin transforming it.

  2. In the menubar, click Library. Click Import Data.
  3. To add a dataset:

    1. Select the connection where your source is located. For this basic workflow, select Upload to upload a file from your local desktop. You can select multiple files to upload. For this example, select only one file.
    2. Navigate and select the file or files for your source. Click Open.
    3. To queue the dataset for uploading, click the Plus icon next to its name.
  4. To begin working with your dataset, click Continue. The flow is created for you, and your dataset is automatically added to it.

  5. You can begin working immediately with the dataset in the Transformer page. See Transform Basics.
Tip

Tip: If you are interested, you can create a visual profile of your source data before you begin transforming. For more information, see Profiling Basics.

 For more information, see Import Tasks.

Example Dataset

You can download an example dataset from the following URL:

http://trifacta-public-datasets.s3.amazonaws.com/demo-datasets/customers_sample_data.csv