This section describes how you interact through the  with your  environment.

Uses of 

The can use  for the following tasks:

  1. Creating Datasets from Files: You can read in source data stored in . An imported dataset may be a single  file or a folder of identically structured files. See Reading from Sources below.

  2. Reading Datasets: When creating a dataset, you can pull your data from a source in . See Creating Datasets below.
  3. Writing Results: After a job has been executed, you can write the results back to . See Writing Results below.

  4. As default storage environment: If is your default storage environment, additional data is stored by operations of the . See Configure Storage Environment.

In the ,  is accessed through the  browser. See TFS Browser.

NOTE: When the executes a job on a dataset, the source data is untouched. Results are written to a new location, so that no data is disturbed by the process.

Before You Begin Using 

Avoid using /trifacta/uploads for reading and writing data. This directory is used by the .

Your writeable home output directory is available through your user profile. See Storage Page.

Secure Access

Access to  is governed by IAM roles that are automatically assigned to workspace users of the product. No additional configuration is necessary to access .

Storing Data in 

When you upload raw data to , it is stored in your pre-configured uploads directory based on an internal upload identifier.

All should have a clear understanding of the folder structure within  where each individual can read from and write results.

NOTE: The does not modify source data in . Source data stored in is read without modification, and source data uploaded to the is stored in /trifacta/uploads.

Reading from Sources

You can create an imported dataset from one or more files stored in .

Wildcards:

You can parameterize your input paths to import source files as part of the same imported dataset. For more information, see Overview of Parameterization.

Folder selection:

When you select a folder in  to create your dataset, you select all files in the folder to be included.

Notes:

When a folder is selected from , the following file types are ignored:

NOTE: If you have a folder and file with the same name in , search only retrieves the file. You can still navigate to locate the folder. 

Creating Datasets

When creating a dataset, you can choose to read data in from a source stored from  or local file.

Data may be individual files or all of the files in a folder.

Writing Results

When you run a job, you can specify the file path where the generated results are written. By default, the output is generated in the default output home directory.

Creating a new dataset from results

As part of writing results, you can choose to create a new dataset, so that you can chain together data wrangling tasks.

NOTE: When you create a new dataset as part of your results, the file or files are written to the designated output location for your user account. Depending on how your permissions are configured, this location may not be accessible to other users.

Purging Files

Other than temporary files, the does not remove any files that were generated or used by the platform, including: