...

  1. Enabled S3 Integration: The platform has been configured to integrate with your S3 instance. For more information, see Enable S3 Access.

  2. Creating Datasets from S3 Files: You can read in source data stored in S3. An imported dataset may be a single S3 file or a folder of identically structured files. See Reading from Sources in S3 below.
  3. Reading Datasets: When creating a dataset, you can pull your data from a source in S3. See Creating Datasets below.
  4. Writing Results: After a job has been executed, you can write the results back to S3. See Writing Results below.
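Before importing a folder as a single dataset, it can help to confirm that its files really do share one structure. The sketch below is one possible pre-check, not part of the product: it assumes the files are delimited text with a header row and treats "identically structured" as "same header row", which is an assumption made here. The function names `header_of` and `identically_structured` are hypothetical.

```python
import csv
import io


def header_of(text, delimiter=","):
    """Return the first row of a delimited text file as a list of column names."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    return next(reader, [])


def identically_structured(files):
    """Compare header rows across files.

    files: mapping of file name -> file contents (str).
    Returns (True, []) when every header matches the first file's header,
    otherwise (False, [names of files whose header differs]).
    """
    items = list(files.items())
    if not items:
        return True, []
    expected = header_of(items[0][1])
    mismatched = [name for name, text in items[1:] if header_of(text) != expected]
    return not mismatched, mismatched
```

The file contents would be downloaded from S3 by whatever client you already use; this check works only on the text once it is in memory.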

In the web application, S3 is accessed through the S3 browser. See S3 Browser.

...

  • Access: If you are using system-wide permissions, your administrator must configure access parameters for S3 locations. If you are using per-user permissions, this requirement does not apply. See Enable S3 Access.

     



    Warning

    Avoid using /trifacta/uploads for reading and writing data. This directory is used by the web application.


  • Your administrator should provide a writeable home output directory for you. This directory location is available through your user profile. See Storage Config Page.
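If you build S3 paths in scripts, a small guard can catch accidental use of the reserved uploads directory. The following is a minimal sketch under stated assumptions: the helper name `uses_reserved_uploads` is hypothetical, and it assumes the directory sits at the top of the bucket as `trifacta/uploads`. Your deployment may place it under a different root.

```python
import posixpath

# Reserved directory named in the warning above (assumed to be at the bucket root).
RESERVED_PREFIX = "trifacta/uploads"


def uses_reserved_uploads(path):
    """Return True if path is, or falls under, the reserved uploads directory.

    Accepts either an s3://bucket/key URI or a bare key such as /trifacta/uploads/x.csv.
    """
    if path.startswith("s3://"):
        # Drop the scheme and bucket name, keep only the object key.
        _, _, key = path[len("s3://"):].partition("/")
    else:
        key = path
    # Normalize duplicate or trailing slashes, then strip the leading slash.
    key = posixpath.normpath("/" + key).lstrip("/")
    return key == RESERVED_PREFIX or key.startswith(RESERVED_PREFIX + "/")
```

The check compares whole path segments, so a sibling directory such as `trifacta/uploads2` is not flagged.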

...

You can parameterize your input paths to import source files as part of the same imported dataset. For more information, see Overview of Parameterization.

Info

NOTE: Import of glaciered objects (objects archived to an S3 Glacier storage class) is not supported.


Folder selection:

When you select a folder in S3 to create your dataset, all files in the folder are included.

...

  • Each set of results must be stored in a separate folder within your S3 output home directory.
  • For more information on your output home directory, see Storage Config Page.
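The one-folder-per-result-set rule can be sketched as a small path helper. This is an illustration only: the function name `results_folder`, the numeric suffix scheme, and the example paths are all hypothetical, and the set of existing folders is assumed to come from your own listing of the output home directory.

```python
import posixpath


def results_folder(output_home, job_name, existing_folders):
    """Return a folder path under output_home that is not already in use.

    output_home: the S3 output home directory, for example from your user profile.
    job_name: a label for this set of results.
    existing_folders: collection of folder paths already present under output_home.
    """
    base = posixpath.join(output_home.rstrip("/"), job_name.strip("/"))
    candidate = base
    suffix = 1
    # Append _2, _3, ... until the folder name is unused.
    while candidate in existing_folders:
        suffix += 1
        candidate = f"{base}_{suffix}"
    return candidate
```

For example, a first run labeled `sales` would get `<output home>/sales`, and a second run with the same label would get `<output home>/sales_2`, so the two result sets never share a folder.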


Warning

If your installation is using S3, do not use the trifacta/uploads directory. This directory is used for storing uploads and metadata, which may be used by multiple users. Manipulating files outside of the web application can destroy other users' data. Please use the tools provided through the web application interface for managing uploads from S3.


Info

NOTE: When writing files to S3, the UI may indicate that the job failed even though the output file or files have been written to S3. This can occur when S3 does not report the files back to the application before the S3 consistency timeout has expired. For more information on raising this timeout setting, see Enable S3 Access.
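When a job is reported as failed, you can confirm for yourself whether the outputs arrived by polling a listing of the output folder for a bounded time. The sketch below is not the application's consistency check and does not change its timeout. The function name `wait_for_outputs`, its parameters, and the default durations are hypothetical, and `list_keys` stands in for whatever S3 listing call you use.

```python
import time


def wait_for_outputs(expected_keys, list_keys, timeout_s=120.0, interval_s=5.0,
                     clock=time.monotonic, sleep=time.sleep):
    """Poll until every expected key appears in the listing or the timeout expires.

    expected_keys: iterable of object keys the job should have written.
    list_keys: zero-argument callable returning the keys currently visible.
    Returns (True, set()) on success, or (False, missing_keys) on timeout.
    """
    deadline = clock() + timeout_s
    missing = set(expected_keys)
    while True:
        # Remove every key that is now visible in the listing.
        missing -= set(list_keys())
        if not missing:
            return True, set()
        if clock() >= deadline:
            return False, missing
        sleep(interval_s)
```

The clock and sleep functions are injectable so the loop can be exercised without real waiting.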


Creating a new dataset from results

...