This section applies to getting started with , an AWS-native platform for data wrangling. The following following product tiers are available:
enables you to rapidly ingest, transform, and deliver clean, actionable data across your entire enterprise. Please review the following sections on how to prepare for and set up your workspace.
NOTE: This section applies to both the free version and the licensed version of . For more information on the differences, see Product Limitations.
This section provides an overview of how to get started using the product.
Having difficulties? To speak to a support representative, click the icon in the corner and submit your question.
Before you begin. If you are using your own AWS S3 buckets, you should prepare them and their access policies to ensure that can integrate with them.
NOTE: If you do not have these AWS resources, they can be created for you. Details are below.
Hosted on Amazon Web Services, is designed to natively interact with AWS datasources, so that you can rapidly transform your data investments in AWS.
When the product is first launched, a default storage environment is automatically created for you as part of this setup process. is backed by AWS S3 buckets hosted by and secured by IAM policies.
This default storage environment is managed by and is used for storing data assets as well as assets generated by use of the product.
If preferred, you can configure the use of S3 as the default storage environment.
NOTE: When S3 is used as the default storage environment, you must provide the policies, buckets, and other AWS resources required to manage your datasets and generated results.
Any of the following storage environment options can be configured after completing sign-up:
|Default Storage Environment||Additional Storage Environment|
NOTE: If you are using your own AWS/S3 resources, you must acquire configuration information before you can connect to your S3 assets. These requirements for these resources are covered later.
If you are enabling any relational source, including Redshift, you must whitelist the IP address range of the in the relevant security groups.
NOTE: The database to which you are connecting must be available from the over the public Internet.
The IP address range of the is:
For Redshift, there are two ways to whitelist the IP range depending on if you are using EC2-VPC or EC2-Classic (not common).
For details on this process with RDS in general, see https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/Overview.RDSSecurityGroups.html
For more information, please contact .
Tip: You can begin using pre-configured with a template of interest. Please visit the Templates page and select the template of interest. Then, click Sign up for Free Trial. For more information, see https://www.trifacta.com/templates.
To begin the registration process, please visit https://www.trifacta.com/start-wrangling.
After you have completed registration, please login to the application. The Home page is displayed.
NOTE: You can now access online documentation through the application. From the left nav bar, select Help menu > Documentation.
For more information on changing the default storage environment or enabling S3 as a storage environment, see Configure Storage Environment.
As the first registered user, you are assigned the workspace admin role, which provides control over workspace-level settings. Before you invite users to the workspace, you should review and modify the basic configuration for the workspace.
Tip: You can also rename the workspace.
For more information, see Workspace Settings Page .
NOTE: Workspace administrators should complete the following steps to verify that the product is operational end-to-end.
NOTE: First-time users of the product should access it by invitation only. Do not provide direct URLs to first-time users.
When a new workspace is created, the first user is provided a set of example flows. These flows are intended to teach by example and illustrate many recommended practices for building your own flows. For more information on example flows, see Workflow Basics.
Tip: After you have registered for a trial, you can immediately begin using the product.
This section contains additional steps that may be required to use all features of the product.
When you login for the first time, you may see a Missing Storage Settings error message. Your storage settings must be configured before you can import data and run jobs.
When you connect to the product, you may see a banner indicating that you have not confirmed your email address.
NOTE: You cannot invite other users to your workspace or receive email-based notifications from the product until you verify your email address.
In the , click the button in the banner to send an email to the address that you used to register.
You should receive an email like the following:
To access the full customer documentation, from the left nav bar, select Help menu > Documentation.
The following resources can assist workspace users in getting started with wrangling.
Tip: Check out the product walkthrough available through in-app chat! This tour steps through each phase of ingesting, transforming, and generating results for your data.