...
- Review Planning Guide: Please review and verify Install Preparation and sub-topics.
- Limitations: For more information on limitations of this scenario, see Product Limitations in the Install Preparation area.
Read: Please read this entire document before you create the EMR cluster or install the
.D s platform - Acquire Assets: Acquire the installation package for your operating system and your license key. For more information, contact
.D s support - If you are completing the installation without Internet access, you must also acquire the offline versions of the system dependencies. See Install Dependencies without Internet Access.
- VPC: Enable and deploy a working AWS VPC.
S3: Enable and deploy an AWS S3 bucket to use as the base storage layer for the platform. In the bucket, the platform stores metadata in the following location:
Code Block <S3_bucket_name>/trifacta
- IAM Policies: Create IAM policies for access to the S3 bucket. Required permissions are the following:
The system account or individual user accounts must have full permissions for the S3 bucket:
Code Block Delete*, Get*, List*, Put*, Replicate*, Restore*
These policies must apply to the bucket and its contents. Example:
Code Block "arn:aws:s3:::my-trifacta-bucket-name" "arn:aws:s3:::my-trifacta-bucket-name/*"
- See https://console.aws.amazon.com/iam/home#/policies
- EC2 instance role: Create an EC2 instance role for your S3 bucket policy. See https://console.aws.amazon.com/iam/home#/roles.
- EC2 instance: Deploy an AWS EC2 with SELinux where the
can be installed.D s item item software The required set of ports must be enabled for listening. See System Ports.
This node should be dedicated for
.D s item item use Info NOTE: The EC2 node must meet the system requirements. For more information, see System Requirements.
.
- EMR cluster: An existing EMR cluster is required.
Cluster sizing: Before you begin, you should allocate sufficient resources for sizing the cluster. For guidance, please contact your
.D s item item representative - See Deploy the Cluster below.
- Databases:
- The platform utilizes a set of databases that must be accessed from the
. Databases are installed as part of the workflow described later.D s node - For more information on the supported databases and versions, see System Requirements.
- For more information on database installation requirements, see Install Databases.
- If installing databases on Amazon RDS an admin account to RDS is required. For more information, see Install Databases on Amazon RDS.
- The platform utilizes a set of databases that must be accessed from the
...