
Scenario Description

This scenario assumes the following about the Trifacta platform deployment:

  • The platform is to be installed via an Amazon AMI onto an EC2 instance.
  • It is to be connected to an EMR cluster.
  • No security features are applied to the platform and its use of the datastore.
  • You have acquired a Trifacta license key. The license key must be deployed to the Trifacta node before you start the platform.
Info

NOTE: This scenario does not provide information on installing and configuring optional components, including security features. It is intended to get the Trifacta platform installed, operational, and connected to the EMR cluster.

Pre-requisites

Warning

If you are integrating the Trifacta platform with an EMR cluster, you must acquire a license first. Additional configuration is required. For more information, please contact aws-marketplace@trifacta.com.

Before you begin:

  1. Read: Read this entire document before you create the EMR cluster or install the Trifacta platform.

  2. Cluster sizing: Allocate sufficient resources for the EMR cluster. For sizing guidance, contact your Trifacta representative.

Internet access

See Configure for AWS.

SELinux

By default, the platform is installed on a server with SELinux enabled. Security-enhanced Linux (SELinux) provides a set of security features for, among other things, managing access controls.

Tip

Tip: The following may be applied to other deployments of the Trifacta platform on servers where SELinux has been enabled.

In some cases, SELinux can interfere with normal operations of platform software. If you are experiencing connectivity problems related to SELinux, you can do either one of the following:

...

  1. By default, the web application listens on port 3005. The following opens that port when SELinux is enabled:

    Code Block
    semanage port -a -t http_port_t -p tcp 3005
  2. Repeat the above step for any other ports that you wish to open on the server.
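Steps 1 and 2 can be scripted as a loop. The following is only a sketch, not a supported utility: the function name open_selinux_ports, the DRY_RUN variable, and the extra port 8080 are assumptions made for illustration. With DRY_RUN=1 (the default here), the commands are printed rather than executed. Set DRY_RUN=0 and run as root to apply them.

```shell
# Hypothetical helper: label one or more TCP ports as http_port_t.
# DRY_RUN=1 (default) prints each command instead of running it.
DRY_RUN="${DRY_RUN:-1}"

open_selinux_ports() {
  for port in "$@"; do
    if [ "$DRY_RUN" = "1" ]; then
      echo "semanage port -a -t http_port_t -p tcp $port"
    else
      semanage port -a -t http_port_t -p tcp "$port"
    fi
  done
}

# 3005 is the default port from step 1; 8080 is a placeholder.
open_selinux_ports 3005 8080
```

Printing the commands first makes it possible to review the port list before changing the SELinux policy.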

...

Trifacta node

...

Code Block
setsebool -P httpd_can_network_connect 1

Product Limitations

  • The EC2 instance, S3 buckets, and any connected Redshift databases must be located in the same Amazon region. Cross-region integrations are not supported at this time.
  • Hive integration is not supported.
  • Secure impersonation and Kerberos are not supported.
  • High availability and failover are not supported.
  • Job cancellation is not supported on EMR.
  • When publishing single files to S3, you cannot apply an append publishing action.
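The same-region requirement in the first limitation can be checked before installation. The sketch below is an assumption-laden illustration: the function names are hypothetical, and the commented wiring assumes that the AWS CLI is installed and configured. It relies on the AWS behavior that a bucket in us-east-1 reports an empty location constraint. Legacy location values are not handled.

```shell
# Hypothetical pre-flight check for the same-region requirement.

# A bucket in us-east-1 reports an empty/None location constraint,
# so normalize that case before comparing.
normalize_region() {
  case "$1" in
    ""|None|null) echo "us-east-1" ;;
    *) echo "$1" ;;
  esac
}

# Succeeds only when both arguments name the same region.
same_region() {
  [ "$(normalize_region "$1")" = "$(normalize_region "$2")" ]
}

# Example wiring (needs the AWS CLI and credentials; names are placeholders):
#   bucket_region="$(aws s3api get-bucket-location \
#       --bucket my-trifacta-bucket-name \
#       --query LocationConstraint --output text)"
#   same_region "$bucket_region" "us-west-2" || echo "Region mismatch" >&2
```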

Install

Info

NOTE: Before you install, review the configuration content for specific instructions on setting up the Trifacta node. See below.

  1. Create the EC2 instance for the Trifacta platform.
  2. Download and deploy the AMI into the EC2 instance.

...

Desktop Requirements

  • All desktop users of the platform must have the latest version of Google Chrome installed on their desktops.
    • Google Chrome must have the PNaCl client installed and enabled.
    • PNaCl Version:  0.50.x.y or later
  • All desktop users must be able to connect to the EC2 instance through the enterprise infrastructure.

Pre-requisites

Before you install the platform, please verify that the following steps have been completed.

  1. EULA. Before you begin, please review the End-User License Agreement. See https://docs.trifacta.com/display/PUB/End-User+License+Agreement+-+Trifacta+Wrangler+Enterprise.

  2. S3 bucket. Create an S3 bucket to store Trifacta assets. In the bucket, the platform stores metadata in the following location:

    Code Block
    <S3_bucket_name>/trifacta

    See https://s3.console.aws.amazon.com/s3/home.

  3. IAM policies. Create IAM policies for access to the S3 bucket. Required permissions are the following: 
    • The system account or individual user accounts must have full permissions for the S3 bucket:

      Code Block
      Delete*, Get*, List*, Put*, Replicate*, Restore*
    • These policies must apply to the bucket and its contents. Example:

      Code Block
      "arn:aws:s3:::my-trifacta-bucket-name"
      "arn:aws:s3:::my-trifacta-bucket-name/*"
    • See https://console.aws.amazon.com/iam/home#/policies
  4. EC2 instance role. Create an EC2 instance role for this policy. See https://console.aws.amazon.com/iam/home#/roles.
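Steps 3 and 4 can be prepared as policy documents on disk. The following sketch writes a permissions policy that combines the six permission families with the two example resource ARNs, plus a standard trust policy that lets EC2 assume a role. The file names, the BUCKET and OUT_DIR variables, and the policy and role names in the comments are assumptions. Review the generated JSON against your own security requirements before use.

```shell
set -eu

BUCKET="${BUCKET:-my-trifacta-bucket-name}"   # placeholder bucket name
OUT_DIR="${OUT_DIR:-.}"

# Permissions policy: the six action families, on the bucket and its contents.
cat > "$OUT_DIR/trifacta-s3-policy.json" <<EOF
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Action": [
        "s3:Delete*", "s3:Get*", "s3:List*",
        "s3:Put*", "s3:Replicate*", "s3:Restore*"
      ],
      "Resource": [
        "arn:aws:s3:::${BUCKET}",
        "arn:aws:s3:::${BUCKET}/*"
      ]
    }
  ]
}
EOF

# Trust policy that allows the EC2 service to assume the role.
cat > "$OUT_DIR/ec2-trust-policy.json" <<'EOF'
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Principal": { "Service": "ec2.amazonaws.com" },
      "Action": "sts:AssumeRole"
    }
  ]
}
EOF

echo "Wrote policy documents to $OUT_DIR"

# Possible follow-up (needs the AWS CLI and credentials; names are hypothetical):
#   aws iam create-policy --policy-name trifacta-s3-access \
#       --policy-document file://trifacta-s3-policy.json
#   aws iam create-role --role-name trifacta-ec2-role \
#       --assume-role-policy-document file://ec2-trust-policy.json
```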

Install Steps

...

  1. Instance size: Select the instance size.
  2. Network: Configure the VPC, subnet, firewall and other configuration settings necessary to communicate with the instance. 
  3. Auto-assigned Public IP: You must create a public IP to access the Trifacta platform.
  4. EC2 role: Select the EC2 role that you created.
  5. Local storage: Select a local EBS volume. The default volume includes 100GB storage.

    Info

    NOTE: The local storage environment contains the Trifacta databases, the product installation, and its log files. No source data is ever stored within the product.

  6. Security group: Use a security group that exposes access to port 3005, which is the default port for the platform. 
  7. Create an AWS key-pair for access: This key is used to provide SSH access to the platform, which may be required for some admin tasks.
  8. Save your changes.
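For readers who prefer the AWS CLI to the console, the settings above map roughly onto two commands. This is a sketch under stated assumptions, not the documented install path. Every ID, name, and CIDR range below is a placeholder, and the run wrapper and function names are hypothetical. By default the commands are only printed. Set RUN=1 to execute them with a configured AWS CLI.

```shell
# All values are placeholders; substitute your own before running.
AMI_ID="${AMI_ID:-ami-EXAMPLE}"
INSTANCE_TYPE="${INSTANCE_TYPE:-INSTANCE_TYPE_HERE}"
SUBNET_ID="${SUBNET_ID:-subnet-EXAMPLE}"
SECURITY_GROUP_ID="${SECURITY_GROUP_ID:-sg-EXAMPLE}"
INSTANCE_PROFILE="${INSTANCE_PROFILE:-trifacta-ec2-role}"
KEY_NAME="${KEY_NAME:-my-key-pair}"
ALLOWED_CIDR="${ALLOWED_CIDR:-203.0.113.0/24}"

# Print the command unless RUN=1 is set.
run() {
  if [ "${RUN:-0}" = "1" ]; then "$@"; else echo "$*"; fi
}

# Step 6: expose the platform's default port 3005 in the security group.
open_platform_port() {
  run aws ec2 authorize-security-group-ingress \
    --group-id "$SECURITY_GROUP_ID" \
    --protocol tcp --port 3005 --cidr "$ALLOWED_CIDR"
}

# Steps 1-5 and 7: size, network, public IP, role, and key pair.
launch_instance() {
  run aws ec2 run-instances \
    --image-id "$AMI_ID" \
    --instance-type "$INSTANCE_TYPE" \
    --subnet-id "$SUBNET_ID" \
    --associate-public-ip-address \
    --iam-instance-profile "Name=$INSTANCE_PROFILE" \
    --security-group-ids "$SECURITY_GROUP_ID" \
    --key-name "$KEY_NAME"
}

open_platform_port
launch_instance
```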

Apply license key:

...

Acquire the license.json license key file that was provided to you by your Trifacta representative.

...

Transfer the license key file to the EC2 node that is hosting the Trifacta platform. Navigate to the directory where you stored it.

...

Make the Trifacta user the owner of the file:

Code Block
sudo chown trifacta:trifacta license.json

Make sure that the Trifacta user has read permissions on the file:

Code Block
sudo chmod 644 license.json

...

Copy the license key file to the proper location:

Code Block
cp license.json /opt/trifacta/license/
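Because cp creates the destination file as the user who runs it, the deployed copy may not keep the ownership set in the earlier step. The sketch below is one way to inspect the result. The function name check_license_file is hypothetical, and stat -c assumes GNU coreutils, as found on typical Linux nodes.

```shell
# Hypothetical check: report owner and mode of a license file,
# and succeed only if the mode is 644.
check_license_file() {
  file="$1"
  if [ ! -f "$file" ]; then
    echo "missing: $file" >&2
    return 1
  fi
  # GNU stat: %a = octal mode, %U:%G = owner and group names.
  mode="$(stat -c '%a' "$file")"
  owner="$(stat -c '%U:%G' "$file")"
  echo "mode=$mode owner=$owner"
  [ "$mode" = "644" ]
}

# Usage on the node (path taken from the copy step):
#   check_license_file /opt/trifacta/license/license.json
```

If the reported owner is not trifacta:trifacta, repeating the chown command on the deployed copy is a reasonable fix.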

...

Launch the configured platform.

Info

NOTE: From the EC2 Console, please acquire the instanceId, which is needed in a later step.

...

When the instance is spinning up for the first time, performance may be slow. When the instance is up, navigate to the following:

Code Block
http://<public_hostname>:3005
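Since the first startup may be slow, it can help to poll the URL until it responds rather than refreshing the browser. The following is a minimal sketch: the function name wait_for_platform and its default attempt count and delay are assumptions, and any HTTP response is treated as "up".

```shell
# Hypothetical readiness poll: succeeds once the URL returns any HTTP response.
# Arguments: URL, number of attempts (default 30), delay in seconds (default 10).
wait_for_platform() {
  url="$1"
  attempts="${2:-30}"
  delay="${3:-10}"
  i=1
  while [ "$i" -le "$attempts" ]; do
    if curl --silent --output /dev/null --max-time 5 "$url"; then
      echo "reachable after $i attempt(s)"
      return 0
    fi
    # Wait between attempts, but not after the last one.
    if [ "$i" -lt "$attempts" ]; then
      sleep "$delay"
    fi
    i=$((i + 1))
  done
  echo "not reachable after $attempts attempt(s)" >&2
  return 1
}

# Usage (replace the placeholder with the instance's public hostname):
#   wait_for_platform "http://<public_hostname>:3005"
```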

...

  1. Username: admin@trifacta.local
  2. Password: (the instanceId value)

    Info

    NOTE: When you log in as an admin for the first time, change the password immediately. Select the User Profile menu item in the upper-right corner. Change the password and click Save to restart the platform.

...

  1. In the Search bar, enter the following:

    Code Block
    aws.s3.bucket.name
  2. Set the value of this setting to be the bucket that you created.

The following setting must be specified.

Code Block
"aws.mode":"system",

You can set the above value to either of the following:

...

Details on the above configuration are described later.

...

Click Save.

...

When the platform restarts, you can begin using the product.

SSH Access

If you need to SSH to the Trifacta node, you can use the following command:

Code Block
ssh -i <path_to_key_file> <userId>@<tri_node_DNS_or_IP>

...

<path_to_key_file>

...

Path to the key file stored on your local computer.

...

<tri_node_DNS_or_IP>

...

DNS or IP address of the Trifacta node
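OpenSSH refuses private key files that other users can read, so it is common to restrict the key before connecting. The sketch below is an illustration only: the function name ssh_command is hypothetical, and the user and host values in the usage line are the same placeholders as in the command above.

```shell
# Hypothetical helper: restrict the key file to owner read-only,
# then print the ssh command to run.
ssh_command() {
  key="$1"
  user="$2"
  host="$3"
  chmod 400 "$key" || return 1
  echo "ssh -i $key $user@$host"
}

# Usage (all three values are placeholders):
#   ssh_command <path_to_key_file> <userId> <tri_node_DNS_or_IP>
```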

If you are integrating with an EMR cluster, additional configuration is required. 

Info

NOTE: Please review these steps with your Trifacta representative.

See Configure for EMR.

Configure for EC2 Role-Based Authentication

This configuration is optional.

See Configure for EC2 Role-Based Authentication.

Start and Stop the Platform

See Install Start Platform.

Verify

See Install Verify.

Upgrade

For more information, see Upgrade for Amazon Marketplace with EMR.

Documentation

You can access complete product documentation in online and PDF format. From within the product, select Help menu > Product Docs.

...

Related Topics

...
