Page tree

 

Contents:


You can integrate the Trifacta® platform with the Waterline data catalog to simplify finding datasets within your enterprise data lake. The Waterline integration supports the following methods:

  1. Read directly from Waterline through a search box integrated into the Import Data page. 
  2. Locate assets through Waterline and open them with the Trifacta platform.

Waterline Data is a data catalog service for Hive. For more information, see www.waterlinedata.com.

Limitations

NOTE: This integration is not supported in the Wrangler Enterprise desktop application.

Pre-requisites

  • Waterline 4.0 and higher

  • Waterline must be integrated with your deployment of the Trifacta platform. For more information, please contact your Waterline administrator.

  • You must have credentials to access Waterline. 

    NOTE: Your Waterline administrator must ensure that your account has the appropriate permissions to search for and access datasets within Waterline and its integrated sources.

  • You must acquire the URL for the host of your Waterline deployment.

  • You must acquire the hostname and port for the Trifacta platform.

Enable Waterline Integration

Steps:

NOTE: Although the integration appears as a connection in the application, the connection cannot be created through the application. Please complete the following steps.

  1. Login to the platform as an administrator.
  2. From the menu, select User menu > Admin console > Admin settings.
  3. Search for waterline.
  4. Update the values for the following properties accordingly:

     

    PropertyDescription
    waterline.searchPath
    This value identifies the path on the Waterline server for executing a search. Do not modify this value.
    waterline.enabled
    Set this value true to enable the integration.
    waterline.catalogHost
    Set this value to the URL of the Waterline deployment.
  5. Save your changes. 

Testing Waterline browsing integration

  1. Restart services. See Start and Stop the Platform.
  2. When the platform has restarted, login. 
  3. Click Library. Then, click Import Data.
  4. In the Import Data page, the Waterline connection should appear in the left nav bar. Select it. 
  5. Enter your search term, which can be a filename or description of the dataset.
  6. Browse and select your dataset. 
  7. From the Gear menu in Waterline, select Wrangle.
  8. The dataset is imported into the platform where it can be added to flows. See Import Data Page

Try running a simple job. For more information, see Verify Operations.

This page has no comments.