You can integrate the platform with the Waterline data catalog to simplify finding datasets within your enterprise data lake. The Waterline integration supports the following methods:

  1. Read directly from Waterline through a search box integrated into the Import Data page. 
  2. Locate assets through Waterline and open them in the platform.

Waterline Data is a data catalog service for Hive. For more information, see www.waterlinedata.com.

Limitations of Waterline Integration

NOTE: This integration is not supported in the .

Pre-requisites

Enable Waterline Integration

Steps:

NOTE: Although the integration appears as a connection in the application, the connection cannot be created through the GUI or through the CLI. Instead, complete the following steps.

  1. Log in to the platform as an administrator.
  2. From the menu, select Settings menu > Admin Settings.
  3. Search for waterline.
  4. Update the values for the following properties accordingly:

     

     Property                Description
     waterline.searchPath    This value identifies the path on the Waterline server for executing a search. Do not modify this value.
     waterline.enabled       Set this value to true to enable the integration.
     waterline.catalogHost   Set this value to the URL of the Waterline deployment.
  5. Save your changes. 
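
Before entering the values in Admin Settings, it can help to sanity-check them. The following is a minimal sketch only: the helper name check_waterline_settings and the example URL are hypothetical, and the script does not read from or write to the platform. It checks that waterline.enabled is true and that waterline.catalogHost looks like a full URL.

```python
from urllib.parse import urlparse

# Hypothetical values for illustration; substitute those for your deployment.
settings = {
    "waterline.enabled": True,
    "waterline.catalogHost": "https://waterline.example.com",
}


def check_waterline_settings(settings):
    """Return a list of problems found in the proposed property values."""
    problems = []

    # The integration is only active when this property is set to true.
    if settings.get("waterline.enabled") is not True:
        problems.append("waterline.enabled must be true to enable the integration")

    # The catalog host is expected to be a URL with a scheme and a host name.
    parsed = urlparse(settings.get("waterline.catalogHost", ""))
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        problems.append("waterline.catalogHost must be a full URL")

    return problems


print(check_waterline_settings(settings))
```

An empty list means both values pass these basic checks. waterline.searchPath is deliberately not checked, since that value should not be modified.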

Testing Waterline browsing integration

  1. Restart services. See Start and Stop the Platform.
  2. When the platform has restarted, log in.
  3. Click Datasets. Then, click Import Data.
  4. In the Import Data page, the Waterline connection should appear in the left navigation bar. Select it.
  5. Enter your search term, which can be a filename or description of the dataset.
  6. Browse and select your dataset. 
  7. From the Gear menu in Waterline, select Wrangle.
  8. The dataset is imported into the platform, where it can be added to flows. See Import Data Page.

Try running a simple job. For more information, see Verify Operations.