The  can be configured to access data stored in relational database sources over JDBC protocol. When this connection method is used, individual database tables and views can be imported as datasets. 

Supported Relational Databases

The  can natively connect to these relational database platforms. Natively supported versions are the following:

  • Oracle 12.1.0.2
  • SQL Server 12.0.4
  • PostgreSQL 9.3.10
  • Teradata 14.10+

    NOTE: To enable Teradata connections, you must download and install Teradata drivers first. For more information, see Enable Teradata Connections.

Additional relational connections can be enabled and configured for the platform. For more information, see Connection Types.

Ports

For any relational source to which you are connecting, the  must be able to access it through the specified host and port value.

Please contact your database administrator for the host and port information.

Enable

This feature is enabled automatically. 

NOTE: Disabling this feature hides existing relational connections.

Disable relational publishing

By default, relational connections are read/write, which means that users can create connections that enable writing back to source databases.

  • When this feature is enabled, writeback is enabled for all natively supported relational connection types. See Connection Types.
  • Depending on the connection type, the writes its data to different field types in the target database. For more information, see Type Conversions.
  • Some limitations apply to relational writeback. See Limitations below.

As needed, you can disable this feature.

Steps:

  1. Locate the following parameter and set it to false:

    "webapp.connectivity.relationalWriteback.enabled": true,
  2. Save changes and restart the platform.

Publishing through relational connections is disabled.

Limitations

NOTE: Unless otherwise noted, authentication to a relational connection requires basic authentication (username/password) credentials.

Limitations on relational publishing:

When the relational publishing feature is enabled, it is automatically enabled for all platform-native connection types. You cannot disable relational publishing for Oracle, SQL Server, PostgreSQL, or Teradata connection types. Before you enable, please verify that all user accounts accessing databases of these types have appropriate permissions.

NOTE: Writing back to the database utilizes the same user credentials and therefore permissions as reading from it. Please verify that the users who are creating read/write relational connections have appropriate access.

 

Execution at scale

Jobs for large-scale relational sources can be executed on the Spark running environment. After the data source has been imported and wrangled, no additional configuration is required to execute at scale.

NOTE: End-to-end performance is likely to be impacted by:

  • streaming data volumes over 1 TB from the source,
  • streaming from multiple concurrent sources,
  • overall network bandwidth.

When the job is completed, any temporary files are automatically removed from HDFS. 

For more information, see Run Job Page.

Password Encryption Key File

Relational database passwords are encrypted using key files: