The can be configured to access data stored in relational database sources over JDBC protocol. When this connection method is used, individual database tables and views can be imported as datasets.
Supported Relational DatabasesThe
PortsFor any relational source to which you are connecting, the Please contact your database administrator for the host and port information. EnableThis feature is enabled automatically. To disable:
Disable relational publishingBy default, relational connections are read/write, which means that users can create connections that enable writing back to source databases.
As needed, you can disable this feature. Steps:
Publishing through relational connections is disabled. |
There are some differences in behavior between reading tables and views. See Using Databases.
Limitations on relational publishing:
When the relational publishing feature is enabled, it is automatically enabled for all platform-native connection types. You cannot disable relational publishing for Oracle, SQL Server, PostgreSQL, or Teradata connection types. Before you enable, please verify that all user accounts accessing databases of these types have appropriate permissions. |
NOTE: Writing back to the database utilizes the same user credentials and therefore permissions as reading from it. Please verify that the users who are creating read/write relational connections have appropriate access. |
Jobs for large-scale relational sources can be executed on the Spark running environment. After the data source has been imported and wrangled, no additional configuration is required to execute at scale.
NOTE: End-to-end performance is likely to be impacted by:
|
When the job is completed, any temporary files are automatically removed from HDFS.
For more information, see Run Job Page.
Relational database passwords are encrypted using key files:
For more information, see Configure Security for Relational Connections.
Configure at least one individual connection for any of the supported relational systems. You can configure more than one connection to the same relational system using different credentials. See Connection Types.
If you have enabled Kerberos on the Hadoop cluster, you can leverage the Kerberos global keytab to enable SSO connections to relational sources. See Enable SSO for Relational Connections.