Page tree

Trifacta Dataprep



Contents:

If you licensed Dataprep by Trifacta before Oct. 14, 2020, you are using the Dataprep by Trifacta Legacy product edition. On October 14, 2022, this product edition will be decommissioned by Google and will be no longer available for use. Current customers of this product edition are encouraged to transition to one of the product editions hosted by Trifacta. See Product Editions.

   

Contents:


Feature Availability: This feature is available in the following editions:

  • Dataprep by Trifacta® Enterprise Edition
  • Dataprep by Trifacta Professional Edition
  • Dataprep by Trifacta Premium

Select connection types may be made available through  Dataprep by Trifacta® prior to general availability and full support. These Early Preview connection types are intended to enable customers to get connected to their source data to begin building their data wrangling pipelines. The currently available Early Preview connection types are described below.

General Limitations

Early Preview connection types provide early access to and use of new connection types so that you may begin working with data in your connected datastores. Avoid deploying Early Preview connections into a production environment before testing in Dev/Testing environments. To send feedback, please contact Trifacta Support.

Unless explicitly stated below, each Early Preview connection type has the following limitations:

  • Cloud/SaaS only. Early Preview connections are not available for self-managed deployments.
  • Read-only. Writing back through the connection is not supported.
  • Authentication. Access to the datastore must be completed through the listed and supported method for the connection type. Depending on the connection, the method may be limited to one of the following:
    • Basic credentials (username/password)
    • APIKey
    • SecurityToken
    • OAuth 2.0
    • Methods of authentication not listed with the connection type are not supported.
  • Limited data type support. All imported data is mapped to the following basic data types:
    • String
    • Integer
    • Decimal
    • Boolean
    • Other data types, such Datetime, are not officially supported for Early Preview connections. They might be processed properly. 

Additional possible limitations include:

  • SSO is unlikely to be supported.
  • Connect String Options cannot be applied to data modeling.
  • Import uses a default schema, which may not show all of the available tables in the connected database.
  • Of the available tables, some tables may not show all columns due to limitations in the data source.
  • Performance through an Early Preview connection may be slower than for generally available connections.
  • Connector-specific functionality that may require additional engineering may not be supported through the Early Preview connection.

Support Policy

Early Preview connections are made available through the Trifacta application with the intention of making them generally available and fully supported at some point in the future.

Tip: Using Early Preview connections can assist Trifacta in prioritizing them for general availability.

These connections are supported with the following limitations:

  • These connections are supported only for importing datasets and generating samples.
  • The configuration, supported use cases, supported data types, and limitations are subject to change prior to making the connection generally available.
  • While it is the intention of  Trifacta to make all Early Preview connections generally available for our customers,  Trifacta reserves the right to remove the connection from availability at all or to change the connection at any time prior to general availability.
  • Bugs found in Early Preview connections may not be fixed before general availability.

Avoid using Early Preview connections in production environments without testing them first in Development/Staging environments.

Coming Soon connections:

Connections that are labeled Coming Soon in the Trifacta application may be made available in the future. If you are interested to getting early access, please contact  Trifacta Support.

Create Connection

Steps:

Please complete the following steps to create any Early Preview connection.

  1. Login to the Trifacta application.
  2. In the left nav bar, click the Connections icon. 
  3. In the Connections page, click Create.
  4. Select the appropriate card for the connection type. The cards for the connections below are marked: Early Preview
  5. Specify the connection according to the instructions provided below. 
  6. Verify the connection by creating an imported dataset.

For more information, see Connections Page.

Create Connection via API

Depending on your product edition, you may be able to create your connection via API.

To create an API connection, you must acquire the vendor, vendorName, and type information from the appropriate section below. 

"vendor": "<Vendor>",
"vendorName": "<VendorName>",
"type": "<typeIdentifier>"

Use Connection

This section contains basic information on using connections to various types of supported storage. 

Early Preview Connections

ItemDescription
LinkedIn Ads Connections LinkedIn Ads is a paid marketing tool that offers access to LinkedIn social networks through various sponsored posts and other methods. For more information, see https://business.linkedin.com/marketing-solutions.
Zendesk Connections Zendesk is a service-first CRM company that builds software designed to improve customer relationships. For more information, see https://www.zendesk.com/.
Instagram Ads Connections Instagram Ads is a method of paying to post sponsored content on the Instagram platform to reach a larger audience. For more information, see https://business.instagram.com/advertising?locale=en_GB.
Marketo Connections Marketo is a marketing automation platform that enables marketers to manage personalized multi-channel programs and campaigns to prospects and customers. For more information, see https://developers.marketo.com/.
Airtable Connections Airtable is an easy-to-use online platform for creating and sharing relational databases. For more information, see https://airtable.com/.
Apache Impala Connections Apache Impala is a MPP (Massively Parallel Processing) SQL query system for processing volumes of data stored in a Hadoop cluster. For more information, see https://impala.apache.org/.
Asana Connections Asana is a web and mobile application designed to help teams organize, track, and manage their work. For more information, see https://asana.com/.
Authorize.net Connections Authorize.Net  is a payment gateway service provider that allows merchants to accept credit cards, contactless payments, and eChecks through their website and over an IP connection. For more information, see https://www.authorize.net/.
Cassandra Connections Cassandra DB is a distributed system for deployment of a large number of nodes across multiple data centers. For more information, see https://cassandra.apache.org/.
CockroachDB Connections Cockroach DB is a cloud-native SQL database for building global and scalable cloud services that survive disasters. For more information, see https://www.cockroachlabs.com/.
DB2 Connections IBM DB2 connects the different applications in your enterprise to your mainframe. For more information, see https://www.ibm.com/products/db2-connect.
Exact Online Connections Exact Online is an online business software for business owners and accountants. The combination of accounting and CRM offers the perfect basis for any healthy business. For more information, see https://www.exact.com/us/software/exact-online.
Facebook Ads Connections Facebook Ads are paid messages that businesses place on Facebook. Ads appear in News Feed on desktop and mobile. For more information, see https://www.facebook.com/business.
Freshdesk Connections Freshdesk by Freshworks is a cloud-based customer engagement service for managing customer interactions and the sales pipeline. For more information, see https://www.freshworks.com/.
Google Ads Connections Google Ads is an online advertising platform that allows you to create online ads to reach audiences interested in the products and services that an advertiser offers. For more information, see https://ads.google.com/intl/en_in/home/.
Google Analytics Connections Google Analytics is a web analytics service offered by Google that tracks and reports website traffic. For more information, see https://analytics.google.com/analytics/web/provision/#/provision.
Google Data Catalog Connections Google Data Catalog is a fully managed and highly scalable data discovery and metadata management service. For more information, see https://cloud.google.com/data-catalog.
Google Spanner Connections Google Spanner is a fully managed relational database service that offers transactional consistency at global scale, schemas, and synchronous replication for high availability. For more information, see https://cloud.google.com/spanner.
Greenplum Connections Greenplum uses a high-performance architecture to distribute the load of multi-terabyte data warehouses and can leverage a system's resources in parallel to process a query. For more information, see https://greenplum.org/.
Hubspot Connections HubSpot is an inbound marketing and sales platform that offers marketing, sales, customer service, and CRM software. For more information, see https://www.hubspot.com/.
JIRA by Atlassian Connections Jira by Atlassian is a software application used for issue tracking and project management. For more information, see https://www.atlassian.com/software/jira.
Magento Connections Magento is an e-commerce platform that provides online merchants with a flexible shopping cart system and offers powerful marketing, search optimization, and catalog-management tools. For more information, see https://magento.com/.
Mailchimp Connections
Mailchimp is a marketing application that enables you to maintain contact management practices and powerful data analysis. For more information, see https://mailchimp.com/ .
MariaDb Connections MariaDb is a popular open source relational database. It has been integrated into various cloud offerings and is the default database for many Linux distributions. For more information, see https://mariadb.org/.
Microsoft Advertising Connections Microsoft Advertising  is a service that provides pay per click advertising on the Bing, Yahoo, and other search engines. For more information, see https://ads.microsoft.com/.
Microsoft Dynamics 365 Sales Connections Microsoft Dynamics 365 Sales is a set of interconnected, modular Software-as-a-Service (SaaS) applications and services designed to both transform and enable your core customers, employees, and business activities. For more information, see https://dynamics.microsoft.com/en-us/sales/overview/.
NetSuite Connections NetSuite is the leading integrated cloud business software suite, including business accounting, ERP, CRM, and e-commerce software.
Pinterest Connections Pinterest is a visual discovery engine for finding ideas like recipes, home, and style inspiration. For more information, see https://in.pinterest.com/business/hub/.
Presto Connections Presto is a high-performance, distributed SQL engine for running interactive analytic queries against data sources of all sizes ranging from Gigabytes to Petabytes. The architecture of Presto enables users to query various data sources such as Hadoop, MYSQL, MongoDB, and Teradata. For more information, see https://prestodb.io/.
QuickBase Connections Quickbase is an application development platform that unites business and IT teams by enabling problem solvers to work together to safely, securely, and sustainably create an ecosystem of applications. For more information, see https://www.quickbase.com/.
Quickbooks Online Connections QuickBooks Online is a cloud-based software that can be accessed anywhere you have an internet connection. This software includes access to all product and feature updates, automatic data backups, as well as the ability to restore company data from backups. For more information, see https://quickbooks.intuit.com/online/.
Redis Connections Redis (Remote Dictionary Server), is a fast, open-source, in-memory key-value data store for use as a database, cache, message broker, and queue. For more information, see https://redis.io/.
ServiceNow Connections ServiceNow provides technical management support, such as IT service management and help desk functionality, to the IT operations of large corporations. For more information, see https://www.servicenow.com/
Shopify Connections Shopify is an e-commerce platform that provides online retailers with a suite of services such as payments, marketing, shipping, and customer engagement tools.
Smartsheet Connections Smartsheet is a project management and collaboration tool served in a simple spreadsheet layout. For more information, see https://www.smartsheet.com/ .
Splunk Connections Splunk® is a software platform used for monitoring, searching, analyzing, and visualizing machine-generated data in real-time. For more information, see https://www.splunk.com/.
SurveyMonkey Connections SurveyMonkey is an online survey software that helps you to create and run professional online surveys. For more information, see https://www.surveymonkey.com/.
Trello Connections Trello  is a visual collaboration tool that enables you to organize and prioritize projects in a flexible and rewarding way. For more information, see https://trello.com/en-US.
Trino Connections Trino is a distributed SQL query engine designed to query large data sets distributed over one or more heterogeneous data sources. For more information, see https://trino.io/.
Xero Connections Xero is cloud-based accounting software for small businesses. It performs bookkeeping functions such as invoicing and payroll and allows you to connect to a live bank feed. For more information, see https://www.xero.com/ .
YouTube Analytics Connections YouTube Analytics enables you to measure the success of your YouTube marketing efforts. For more information, see https://developers.google.com/youtube.

This page has no comments.