Page tree

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 2 Next »

Trifacta Dataprep



Contents:

   

When publishing to BigQuery, please complete the following steps to configure the table and settings to apply to the publish action.

Steps:

  1. Select location: Navigate the BigQuery browser to select the database and table to which to publish.
    1. To create a new table, click Create a new table.
  2. Select table options:
    1. Table name:

      NOTE: BigQuery does not support destinations with a dot (.) in the name.

      1. New table: Enter a name for it. You may use a pre-existing table name, and schema checks are performed against it.
      2. Existing table: You cannot modify the name.
    2. Output database: To change the database to which you are publishing, click the BigQuery icon in the sidebar. Select a different database.
    3. Publish actions: Select one of the following.
      1. Create new table every run: Each run generates a new table with a timestamp appended to the name.
      2. Append to this table every run: Each run adds any new results to the end of the table.
      3. Truncate the table every run: With each run, all data in the table is truncated and replaced with any new results.
      4. Drop the table every run: With each run, the table is dropped (deleted), and all data is deleted. A new table with the same name is created, and any new results are added to it.

      5. Merge the table every run: This publishing option merges the rows in your results with any existing rows in the target BigQuery table. For more information, see Merge Table Operations below.

  3. To save the publishing action, click Add or Update.

Merge Table Operations

The publishing option to merge table with every run allows you to update existing rows of data in the target table with corresponding values from your results (merge) and optionally to insert any rows in your results into the table.

Steps:

  1. In the Table Settings panel, select Merge the table every run.
  2. Choose columns for matching rows: Select one or more columns whose values determine if a row in your source results matches a row in the target. When these key values match, the following columns are updated.
    1. If the matching columns have duplicate rows in the target table, all rows in the target are updated.
    2. If the matching columns have duplicate rows in the source, the job fails.
  3. Choose columns to update: Select one or more columns whose values are updated from your source results when values from the previous set of columns match. These are the columns that are merged into the table.

    Tip: If All Columns is selected, all columns other than the matching columns are updated on a match. All columns continue to be updated even if the schema changes, and the matching columns remain in the schema.

  4. Insert records if keys don't match:
    1. When selected, rows in your source that do not have a matching set of values in key columns are inserted into the table as new rows.
    2. When deselected, these unmatched rows are not written to the target table.

  • No labels

This page has no comments.