Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Excerpt

This section provides overview information on the key data and metadata that should be managed by your enterprise backup and recovery policies. 

Info

NOTE: This section covers how to perform a basic cold backup of the product. Hot backups are not supported.


Warning

All backups should be performed in accordance with your enterprise's backup and recovery policies.

Stop All Services

Before you begin, the 

D s platform
 and databases should be stopped. See Start and Stop the Platform.

Backup Platform Files

The following directories on the 

D s node
 should be backed up on a regular basis:

Configuration

You can back up all key configuration files into the /tmp directory using the following commands:

Code Block
cp /opt/trifacta/conf/trifacta-conf.json /tmp/trifacta-conf.json
cp /opt/trifacta/conf/env.sh /tmp/env.sh
cp /etc/init.d/trifacta /tmp/trifacta.service
cp /opt/trifacta/conf/data-service/application.properties /tmp/application.properties
cp -R /opt/trifacta/services/data-service/build/conf/vendor /tmp/vendor

License

You should backup your license key:

Code Block
cp /opt/trifacta/license/license.json /tmp/license.json

See License Key.

Backup Databases

The 

D s platform
 utilizes the following databases as part of normal operations. These databases should be backed up on a regular basis:

Database NameDatabaseIdDescription
Main DB

trifacta

Stores users and metadata for flows, including datasets, and recipes.
Jobs DBtrifacta-activitiStores and maintains job execution status and details.
Scheduling DBtrifactaschedulingserviceStores metadata for scheduled jobs.
Time-based Trigger DBtrifactatimebasedtriggerserviceAdditional database required for scheduled jobs.
Configuration Service DBtrifactaconfigurationserviceStores configuration settings for the workspace.
Artifact Storage Service DBtrifactaartifactstorageservice

Stores feature usage data such value mappings for the standardization feature.

Job Metadata Service DBtrifactajobmetadataserviceStores metadata on job execution.
Authorization Service DBtrifactaauthorizationserviceStorage of object permissions.
Orchestration Service DBtrifactaorchestrationserviceStorage of plans, triggers, tasks, and snapshots.
Optimizer Service DBtrifactaoptimizerserviceStorage of SQL queries for optimization during job execution.

For more information on setting up these databases, see Install Databases.

Location of backup and recovery tools

PostgreSQL

Depending on your operating system, you can find the backup tools in the following location.CentOS/RHEL 7 - PostgreSQL 12.3:

Info

NOTE: These locations apply to PostgreSQL 12.3, which is supported in this release on CentOS/RHEL 7 only.

Code Block
/usr/pgsql-12/bin/pg_dump /usr/pgsql-12/bin/psql

9.6.


CentOS/RHEL:

info

NOTE: These locations apply to PostgreSQL 9.6.

Code Block
/usr/pgsql-9.6/bin/pg_dump
/usr/pgsql-9.6/bin/psql


 Ubuntu: 

Code Block
/usr/lib/postgresql/9.6/bin/pg_dump
/usr/lib/postgresql/9.6/bin/psql
MySQL

Please locate the following programs in your MySQL distribution:

Code Block
mysqldump
mysql

Backup commands

The following commands can be used to back up the databases.

PostgreSQL

For more information on command options, see https://www.postgresql.org/docs/9.6/static/backup.html.

Info

NOTE: These commands must be executed as the trifacta user.


Info

NOTE: The following commands are for PostgreSQL 129.3 6 for all supported operating systems. For specific commands for other versions, please see the database documentation.


Tip

Tip: You may see performance improvements by backing up and restoring using .TAR files. However, there is a risk that .TAR support could change in the future. For more information, please see the PostgreSQL documentation.

Trifacta DB:

Info

NOTE: If you are providing a dump of the trifacta database to

D s support
, please include a dump of the trifactaauthorizationservice database, as well.


Code Block
pg_dump trifacta > trif_triDB_bkp_<date>.sql

Jobs DB:

Code Block
pg_dump trifacta-activiti > trif_actDB_bkp_<date>.sql

Scheduling DB:

Code Block
pg_dump trifactaschedulingservice > trif_schDB_bkup_<date>.sql

Time-Based Trigger DB:

Code Block
pg_dump trifactatimebasedtriggerservice > trif_tbtsDB_bkup_<date>.sql

Configuration Service DB:

Code Block
pg_dump trifactaconfigurationservice > trif_confservDB_bkup_<date>.sql

Artifact Storage DB:

Code Block
pg_dump trifactaartifactstorageservice > trif_artifactstorageservDB_bkup_<date>.sql

Job Metadata Service DB:

Code Block
pg_dump trifactajobmetadataservice > trif_jobmetadataservDB_bkup_<date>.sql

Authorization Service DB:

Code Block
pg_dump trifactaauthorizationservice > trif_authorizationservDB_bkup_<date>.sql

Orchestration Service DB:

Code Block
pg_dump trifactaorchestrationservice > trif_orchestrationservDB_bkup_<date>.sql

Optimizer Service DB:

Code Block
pg_dump trifactaoptimizerservice > trif_optimizerservDB_bkup_<date>.sql
MySQL

For more information on command options, see https://dev.mysql.com/doc/refman/5.7/en/mysqldump-sql-format.html.

Code Block
su - mysql


Info

NOTE: The following commands are for MySQL 5.7 for all supported operating systems. For specific commands for other versions, please see the database documentation.


Trifacta DB:

Info

NOTE: If you are providing a dump of the trifacta database to

D s support
, please include a dump of the trifactaauthorizationservice database, as well.


Code Block
mysqldump trifacta > trif_triDB_bkp_<date>.sql

Jobs DB:

Code Block
mysqldump trifacta-activiti > trif_actDB_bkp_<date>.sql

Scheduling DB:

Code Block
mysqldump trifactaschedulingservice > trif_schDB_bkup_<date>.sql

Time-Based Trigger DB:

Code Block
mysqldump trifactatimebasedtriggerservice > trif_tbtsDB_bkup_<date>.sql

Configuration Service DB:

Code Block
mysqldump trifactaconfigurationservice > trif_confservDB_bkup_<date>.sql

Artifact Storage DB:

Code Block
mysqldump trifactaartifactstorageservice > trif_artifactstorageservDB_bkup_<date>.sql

Job Metadata Service DB:

Code Block
mysqldump trifactajobmetadataservice > trif_jobmetadataservDB_bkup_<date>.sql

Authorization Service DB:

Code Block
mysqldump trifactaauthorizationservice > trif_authorizationservDB_bkup_<date>.sql

Orchestration Service DB:

Code Block
mysqldump trifactaorchestrationservice > trif_orchestrationservDB_bkup_<date>.sql

Optimizer Service DB:

Code Block
mysqldump trifactaoptimizerservice > trif_optimizerservDB_bkup_<date>.sql

Scheduling

You can schedule nightly execution of these backups using a third-party scheduler such as cron.

Restart

You can restart the 

D s platform
 now. See Start and Stop the Platform.

Recovery

See Platform Rollback.