Data locations

A data location is a database schema into which successive model editions will be deployed.

Data location types

The data location type is selected when the data location is created, and cannot be changed afterward.

There are two types of data locations:

  • The development data location, which supports deploying open or closed model editions. This type of data location is suitable for testing models in development and quality assurance environments.

  • The production data location, which supports deploying only closed model editions. This type of data location is suitable for deploying data hubs in production environments.

Be cautious when choosing the data location type, as it will determine the type of deployment operations that can be done. It is recommended to use only production data locations for production and user-acceptance testing environments.

Data location contents

A data location contains the hub data, which are stored in the schema that is accessed using the data location’s datasource. This schema contains database tables and other objects generated from the model edition.

Three types of jobs (stored in the repository) can be carried out within a data location:

  • Installation jobs: the jobs for creating or modifying, in a non-destructive way, the data structures in the schema.

  • Integration jobs: the jobs for certifying data in these data structures, according to the model job definitions.

  • Purge jobs: the jobs for purging the logs, data history, and deployment history according to the retention policies.

In addition to the deployed model editions, the jobs, and their execution logs, the data locations also contain the configuration of:

  • The continuous loads, used by integration specialists to push data into the data location in a continuous way.

  • The job notifications policies, sent under certain conditions when an integration job is completed for administration, monitoring, or integration automation purposes.

  • The data notifications, used to propagate data from the data hub to downstream systems.

  • The data purge schedule, to reduce the data location storage volume by pruning the history of data changes and job logs.