Scheduling data generation

Required workspace permission: Run data generation

Not available for Structural free trials, or for expired pay-as-you-go accounts.

You can also configure data generation to run on a regular schedule. For example, if new data is added regularly to your source database, then you might want to automatically run data generation jobs to transform the new data.

To schedule the job, you set up one or more configurations. Each configuration includes a cron expression to specify the schedule.

Creating and editing a job schedule

You manage the job schedule from Jobs view.

To create a schedule for a job, click Create Schedule.

To edit an existing job schedule, click Edit Schedule.

Set up the schedule configurations, then click Save.

Adding a configuration to a schedule

To add a configuration to the schedule:

Click Add Configuration.
By default, the configuration is active, and Active is in the on position. To have Structural ignore the configuration, toggle Active to the off position.
In the Cron Expression field, provide the schedule expression. The default value is 0 0 * * *, which means to run the job every day at midnight.
From the time zone dropdown list, select the time zone to use for the schedule.

Overview of the cron expression syntax

A cron expression is made up of five values separated by a space.

<minute> <hour> <day of month> <month> <day of week>

Where:

<minute>

The minute at which to run the job. Is a value between 0 and 59. Use * to run every minute. Use , to separate a list of values. For example, to run at 10 and 40 past the hour, use 10,40. Use - to provide a range of values. For example, to run every minute between 20 and 25 past the hour, use 20-25.

<hour>

The hour at which to run the job. Is a value between 0 and 23. Use * to run every hour. Use , to separate a list of values. For example, to run at 11:00 AM and 2:00 PM, use 11,14.

Use - to provide a range of values. For example, to run every hour between 5:00 PM and 9:00 PM, use 17-21.

<day of month>

The day of the month on which to run the job. Is a value between 1 and 31.

Use * to run every day.

Use , to separate a list of values. For example, to run on the 5th and the 15th of the month, use 5,15.

Use - to provide a range of values. For example, to run on the 8th through the 12th of the month, use 8-12.

<month>

The month in which to run the job. Can use either numbers (1 through 12) or 3-letter abbreviations (JAN through DEC).

Use * to run every month.

Use , to separate a list of values. For example, to run in March and September, use 3,9 or MAR,SEP.

Use - to provide a range of values. For example, to run every month between June and August, use 6-8 or JUN-AUG.

<day of week>

The day of the week on which to run the job. Can use either numbers (0 through 6) or 3-letter abbreviations (SUN through SAT).

Use * to run every day.

Use , to separate a list of values. For example, to run every Monday, Wednesday, and Friday, use 1,3,5 or MON,WED,FRI.

Use - to provide a range of values. For example, to run every week from Tuesday through Thursday, use 2-4 or TUE-THU.

Here are some example expressions:

15 * * * *

Every hour at 15 minutes past the hour.

15 10,14,18 * * *

Every day at 10:15 AM, 2:15 PM, and 6:15 PM.

0 0 15 * *

Midnight on the 15th of every month.

0 0 10-15 6 *

Midnight on June 10 through June 15th.

30 12 * * TUE

Every Tuesday at 12:30 PM.

Removing a configuration from the schedule

To remove a configuration from the schedule, click its Delete option.

If you remove all of the configurations and then save, then the job is no longer scheduled.

Scheduled jobs and other workspace configuration

When a job runs on a schedule, you cannot configure the options that are available from the Confirm Generation panel when you run data generation manually.

Subsetting

For a workspace that has subsetting configured, whether the scheduled job generates a subset is based on the Use Subsetting toggle on Subsetting view.

Upsert

For a workspace that has upsert enabled, whether a scheduled job completes both the initial data generation and the upsert process is determined by the workspace setting Automatically Start Upsert After Successful Data Generation.

If this is in the on position, then the scheduled job does both data generation and upsert.

If this is in the off position, then the scheduled job only does the data generation to the intermediate database.

Diagnostic logging

For a scheduled job, whether the job uses diagnostic logging is based on the environment setting configuration for the data connector.

Performance traces

A scheduled job does not generate performance traces.

Last updated 6 months ago

Was this helpful?