Scheduler Module

(2Q19)


eXist has a job scheduler based on Quartz, a full-featured, open source job scheduling system. This article will explain how to use the scheduler.

Scheduling Jobs

Jobs can be configured for execution by the scheduler in one of two ways:

System Start-up

Jobs may be statically scheduled by configuring them in the <scheduler> element of eXist-db's conf.xml configuration file. When eXist-db starts-up this configuration is read and the jobs will be scheduled with the scheduler. The configuration file contains (commented out) example jobs. An example of a scheduler entry:

<scheduler>
  <job type="user" class="com.example.MyJob" period="600000" delay="300000" repeat="10" unschedule-on-exception="false"/>
</scheduler>
Scheduler XQuery Module

The XQuery function module for initiating and managing scheduled jobs is not activated out-of-the-box. It has to be activated through the conf.xml file. See below

There is an XQuery library module, Scheduler Module, which provides functions for scheduling jobs and managing jobs scheduled for execution. The scheduler XQuery library module is activated by uncommenting the following in eXist-db's conf.xml configuration file:

<module class="org.exist.xquery.modules.scheduler.SchedulerModule" uri="http://exist-db.org/xquery/scheduler"/>

Once the scheduler XQuery library module is active, XQuery code can be written to invoke and manage user jobs (see Job Types).

The scheduler keeps a log file of its actions and issues that occur when executing jobs in logs/scheduler.log.

Job Types

The scheduler supports three different types of job, each of which has a distinct purpose:

Startup

Start-up jobs are executed only once during the database start-up. This execution occurs before the database becomes available for general use. Each start-up job is executed in-turn to completion.

User

User jobs are a general class of job, authored by a user of the system. They can be scheduled for execution either periodically or as a one-off. A user job can also be configured to execute concurrently with another instance of itself, should schedules overlap due to execution time, or be mutually exclusive.

System

System jobs are similar to user jobs, but require the database to be in a consistent state: no other concurrent operations are running and all in-memory buffers have been flushed to disk. These are typically used for internal tasks such as flushing the database Journal. In general users should avoid scheduling this type of job because all other database operations will be paused until the job finishes or raises an exception. Any exception will be caught and a warning is written to the scheduler log.

Authoring Jobs

The scheduler supports jobs that are authored in either XQuery or Java:

XQuery Jobs

Jobs coded in XQuery can be user type jobs only.

An XQuery job is a standard XQuery Main Module which is stored in the database. You configure the scheduling of the job by providing the database path to the XQuery, for example: /db/my-collection/my-job.xq.

XQuery job's are launched under the guest account. If you wish to perform tasks as another user , either switch permissions by calling xmldb:login() from within your job, or set the SetUid/SetGid bits on the XQuery file's permissions (see Security for more information).

Java Jobs

Jobs coded in Java can be startup, system or user type jobs.

A Java job is a class (available on eXist-db's classpath) which extends either org.exist.scheduler.UserJavaJob (for user type jobs) or org.exist.storage.SystemTask (startup or system type jobs). For example:

package com.example;

import java.util.Map;
import java.util.concurrent.atomic.AtomicInteger;

import org.exist.storage.BrokerPool;

public class MyJob extends UserJavaJob {

    private static final AtomicInteger COUNTER = new AtomicInteger();
    private String jobName = "MyJob";
    
	public void execute(final BrokerPool brokerpool, final Map<String, ?> params) throws JobException {

		System.out.println("****** My Job count=" + COUNTER.incrementAndGet());

	}

	public String getName() {
		return jobName;
	}

    public void setName(final String name) {
        this.jobName = name;
    }
}

Job Schedule

Job's may be scheduled using one of two mechanisms, a simple mechanism for periodic execution, or a more complex mechanism which uses the Cron syntax to offer greater flexibility.

Periodic Scheduling

For period scheduling enables you can specify for the job to run every n milliseconds. There are additional options to specify a delay before the first execution of the job, and to only repeat the execution of the job schedule a fixed number of times.

Cron Scheduling

Cron scheduling enables you to specify the execution schedule of a job using the more complicated but flexible Cron syntax.

This section was copied from the Quartz Cron Trigger Tutorial.

Cron Introduction

cron is a UNIX tool that has been around for a long time, so its scheduling capabilities are powerful and proven. The CronTrigger class is based on the scheduling capabilities of cron.

CronTrigger uses "cron expressions", which are able to create firing schedules such as: "At 8:00am every Monday through Friday" or "At 1:30am every last Friday of the month".

cron expressions are powerful, but can be confusing. This tutorial aims to take some of the mystery out of creating a cron expression, giving users a resource which they can visit before having to ask in a forum or mailing list.

Cron Format

A cron expression is a string comprised of 6 or 7 fields separated by white space. Fields can contain any of the allowed values, along with various combinations of the allowed special characters for that field. The fields are as follows:

Field Name

Mandatory?

Allowed Values

Allowed Special Characters

Seconds

YES

0-59

, - * /

Minutes

YES

0-59

, - * /

Hours

YES

0-23

, - * /

Day of month

YES

1-31

, - * ? / L W

Month

YES

1-12 or JAN-DEC

, - * /

Day of week

YES

1-7 or SUN-SAT

, - * ? / L #

Year

NO

empty, 1970-2099

, - * /

So cron expressions can be as simple as this: * * * * ? *

Or more complex, like this: 0 0/5 14,18,3-39,52 ? JAN,MAR,SEP MON-FRI 2002-2010

Cron special characters

  • * ("all values") - used to select all values within a field. For example, "*" in the minute field means "every minute".

  • ? ("no specific value") - useful when you need to specify something in one of the two fields in which the character is allowed, but not the other. For example, if I want my trigger to fire on a particular day of the month (say, the 10th), but don't care what day of the week that happens to be, I would put "10" in the day-of-month field, and "?" in the day-of-week field. See the examples below for clarification.

  • - - used to specify ranges. For example, "10-12" in the hour field means "the hours 10, 11 and 12".

  • , - used to specify additional values. For example, "MON,WED,FRI" in the day-of-week field means "the days Monday, Wednesday, and Friday".

  • / - used to specify increments. For example, "0/15" in the seconds field means "the seconds 0, 15, 30, and 45". And "5/15" in the seconds field means "the seconds 5, 20, 35, and 50". You can also specify '/' after the '' character - in this case '' is equivalent to having '0' before the '/'. '1/3' in the day-of-month field means "fire every 3 days starting on the first day of the month".

  • L ("last") - has different meaning in each of the two fields in which it is allowed. For example, the value "L" in the day-of-month field means "the last day of the month" - day 31 for January, day 28 for February on non-leap years. If used in the day-of-week field by itself, it simply means "7" or "SAT". But if used in the day-of-week field after another value, it means "the last xxx day of the month" - for example "6L" means "the last friday of the month". When using the 'L' option, it is important not to specify lists, or ranges of values, as you'll get confusing results.

  • W ("weekday") - used to specify the weekday (Monday-Friday) nearest the given day. As an example, if you were to specify "15W" as the value for the day-of-month field, the meaning is: "the nearest weekday to the 15th of the month". So if the 15th is a Saturday, the trigger will fire on Friday the 14th. If the 15th is a Sunday, the trigger will fire on Monday the 16th. If the 15th is a Tuesday, then it will fire on Tuesday the 15th. However if you specify "1W" as the value for day-of-month, and the 1st is a Saturday, the trigger will fire on Monday the 3rd, as it will not 'jump' over the boundary of a month's days. The 'W' character can only be specified when the day-of-month is a single day, not a range or list of days. The 'L' and 'W' characters can also be combined in the day-of-month field to yield 'LW', which translates to "last weekday of the month".

  • # - used to specify "the nth" XXX day of the month. For example, the value of "6#3" in the day-of-week field means "the third Friday of the month" (day 6 = Friday and "#3" = the 3rd one in the month). Other examples: "2#1" = the first Monday of the month and "4#5" = the fifth Wednesday of the month. Note that if you specify "#5" and there is not 5 of the given day-of-week in the month, then no firing will occur that month. The legal characters and the names of months and days of the week are not case sensitive. MON is the same as mon.

Cron examples

Here are some full examples:

Expression

Meaning

0 0 12 * * ?

Fire at 12pm (noon) every day

0 15 10 * * ?

Fire at 10:15am every day

0 15 10 * * ? *

Fire at 10:15am every day

0 15 10 * * ? 2005

Fire at 10:15am every day during the year 2005

0 * 14 * * ?

Fire every minute starting at 2pm and ending at 2:59pm, every day

0 0/5 14 * * ?

Fire every 5 minutes starting at 2pm and ending at 2:55pm, every day

0 0/5 14,18 * * ?

Fire every 5 minutes starting at 2pm and ending at 2:55pm, AND fire every 5 minutes starting at 6pm and ending at 6:55pm, every day

0 0-5 14 * * ?

Fire every minute starting at 2pm and ending at 2:05pm, every day

0 10,44 14 ? 3 WED

Fire at 2:10pm and at 2:44pm every Wednesday in the month of March.

0 15 10 ? * MON-FRI

Fire at 10:15am every Monday, Tuesday, Wednesday, Thursday and Friday

0 15 10 15 * ?

Fire at 10:15am on the 15th day of every month

0 15 10 L * ?

Fire at 10:15am on the last day of every month

0 15 10 ? * 6L

Fire at 10:15am on the last Friday of every month

0 15 10 ? * 6L

Fire at 10:15am on the last Friday of every month

0 15 10 ? * 6L 2002-2005

Fire at 10:15am on every last friday of every month during the years 2002, 2003, 2004 and 2005

0 15 10 ? * 6#3

Fire at 10:15am on the third Friday of every month

0 0 12 1/5 * ?

Fire at 12pm (noon) every 5 days every month, starting on the first day of the month.

0 11 11 11 11 ?

Fire every November 11th at 11:11am.

Pay attention to the effects of '?' and '*' in the day-of-week and day-of-month fields!

Cron additional notes

  • Support for specifying both a day-of-week and a day-of-month value is not complete (you must currently use the '?' character in one of these fields).

  • Be careful when setting fire times between mid-night and 1:00 AM - "daylight savings" can cause a skip or a repeat depending on whether the time moves back or jumps forward.