Configuring processing and parallelism

The following environment variables control job and file processing.

Configuring the number of jobs to run concurrently

By default, each Tonic Textual worker can run eight jobs at the same time. For example, it can process up to eight files simultaneously.

The environment variable SOLAR_MAX_CONCURRENT_WORKER_JOBS controls the number of jobs to run concurrently.

The number of jobs that can run concurrently can affect the number of Textual workers that you need. The more jobs that can run concurrently, the fewer workers that are needed.

Configuring the size of the datetime generator cache

When it generates datetime values, to optimize the processing, Textual stores the redacted datetime values in a cache.

To change the cache size, configure the environment variable SOLAR_DATETIME_GENERATOR_CACHE_CAPACITY.

The default value is 100000, meaning that the cache contains 100,000 values.

Note that while increasing the size of the cache can speed up processing, it also uses more RAM.

Configuring the number of PDF pages to redact simultaneously

When Textual redacts PDF files so that a user can preview or download the output, the following environment variable determines the number of pages that it processes simultaneously:

SOLAR_PDF_PAGE_REDACTION_PARALLELISM

The default value is 4, meaning that Textual processes 4 pages at a time.

Configuring the number of PDF files to plan simultaneously

When Textual plans the redaction of PDF files for a user to preview or download, the following environment variable determines the number of files that it plans simultaneously.

SOLAR_PDF_DOC_PLAN_PARALLELISM

The default value is 3, meaning that Textual plans 3 PDF files at a time.

Configuring how often to purge cached PDF pages

When it redacts PDF files, Textual stores the redacted PDF pages in a cache.

The following environment variable determines how often Textual purges the cache of PDF pages.

PURGE_REDACTED_PAGES_IN_HOURS

The default value is 12, meaning that Textual purges the redacted PDF pages cache every 12 hours.

Last updated

Was this helpful?