Textual architecture
Last updated
Was this helpful?
Last updated
Was this helpful?
The following diagram shows how data and requests flow within the Tonic Textual application:
The Textual application database is a PostgreSQL database that stores the dataset configuration.
If you do not configure an S3 bucket, then it also stores uploaded files and files that you use the SDK to redact.
You can configure an S3 bucket to store uploaded files and individual files that you use the SDK to redact. For more information, go to Setting the S3 bucket for file uploads and redactions.
If you do not configure an S3 bucket, then the files are stored in the Textual application database.
Runs the Textual user interface.
A textual instance can have multiple workers.
The worker orchestrates jobs. A job is a longer running task such as the redaction of a single file.
If you redact a large number of files, you might deploy additional workers and machine learning containers to increase the number of files that you can process concurrently.
A textual installation can have 1 or more machine learning containers.
The machine learning container hosts the Textual models. It takes text from the worker or web server and returns any entities that it discovers.
Additional machine learning containers can increase the number of words per second that Textual can process.
The OCR service converts PDFs and images to text that Textual can then scan for sensitive values.
For more information, go to Enabling PDF and image processing.
Textual only uses the LLM service for .