Tracking and managing file processing

When you add files to a local files dataset, or change the file selection for a cloud storage dataset, Tonic Textual automatically scans the files to identify the entities that they contain.

When you change the dataset configuration, Textual also prompts you to run a new scan. For example, a new scan is required when you:

  • Configure added and excluded values

  • Change the available custom entity types

The file list reflects the current scanning status for the file. A file is initially queued for scanning. When the scan starts, the status changes to scanning. When Textual finishes processing a file, it marks the file as scanned.

As Textual processes each file, it updates the results in the dataset details heading and the entity types list.

Pausing the file processing

If needed, you can pause the file processing. To pause the processing, click Pause.

The information in the heading and entity types list only reflect the files that are scanned.

For a cloud storage dataset, when you generate output, Textual only includes files that are scanned.

Starting a scan on a paused file

Required dataset permission: Start a scan of dataset files

After you pause the scan, you can start a scan on individual files.

To start a scan on a file:

  1. Click the options menu for the file.

  2. Click Scan.

Downloading logs for files that fail to process

Required dataset permission: Start a scan of dataset files

When Textual is unable to process a file, it displays an error for that file.

To download log files for the failed file:

  1. Click the options menu for the file.

  2. Click Download Logs.

Download Logs option for a file that Textual cannot process

Last updated

Was this helpful?