1 of 1

Configuring handling of .docx file components

Required dataset permission: Edit dataset settings

In .docx and .xslx files, as long as the URL entity type handling option is not set to Off, Textual automatically changes the destination of hyperlinks to google.com.

On the Dataset settings page, the Word Document Settings section contains settings to determine how to manage .docx images, tables, and comments.

To display the Dataset settings page, on the dataset details page, click Project settings.

Configuring how to handle .docx images

For .docx images, including .svg files, you can configure the dataset to either:

Redact the image content. When you select this option, Textual looks for and blocks out sensitive values in the image.
Ignore the image.
Replace the images with black boxes.

On the Dataset settings page, under Image settings for DOCX files:

To redact the image content, click Redact contents of images using OCR. This is the default selection.
To ignore the images entirely, click Ignore images during scan.
To replace the images with black boxes, click Replace images from the output file with black boxes.

Configuring how to handle .docx tables

For .docx tables, you can configure the dataset to either:

Redact the table content. When you select this option, Textual detects sensitive values and replaces them based on the entity type configuration.
Block out all of the table cells. When you select this option, Textual places a black box over each table cell.

On the Dataset settings page, under Table settings for DOCX files:

To redact the table content, click Redact content using the entity type configuration. This is the default selection.
To block out the table content, click Block out all table cell content.

Configuring how to handle .docx comments

For comments in a .docx file, you can configure the dataset to either:

Remove the comments from the file.
Ignore the comments and leave them in the file.

On the Dataset settings page, to remove the comments, toggle Remove comments from the output file to the on position. This is the default configuration.

To ignore the comments, toggle Remove comments from the output file to the off position.

Configuring handling of .docx file components

Required dataset permission: Edit dataset settings

In .docx and .xslx files, as long as the URL entity type handling option is not set to Off, Textual automatically changes the destination of hyperlinks to google.com.

On the Dataset settings page, the Word Document Settings section contains settings to determine how to manage .docx images, tables, and comments.

To display the Dataset settings page, on the dataset details page, click Project settings.

Configuring how to handle .docx images

For .docx images, including .svg files, you can configure the dataset to either:

Redact the image content. When you select this option, Textual looks for and blocks out sensitive values in the image.
Ignore the image.
Replace the images with black boxes.

On the Dataset settings page, under Image settings for DOCX files:

To redact the image content, click Redact contents of images using OCR. This is the default selection.
To ignore the images entirely, click Ignore images during scan.
To replace the images with black boxes, click Replace images from the output file with black boxes.

Configuring how to handle .docx tables

For .docx tables, you can configure the dataset to either:

Redact the table content. When you select this option, Textual detects sensitive values and replaces them based on the entity type configuration.
Block out all of the table cells. When you select this option, Textual places a black box over each table cell.

On the Dataset settings page, under Table settings for DOCX files:

To redact the table content, click Redact content using the entity type configuration. This is the default selection.
To block out the table content, click Block out all table cell content.

Configuring how to handle .docx comments

For comments in a .docx file, you can configure the dataset to either:

Remove the comments from the file.
Ignore the comments and leave them in the file.

On the Dataset settings page, to remove the comments, toggle Remove comments from the output file to the on position. This is the default configuration.

To ignore the comments, toggle Remove comments from the output file to the off position.

Configuring handling of .docx file components

hashtagConfiguring how to handle .docx images

hashtagConfiguring how to handle .docx tables

hashtagConfiguring how to handle .docx comments

Configuring handling of .docx file components

hashtagConfiguring how to handle .docx images

hashtagConfiguring how to handle .docx tables

hashtagConfiguring how to handle .docx comments

Configuring how to handle .docx images

Configuring how to handle .docx tables

Configuring how to handle .docx comments

Configuring how to handle .docx images

Configuring how to handle .docx tables

Configuring how to handle .docx comments