Previewing the original and redacted data in a file

Required dataset permission: Preview redacted dataset files

You cannot preview TIF image files. You can preview PNG and JPG files.

Displaying a dataset file preview

From the file list, to display the preview, either:

  • Click the file name.

  • Click the options menu, then click Preview.

Options menu for a dataset file with the Preview option

About the dataset file preview

On the left, the preview displays the original data. The detected entity values are highlighted.

On the right, the preview displays the data with replacement values that are based on the dataset configuration for the detected entity types.

Format of redacted values

Note that in the preview, the redacted values do not include the identifier. They only include the entity type. For example, NAME_GIVEN instead of NAME_GIVEN_1d9w5. The identifiers are included when you download the files.

File preview with the original and redacted and synthesized text data

Preview for PDF and image files

For a PDF or image file, for entity types that use the Redact handling option:

  • If there is space to display the entity type, then it is displayed.

  • Otherwise, the value is covered by a black box.

File preview for a redacted PDF file

When you hover over a black box, the entity type displays in a tooltip:

Entity type tooltip for a redacted value in a PDF file

To view the entity type labels, you can also zoom into the file.

Zoomed in version of a PDF preview that displays entity types

The preview for a PDF file also includes any manual overrides.

File preview for a PDF file that has manual overrides

Selecting entity type handling options from the preview

For .txt, .csv, and .docx files, you can use the preview to select the entity type handling option for each entity type. The options are:

  • Redact - This is the default value. Textual replaces the value with the name of the entity type followed by a unique identifier. For example, the first name John is replaced with NAME_GIVEN_12345. Note that the identifier is only visible in the downloaded file. It does not display on the preview.

  • Synthesize - Textual replaces the value with a realistic generated value. For example, the first name John is replaced with the first name Michael. The replacement values are consistent, which means that a given value always has the same replacement. For example, Michael is always the replacement value for John.

  • Off - Textual ignores the value and copies it as is to the output file.

To select the entity type handling option:

  1. In the results panel, click a detected value.

  2. On the panel, click the entity type handling option. Textual applies the same option to all entity values of that type.

Selecting an entity type handling option

From the preview, you can only select the entity type handling option. For the Synthesize option, you cannot configure synthesis options for an entity type. You must configure those options from the dataset details page. For more information, go to Configuring synthesis options.

Displaying a pipeline PDF or image file preview

On the file details for a pipeline PDF or image file, on the Original tab:

Rendered and Redacted options for a pipeline PDF file
  • To display the original file content, click Rendered.

  • To display the version of the file with the replacement values, click Redacted <file type>.

Last updated

Was this helpful?