Displaying a dataset file preview
From the file list, to display the preview, either:
Click the options menu, then click Preview.
Options menu for a dataset file with the Preview option File preview for a redacted file
For a dataset that generates output files of the same type as the original file:
On the left, the preview displays the original data. The detected entity values are highlighted.
On the right, the preview displays the data with replacement values that are based on the dataset configuration for the detected entity types.
File preview with the original and redacted and synthesized text data Preview for PDF and image files
For a PDF or image file, for entity types that use the Redact handling option, the value is covered by a black box.
File preview for a redacted PDF file The preview for a PDF file also reflects any manual overrides.
Selecting entity type handling options from the preview
You can use the preview to select the entity type handling option for each entity type. The options are:
Redaction - This is the default value. Textual replaces the value with the name of the entity type followed by a unique identifier.
For example, the first name John is replaced with NAME_GIVEN_12345. Note that the identifier is only visible in the downloaded file. It does not display on the preview.
Synthesize - Textual replaces the value with a realistic generated value.
For example, the first name John is replaced with the first name Michael. The replacement values are consistent, which means that a given value always has the same replacement. For example, Michael is always the replacement value for John.
Ignore - Textual ignores the value and copies it as is to the output file.
To select the entity type handling option:
In the results panel, click a detected value.
On the panel, click the entity type handling option. Textual applies the same option to all entity values of that type.
Selecting an entity type handling option From the preview, you can only select the entity type handling option. For the Synthesis option, you cannot configure synthesis options for an entity type. You must configure those options from the dataset details page. For more information, go to Configuring entity type synthesis options.
Ignoring specific instances in PDF files
From the PDF preview, you can also choose to ignore a specific value.
To configure whether to ignore a specific detected value:
In the results panel, click the value.
On the panel, to ignore the value, toggle Ignore to the on position.
Panel with the option to ignore a PDF value File preview for a JSON output file
For a dataset that generates JSON output:
On the left is the original content. For files other than .txt files, you can toggle between generated Markdown and the rendered file.
On the right are the results.
File preview for a text file in a JSON output dataset File preview for a PDF file in a JSON output dataset Viewing the output JSON for the file
The JSON view contains the content of the JSON output file.
For details about the JSON output structure for the different types of files, go to Structure of JSON output files.
JSON view on a file preview for a JSON output dataset Tables view - Tables in a PDF or image file
For a PDF or image file that contains one or more tables, the Tables view displays the tables.
To display Tables view, select it from the view dropdown list.
If the file does not contain any tables, then the Tables view option is not available.
Key-Values view - Key-value pairs in a PDF or image file
For a PDF or image file that contains key-value pairs, the Key-Values view displays the key-value pairs.
To display Key-Values view, select it from the view dropdown list.
If the file does not contain key-value pairs, then the Key-Values view option is not available.