Reviewing the sensitivity detection results

Required dataset permission: View dataset settings

The dataset details page provides information about the results of the sensitivity detection, including the overall results, the results per file, and the results per entity type.

Viewing the summary results for the dataset

At the top of the dataset details page are the summary results for the dataset, including:

Summary results for a dataset
  • The number of detected entities

  • The percentage of dataset content that is sensitive

  • The number of entity types for which there are detected entities

  • The number of files in the dataset

  • The total number of words in the dataset files

Viewing the count of entities per file

For each dataset file, the Project files page displays:

Value and word counts for dataset files
  • The number of detected entity values in the file.

  • The total number of words in the file.

Viewing the list of entity types

The Entity settings page displays the list of active entity types for the dataset. This includes:

  • All of the built-in entity types

  • Any custom entity types that are active for the dataset

Entity settings page for a dataset

Information in the entity types list

For each entity type, the list includes:

  • The name of the entity type.

  • The number of detected values for that type in the dataset files.

  • The selected handling option.

Filtering the entity types list

You can filter the entity types list by:

  • Text in the type name or description.

  • Whether the entity type is built-in or custom.

  • Whether there are detected entities for the entity type.

  • The handling option for the entity type.

To filter by name or description, in the search field, begin to type text in the name or description. As you type, Textual filters the list to only include matching entity types.

Filtering the entity types list by name

To apply other filters, click Filter options, then select the filters to apply.

Filter options for dataset entity types

Viewing the detected entity values

The Entities catalog displays the list of detected entity values for the dataset. To display the Entities catalog, in the left menu on the dataset details page, click Entities catalog.

Entities catalog for a dataset

Information in the Entities catalog

The Entities catalog lists each instance of an entity value separately. For example, if the given name John is detected 2 times in one file and 3 times in another file, then the Entities catalog contains 5 entries for John.

For each value instance, the Entities catalog includes:

  • The entity value.

  • How the value appears in the output, based on the selected handling option for the value's entity type.

  • The entity type.

  • A confidence score to indicate how confident Textual is that the value is correctly detected and identified.

  • The entity value instance in its immediate context.

  • The name of the file that contains the value instance.
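The per-instance listing can be sketched in Python as follows. This is an illustration only, not the Textual API: the `EntityInstance` class, its field names, the entity type label, and the `[NAME]` output placeholder are all hypothetical stand-ins for the catalog columns listed above. It shows how 2 detections in one file and 3 in another produce 5 catalog entries.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class EntityInstance:
    """One row of a hypothetical Entities catalog (field names are assumptions)."""
    value: str         # the detected entity value
    output: str        # how the value appears in the output
    entity_type: str   # the entity type assigned to the value
    confidence: float  # confidence score for the detection
    context: str       # the value instance in its immediate context
    file_name: str     # the file that contains the value instance


def john(file_name: str) -> EntityInstance:
    # Illustrative data only; the output placeholder and score are made up.
    return EntityInstance("John", "[NAME]", "Given Name", 0.95,
                          "... John said ...", file_name)


# Two detections in the first file, three in the second.
catalog = [john("notes_a.txt")] * 2 + [john("notes_b.txt")] * 3

# Each instance is its own entry, so counts are per instance, not per unique value.
entries_per_value = Counter(entry.value for entry in catalog)
entries_per_file = Counter(entry.file_name for entry in catalog)

print(entries_per_value["John"])  # total catalog entries for John
print(dict(entries_per_file))     # entries for John, split by file
```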

Filtering the Entities catalog

Filtering by entity value

To filter the list by text in the entity value, in the search field, begin to type the text.

As you type, Textual filters the list to only include entity values that contain that text.

Entities catalog filtered by value text

Filtering by entity type

By default, the Entities catalog list includes all of the entity types. To filter the list to a specific entity type, click All types, then select the entity type. To remove the filter, select All types.

Filtering by file

By default, the Entities catalog list includes values from all of the files. To filter the list to only include values detected in a specific file, click All files, then select the file. To remove the filter, select All files.
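As a rough model of how the three filters combine, here is a minimal Python sketch. It is not Textual's implementation: the `filter_catalog` function, the row keys, and the sample entity type names are hypothetical, and the case-insensitive substring match is an assumption, because the behavior of the search field is only described as matching values that "contain that text". Passing `None` stands in for the All types and All files selections.

```python
def filter_catalog(rows, text="", entity_type=None, file_name=None):
    """Return the rows that pass every active filter.

    text        -- substring to look for in the entity value (assumed case-insensitive)
    entity_type -- a single entity type, or None for "All types"
    file_name   -- a single file, or None for "All files"
    """
    needle = text.lower()
    return [
        row for row in rows
        if needle in row["value"].lower()
        and (entity_type is None or row["entity_type"] == entity_type)
        and (file_name is None or row["file_name"] == file_name)
    ]


# Illustrative rows; the values, type names, and file names are made up.
rows = [
    {"value": "John", "entity_type": "Given Name", "file_name": "notes_a.txt"},
    {"value": "Johnson", "entity_type": "Family Name", "file_name": "notes_a.txt"},
    {"value": "John", "entity_type": "Given Name", "file_name": "notes_b.txt"},
]

by_text = filter_catalog(rows, text="john")
by_type = filter_catalog(rows, entity_type="Given Name")
by_text_and_file = filter_catalog(rows, text="john", file_name="notes_b.txt")
```

In this model the filters are cumulative: a row must satisfy the value text, the entity type, and the file filter at the same time.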

Sorting the Entities catalog

You can sort the Entities catalog by value, transformation, entity type, or confidence score.

To sort by a column, click the column heading.

To reverse the sort order, click the column heading again.
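The click-to-sort behavior can be modeled with a small Python sketch. The `SortState` class and the column keys are hypothetical, and the sketch assumes that the first click on a column sorts in ascending order, which is not stated here.

```python
class SortState:
    """Tracks which column is sorted and in which direction (hypothetical model)."""

    def __init__(self):
        self.column = None
        self.descending = False

    def click(self, column):
        if column == self.column:
            # A second click on the same heading reverses the order.
            self.descending = not self.descending
        else:
            # A click on a new heading starts ascending (an assumption).
            self.column = column
            self.descending = False

    def apply(self, rows):
        if self.column is None:
            return list(rows)
        return sorted(rows, key=lambda row: row[self.column],
                      reverse=self.descending)


# Illustrative rows; the values and scores are made up.
rows = [
    {"value": "John", "confidence": 0.95},
    {"value": "Alice", "confidence": 0.80},
    {"value": "Maria", "confidence": 0.99},
]

state = SortState()
state.click("confidence")
ascending = [row["value"] for row in state.apply(rows)]
state.click("confidence")
descending = [row["value"] for row in state.apply(rows)]
```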

How Textual handles entity values that match multiple types

A detected value might match multiple entity types.

For example, a telephone number might match both the Phone Number and Numeric Value entity types.

Each value is counted only once, for the entity type that it is assigned in the output file.

By default, a detected value is assigned the entity type that it most closely matches. In our example, the telephone number value most closely matches the Phone Number entity type, and so by default is included in the Phone Number count and values list.

If the entity type is turned off, or the value is excluded, then Textual moves the value to the next matching type.

In our example, if you set the handling option for Phone Number to Ignore, then the telephone number value is added to the count and values list for the Numeric Value entity type.
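The fallback rule can be illustrated with a short Python sketch. This is a simplified model, not Textual's detection logic: the function names and sample values are hypothetical, and it assumes that each value already comes with its matching entity types ranked from closest match to weakest. It covers only entity types that are turned off or ignored, not individually excluded values.

```python
from collections import Counter


def assign_entity_type(ranked_matches, inactive_types=frozenset()):
    """Return the closest matching type that is still active, or None."""
    for entity_type in ranked_matches:
        if entity_type not in inactive_types:
            return entity_type
    return None


def count_by_type(detections, inactive_types=frozenset()):
    """Count each value once, under the type that it is assigned."""
    counts = Counter()
    for ranked_matches in detections.values():
        assigned = assign_entity_type(ranked_matches, inactive_types)
        if assigned is not None:
            counts[assigned] += 1
    return counts


# Illustrative detections: value -> matching types, closest match first.
detections = {
    "555-0100": ["Phone Number", "Numeric Value"],
    "42": ["Numeric Value"],
}

default_counts = count_by_type(detections)
ignored_counts = count_by_type(detections, inactive_types={"Phone Number"})

print(dict(default_counts))  # the telephone number counts toward Phone Number
print(dict(ignored_counts))  # the telephone number falls back to Numeric Value
```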
