Analysis of detected entity types

The Analytics page under Entities analysis displays a summary of the detected entity types. For each entity type, you can display the distribution across the dataset files.

Analytics page on the dataset details page

Selecting the value count option

On the Analytics page, you can choose how Textual determines the displayed value counts and entity types.

  • Match counts to redacted files - Displays value counts based on the output files. This view excludes entity types that are ignored. It also resolves entity values that match multiple types and entity values that share some text, so that each value is counted as a single type.

  • Show all detected entities - Displays the full detection value counts. For this view, the counts per entity type include all of the entities that Textual found during processing. This includes:

    • Values for ignored entity types

    • Entity values that match multiple entity types

    • Entity values that share some text
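To make the difference between the two options concrete, here is a minimal sketch in Python. It is not Textual code and does not use a Textual API. The detection tuples, the entity type labels, and the function names are hypothetical. The rule used to resolve overlapping values (keep the longest span, then the earliest) is only an assumption for illustration; the text above does not state how Textual resolves them.

```python
from collections import Counter

# Hypothetical detections for one file: (start, end, entity_type) character spans.
detections = [
    (0, 10, "NAME"),
    (0, 10, "ORGANIZATION"),  # same text, matches a second entity type
    (5, 18, "CITY"),          # shares some text with the spans above
    (30, 42, "PHONE"),
    (50, 60, "DATE"),         # this entity type is ignored
]
ignored_types = {"DATE"}


def count_all(detections):
    """'Show all detected entities': every detection counts toward its type."""
    return Counter(entity_type for _, _, entity_type in detections)


def count_matching_output(detections, ignored_types):
    """'Match counts to redacted files': skip ignored types, one type per value.

    Assumed resolution rule (illustration only): prefer longer spans, then
    earlier ones, and drop any span that overlaps a span that is already kept.
    """
    candidates = [d for d in detections if d[2] not in ignored_types]
    candidates.sort(key=lambda d: (-(d[1] - d[0]), d[0]))
    kept = []
    for start, end, entity_type in candidates:
        # Keep the span only if it does not overlap any kept span.
        if all(end <= k_start or start >= k_end for k_start, k_end, _ in kept):
            kept.append((start, end, entity_type))
    return Counter(entity_type for _, _, entity_type in kept)


all_counts = count_all(detections)
output_counts = count_matching_output(detections, ignored_types)
print(sum(all_counts.values()), sum(output_counts.values()))
```

In this sample, the full detection view counts all five detections, while the output-based view counts only two, because the ignored type is skipped and the three overlapping values collapse into one.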

Summary counts

The panels at the top of the page provide summary information for the detected entity values. The displayed values are based on the selected value count option.

Summary counts for the detected entity values

The summary information includes:

  • The number of detected entity values.

  • The number of detected entity types.

  • The percentage of detected values that are redacted.

  • The percentage of detected values that are synthesized.

Counts by entity type

The entity types list on the Analytics page displays a summary of the detected value counts for the detected entity types. The displayed entity types and counts are based on the selected value count option.

Entity types list on the Analytics page

For each entity type, the list includes:

  • The count of detected values

  • The percentage of detected values in the dataset that are of that type

By default, the entity types are listed in descending order based on the value count.

You can sort the list by entity type, count, or percentage. To sort by a column, click the column heading. To reverse the sort order, click the heading again.
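The count and percentage columns can be reproduced from any list of detected entity types. The following sketch assumes a hypothetical input (one entity type label per detected value) and a hypothetical helper named summarize_by_type. It illustrates the arithmetic and the default sort order only, not how Textual computes the list.

```python
from collections import Counter


def summarize_by_type(entity_types):
    """Return one row per entity type with its count and share of all values."""
    counts = Counter(entity_types)
    total = sum(counts.values())
    if total == 0:
        return []
    rows = [
        {
            "entity_type": entity_type,
            "count": count,
            "percentage": round(100 * count / total, 1),
        }
        for entity_type, count in counts.items()
    ]
    # Default order: descending by value count.
    rows.sort(key=lambda row: row["count"], reverse=True)
    return rows


# Hypothetical sample: one label per detected value.
detected = ["NAME"] * 5 + ["EMAIL"] * 3 + ["PHONE"] * 2
rows = summarize_by_type(detected)

# Re-sorting by another column, as clicking a column heading would.
rows_by_type = sorted(rows, key=lambda row: row["entity_type"])
```

With the sample above, the first row is NAME with a count of 5 and a percentage of 50.0.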

Displaying the top 10 file list for an entity type

When you click an entity type, Textual displays a panel that lists the 10 files that contain the most detected values for that entity type.

Summary of values per file for an entity type

From the panel, you can also change the handling option for the entity type.
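The top 10 file list amounts to a per-file count for one entity type, limited to the 10 largest counts. A minimal sketch, assuming hypothetical (file name, entity type) pairs and a hypothetical helper named top_files, neither of which comes from Textual:

```python
from collections import Counter


def top_files(detections, entity_type, limit=10):
    """Return up to `limit` (file_name, count) pairs for one entity type,
    ordered from the most detected values to the fewest."""
    per_file = Counter(
        file_name for file_name, detected_type in detections
        if detected_type == entity_type
    )
    return per_file.most_common(limit)


# Hypothetical sample: 12 files, where file N contains N NAME values.
detections = [
    (f"file_{n:02d}.txt", "NAME")
    for n in range(1, 13)
    for _ in range(n)
]
detections.append(("file_01.txt", "EMAIL"))  # other types are not counted

top = top_files(detections, "NAME")
```

Here `top` contains 10 entries, from file_12.txt with 12 values down to file_03.txt with 3 values. The two files with the fewest NAME values are left out.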
