Reviewing the sensitivity detection results

Last updated 14 days ago

Required dataset permission: View dataset settings

When you first create a dataset, Tonic Textual displays a single list of all of the entity types that it can detect.

As you add and remove files, Textual updates the entity types list to show which entity types were detected and which were not.

Viewing the number of detected values

At the top of the dataset details view, the Sensitive words tile shows the total number of sensitive values in the dataset that Textual detected.

Viewing the detected entity types

As Textual processes files, it identifies which entity types are present in the files and which are not.

The entity type list starts with the detected entity types. For each detected entity type, Textual displays:

  • The number of detected values that are marked as this type in the output file. Excluded values are not included in the count.

  • The selected handling option.

  • Whether there are configured added or excluded values.

Previewing the detected values for an entity type

To view a sample of up to 10 of the detected values for a detected entity type, click the view icon next to the value count.

Displaying the list of detected values for an entity type

The entities list contains the full list of detected values for an entity type.

To display the entities list, from the value preview, click Open Entities Manager.

Selecting the entity type

When you display the entities list, the entity type whose values you previewed is selected by default.

To change the selected entity type, from the dropdown at the top left, select the entity type to view values for.

How Textual handles entity values that match multiple types

A detected value might match multiple entity types.

For example, a telephone number might match both the Phone Number and Numeric Value entity types.

Each value is counted only once, for the entity type that it is assigned in the output file.

By default, a detected value is assigned the entity type that it most closely matches. For our example, the telephone number value most closely matches the Phone Number entity type, and so by default is included in the Phone Number count and values list.

If the entity type is turned off, or the value is excluded, then Textual moves the value to the next matching type.

In our example, if you set the handling option for Phone Number to Off, then the telephone number value is added to the count and values list for the Numeric Value entity type.
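
The fallback behavior described above can be modeled in a few lines of plain Python. This is only an illustrative sketch, not Textual's implementation and not part of the Textual SDK. The function name `assign_entity_type`, the ranked list of candidate types, and the `off_types` and `excluded` arguments are all hypothetical.

```python
from collections import Counter


def assign_entity_type(value, ranked_types, off_types, excluded):
    """Return the first usable entity type for a value, or None.

    ranked_types: candidate types, closest match first (hypothetical input).
    off_types: entity types whose handling option is Off.
    excluded: set of (entity_type, value) pairs that are excluded.
    """
    for entity_type in ranked_types:
        if entity_type in off_types:
            continue  # type is turned off, so try the next matching type
        if (entity_type, value) in excluded:
            continue  # value is excluded for this type, so try the next one
        return entity_type
    return None  # no remaining type applies to this value


# Hypothetical detection result: one phone number that matches two types.
candidates = {"555-0100": ["Phone Number", "Numeric Value"]}

# Default behavior: the value is counted once, under its closest match.
default_counts = Counter(
    assign_entity_type(value, types, off_types=set(), excluded=set())
    for value, types in candidates.items()
)

# With Phone Number set to Off, the value moves to the next matching type.
phone_off_counts = Counter(
    assign_entity_type(value, types, off_types={"Phone Number"}, excluded=set())
    for value, types in candidates.items()
)
```

In this sketch, each value contributes to exactly one count, which mirrors the rule that a value is counted only for the entity type that it is assigned in the output file.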

Information in the entities list

The entities list groups the entities by the file and, if relevant, the page where they were detected.

For each value, the list includes:

  • The original value.

  • The original value in the context of its surrounding text.

  • The redacted or synthesized value in the context of its surrounding text, based on the selected handling option.
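
As a rough mental model of this layout, the following Python sketch groups entity rows by file and page. It is an assumption-laden illustration only: the `EntityRow` class, its field names, the sample file names, and the `[REDACTED]` placeholder are all hypothetical and do not reflect Textual's data model or actual output format.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass
class EntityRow:
    file_name: str
    page: Optional[int]    # None when pages are not relevant for the file
    original: str          # the original value
    original_context: str  # the original value with its surrounding text
    output_context: str    # the redacted or synthesized value in context


def group_rows(rows):
    """Group rows by (file, page), preserving the order within each group."""
    groups = defaultdict(list)
    for row in rows:
        groups[(row.file_name, row.page)].append(row)
    return dict(groups)


# Hypothetical sample rows for a single entity type.
rows = [
    EntityRow("report.pdf", 1, "555-0100",
              "Call 555-0100 today", "Call [REDACTED] today"),
    EntityRow("report.pdf", 2, "555-0101",
              "Fax: 555-0101", "Fax: [REDACTED]"),
    EntityRow("notes.txt", None, "555-0102",
              "Phone 555-0102", "Phone [REDACTED]"),
]

grouped = group_rows(rows)
```

Each key in `grouped` corresponds to one group heading in the list: a file, plus a page number when the file has pages.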

Viewing the list of entity types that were not detected

Below the list of detected entity types is the Entity types not found list, which contains the entity types that Textual did not detect in the files.

Filtering the entity types

You can filter the entity types list by text in the type name or description. The filter applies to both the detected and undetected entity types.

To filter the types, in the filter field, begin to type text that is in the entity type name or description.
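
The filter behavior can be approximated with a short Python sketch. This is not Textual code. The function `filter_entity_types`, the dictionary keys, and the sample descriptions are hypothetical, and the case-insensitive substring match is an assumption about how the filter field compares text.

```python
def filter_entity_types(entity_types, text):
    """Keep entity types whose name or description contains the text.

    Matching is case-insensitive here, which is an assumption.
    An empty filter keeps every entity type.
    """
    needle = text.strip().lower()
    return [
        entity_type
        for entity_type in entity_types
        if needle in entity_type["name"].lower()
        or needle in entity_type["description"].lower()
    ]


# Hypothetical entity types, covering both detected and undetected ones.
entity_types = [
    {"name": "Phone Number", "description": "A telephone number", "detected": True},
    {"name": "Numeric Value", "description": "A numeric value", "detected": True},
    {"name": "Email Address", "description": "An email address", "detected": False},
]

# The same filter runs over detected and undetected types alike.
num_matches = [t["name"] for t in filter_entity_types(entity_types, "num")]
address_matches = [t["name"] for t in filter_entity_types(entity_types, "address")]
```

Because the function does not look at the `detected` flag, a single filter string narrows both the detected list and the Entity types not found list, as described above.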

Sensitive words tile with the number of detected values
List of detected entity types in the dataset files
Sample of the detected values for an entity type
Entities list for an entity type
Entity types not found list of entity types that were not detected in the dataset files
Filtering the list of entity types