About the Snowflake Native App

The Tonic Textual Snowflake Native App uses the same models and algorithms as the Tonic Textual API, but runs natively in Snowflake.

You use the app to redact or parse your text data directly within your Snowflake workflows. The text never leaves your data warehouse.

App package containers

The app package runs natively in Snowflake and uses Snowpark Container Services.

It includes the following containers:

  • Detection service, which detects the sensitive entity values.

  • Redaction service, which replaces the sensitive entity values.

Redaction workflow

For the redaction workflow, you use the app to detect and replace sensitive values in text.

  1. You call TEXTUAL_REDACT to send the redaction request. The call passes the following to the redaction service:

    • The text to redact

    • Optional configuration

  2. The redaction service forwards the text to the detection service.

  3. The detection service uses a series of NER models to identify and categorize sensitive words and phrases in the text.

  4. The detection service returns its results to the redaction service.

  5. The redaction service uses the results to replace the sensitive words and phrases with redacted or synthesized versions.

  6. The app returns the redacted text to the caller.
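The hand-off between the two services can be sketched in a few lines of Python. This is only a toy model of the steps above, not the app's implementation: regular expressions stand in for the NER models, and the entity labels, the "Off" configuration value, and the bracketed placeholder format are all assumptions made for illustration.

```python
import re
from dataclasses import dataclass


@dataclass
class Entity:
    start: int
    end: int
    label: str
    text: str


# Toy stand-in for the NER models (assumption): two regular expressions.
_PATTERNS = {
    "EMAIL_ADDRESS": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
    "PHONE_NUMBER": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
}


def detect(text):
    """Detection service: find and categorize sensitive spans (step 3)."""
    entities = []
    for label, pattern in _PATTERNS.items():
        for match in pattern.finditer(text):
            entities.append(Entity(match.start(), match.end(), label, match.group()))
    # Return the results sorted by position (step 4).
    return sorted(entities, key=lambda e: e.start)


def redact(text, config=None):
    """Redaction service: receive text plus optional config (step 1),
    forward the text to detection (step 2), then replace the values (step 5)."""
    config = config or {}
    entities = detect(text)
    pieces, cursor = [], 0
    for entity in entities:
        if entity.start < cursor:
            continue  # skip spans that overlap one already replaced
        if config.get(entity.label) == "Off":
            continue  # hypothetical option: leave this entity type as-is
        pieces.append(text[cursor:entity.start])
        pieces.append(f"[{entity.label}]")
        cursor = entity.end
    pieces.append(text[cursor:])
    # The redacted text goes back to the caller (step 6).
    return "".join(pieces)


print(redact("Contact jane@example.com or 555-123-4567 today"))
```

In this sketch the caller only ever talks to `redact`, which mirrors how TEXTUAL_REDACT is the single entry point while the detection service stays internal.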

Parsing workflow

For the parsing workflow, you use the app to parse files that are in a Snowflake internal or external stage.

  1. You call TEXTUAL_PARSE to send the parse request. The request includes:

    • The fully qualified stage name where the files are located

    • The name of the file, or a variable that identifies the list of files

    • The MD5 sum of the file

  2. The app uses a series of NER models to identify and categorize sensitive words and phrases in the text.

  3. The app converts the content to a markdown format.

  4. The app includes the markdown content in a JSON output that also contains metadata about the parsed text. You can use the metadata to build RAG systems and LLM datasets.

  5. The app stores the results of the parse request, including the output, in the TEXTUAL_RESULTS table.
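As a rough illustration of the parse request in step 1, the following Python sketch assembles a SQL statement from the three request values. The argument order simply mirrors the order in which the values are listed above and is an assumption, not a confirmed signature. The helper names, the stage name, the file name, and the MD5 value are hypothetical placeholders.

```python
def sql_literal(value: str) -> str:
    """Quote a value as a SQL string literal, doubling embedded single quotes."""
    return "'" + value.replace("'", "''") + "'"


def build_parse_call(stage: str, file_name: str, md5_sum: str) -> str:
    """Build a TEXTUAL_PARSE statement.

    Assumed argument order: fully qualified stage name, file name, MD5 sum.
    """
    args = ", ".join(sql_literal(v) for v in (stage, file_name, md5_sum))
    return f"SELECT TEXTUAL_PARSE({args});"


# Placeholder values for illustration only.
statement = build_parse_call(
    "MY_DB.MY_SCHEMA.MY_STAGE",
    "report.pdf",
    "5d41402abc4b2a76b9719d911017c592",
)
print(statement)
```

You would run the resulting statement in a Snowflake session where the app is installed, and then read the outcome from the TEXTUAL_RESULTS table.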
