The Tonic Textual Snowflake Native App uses the same models and algorithms as the Tonic Textual API, but runs natively in Snowflake.
You use the app to redact or parse your text data directly within your Snowflake workflows. The text never leaves your data warehouse.
The app package runs natively in Snowflake and uses Snowpark Container Services.
It includes the following containers:
Detection service, which detects the sensitive entity values.
Redaction service, which replaces the sensitive entity values.
For the redaction workflow, you use the app to detect and replace sensitive values in text.
You use TEXTUAL_REDACT to send the redaction request.
When you call TEXTUAL_REDACT, it passes to the redaction service:
The text to redact
Optional configuration
The redaction service forwards the text to the detection service.
The detection service uses a series of NER models to identify and categorize sensitive words and phrases in the text.
The detection service returns its results to the redaction service.
The redaction service uses the results to replace the sensitive words and phrases with redacted or synthesized versions.
The redacted text is returned to the user.
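The redaction flow above can be sketched in Python as a toy model. This is only an illustration of how the two services divide the work, not the app's implementation: regular expressions stand in for the NER models, and the function names, entity labels, placeholder format, and the shape of the optional configuration (a mapping from entity label to "Off") are all assumptions made for this example.

```python
import re
from dataclasses import dataclass


@dataclass
class Entity:
    start: int
    end: int
    label: str
    text: str


# Hypothetical stand-in for the detection service. The real service uses
# NER models; these regular expressions only imitate its output.
PATTERNS = {
    "EMAIL_ADDRESS": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "PHONE_NUMBER": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
}


def detect(text):
    """Detection step: find and categorize sensitive values."""
    entities = []
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            entities.append(Entity(match.start(), match.end(), label, match.group()))
    return sorted(entities, key=lambda e: e.start)


def redact(text, config=None):
    """Redaction step: forward the text to detection, then replace the results.

    `config` is an assumed shape: a dict that maps an entity label to "Off"
    to leave that entity type unchanged.
    """
    config = config or {}
    pieces = []
    cursor = 0
    for entity in detect(text):
        if entity.start < cursor:
            continue  # skip a match that overlaps one already handled
        pieces.append(text[cursor:entity.start])
        if config.get(entity.label) == "Off":
            pieces.append(entity.text)
        else:
            pieces.append(f"[{entity.label}]")
        cursor = entity.end
    pieces.append(text[cursor:])
    return "".join(pieces)


sample = "Email jane@example.com or call 555-123-4567 today."
print(redact(sample))
print(redact(sample, {"PHONE_NUMBER": "Off"}))
```

In this sketch, the caller only ever talks to `redact`, which mirrors how TEXTUAL_REDACT sends the request to the redaction service and the detection service stays behind it.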
For the parsing workflow, you use the app to parse files that are in a Snowflake internal or external stage.
You call TEXTUAL_PARSE to send the parse request. The request includes:
The fully qualified stage name where the files are located
The name of the file, or a variable that identifies the list of files
The MD5 sum of the file
The app uses a series of NER models to identify and categorize sensitive words and phrases in the text.
The app converts the content to a markdown format.
The markdown content is part of the JSON output, which also includes metadata about the parsed text. You can use the metadata to build RAG systems and LLM datasets.
The app stores the results of the parse request, including the output, in the TEXTUAL_RESULTS table.