Configuring a Sharepoint pipeline

Required pipeline permissions:

  • Edit pipeline settings

  • Manage the pipeline file list

For a Sharepoint pipeline, the settings include:

  • Azure credentials

  • Output location

  • Whether to also generate redacted versions of the original files

  • Selected files and folders

Pipeline settings for a Sharepoint pipeline

Changing the Sharepoint credentials for a pipeline

When you create a pipeline that uses files from Sharepoint, you are prompted to provide the credentials to use to connect to the Entra ID application.

The credentials must have the following application permissions (not delegated permissions):

  • Files.Read.All - To see the Sharepoint files

  • Files.ReadWrite.All -To write redacted files and metadata back to Sharepoint

  • Sites.ReadWrite.All: To view and modify the Sharepoint sites

From the Pipeline Settings page, to change the credentials:

Sharepoint Credentials fields for a Sharepoint pipeline
  1. Click Update Sharepoint Credentials.

Update Sharepoint Credentials fields for a Sharepoint pipeline
  1. In the Tenant ID field, provide the Sharepoint tenant identifier for the Sharepoint site.

  2. In the Client ID field, provide the client identifier for the Sharepoint site.

  3. In the Client Secret field, provide the secret to use to connect to the Sharepoint site.

  4. To test the connection, click Test Connection.

  5. To save the new credentials, click Update Sharepoint Credentials.

Selecting a location for the output files

On the Pipeline Settings page, under Select Output Location, click the edit icon, then navigate to and select the folder in Sharepoint where Textual writes the output files.

Configuring the output location for a Sharepoint pipeline

When you run a pipeline, Textual creates a folder in the output location. The folder name is the pipeline job identifier.

Within the job folder, Textual recreates the folder structure for the original files. It then creates the JSON output for each file. The name of the JSON file is <original filename>_<original extension>_parsed.json.

If the pipeline is also configured to generate redacted versions of the files, then Textual writes the redacted version of each file to the same location.

For example, for the original file Transaction1.txt, the output for a pipeline run contains:

  • Transaction1_txt_parsed.json

  • Transaction1.txt

Indicating whether to also redact the files

By default, when you run a Sharepoint pipeline, Textual only generates the JSON output.

To also generate versions of the original files that redact or synthesize the detected entity values, toggle Synthesize Files to the on position.

Synthesize Files option for a Sharepoint pipeline

For information on how to configure the file generation, go to Configuring file synthesis for a pipeline.

Filtering files in selected folders by file type

One option for selected folders is to filter the processed files based on the file extension. For example, in a selected folder, you might only want to process .txt and .csv files.

Under File Processing Settings, select the file extensions to include. To add a file type, select it from the dropdown list. To remove a file type, click its delete icon.

File Processing Settings to select file formats for a Sharepoint pipeline

Note that this filter does not apply to individually selected files. Textual always processes those files regardless of file type.

Selecting files and folders to process

Under Select files and folders to add to run, navigate to and select the folders and individual files to process.

File selection for a Sharepoint pipeline

To add a folder or file to the pipeline, check its checkbox.

When you check a folder checkbox, Textual adds it to the Prefix Patterns list. It processes all of the applicable files in the folder, based on whether the file type is a type that Textual supports and whether it is included in the file type filter.

When you click the folder name, it displays the folder contents.

When you select an individual file, Textual adds it to the Selected Files list.

To delete a file or folder, either:

  • In the navigation pane, uncheck the checkbox.

  • In the Prefix Patterns or Selected Files list, click its delete icon.

Last updated

Was this helpful?