Adding and removing dataset files
Supported file types for datasets
Tonic Textual can process the following types of files:
txt
csv
tsv
docx
xlsx
pdf
png
tif or tiff
jpg or jpeg
On a self-hosted instance, you can configure an S3 bucket where Textual stores the files. This is the same S3 bucket that is used for uploaded file pipelines. For more information, go to Setting the S3 bucket for file uploads and redactions. For an example of an IAM role with the required permissions, go to Example IAM role for file uploads and redactions.
Adding files to the dataset
From the dataset details page, to add files to the dataset:
In the panel at the top left, click Upload Files.
Search for and select the files.
Tonic Textual uploads and then processes the files.
Do not leave the page while files are uploading. If you leave the page before the upload is complete, then the upload stops.
You can leave the page while Textual is processing the file.
Removing files from the dataset
To remove a file from the dataset, you can use the option in the dataset file list or on the file manager.
From the dataset file list
From the file list on the dataset details page, to remove a file from the dataset:
Click the options menu for the file.
In the options menu, click Delete.
From the file manager
From the file manager, to remove a file from the dataset:
Click the options menu for the file.
In the options menu, click Delete File.
Last updated