Selecting the training data for your models
Before you start training models, on the Model data setup page, you select the training data to use.

About training data
The training data is a much larger set of files than the test data, and can include hundreds of files or more. The data should ideally contain at least 1,000 values for the entity type. For example, for an entity type to identify health conditions, you might use 5 medical appointment reports in your test data, but several hundred medical reports for your training data.
Similar to the test files, the training files should be relatively small - no more than 5,000 words.
For training data, there is no option to paste in text. Training data files are either uploaded from a local file system or selected from a cloud storage solution.
If you selected the test data from a cloud storage solution, then you must use the same cloud storage solution for the training data. For example, if you selected the test data from Amazon S3, then you must select the training data from Amazon S3.
You can add files to the training data at any time. New files are only used for models that are trained after the files are added.
Uploading files from a local file system
If there are no training files, then on the Model data setup page, click Upload files, then search for and select the files to upload.
To add more uploaded files to the training data:
Click Add training data.

Click Upload files.
Search for and select the files to add to the training data.
Selecting files from cloud storage
If there are no training files, then on the Model data setup page, click the cloud storage solution to use, then select the files to add.
If the test data came from a cloud storage solution, then you must use the same cloud storage option for the training data.
To add more cloud storage files to the training data:
Click Add training data.

Select the cloud storage solution.
Select the files to add to the training data.
For training data, you can select entire folders. Textual then adds all of the files in the folder.
Displaying the content of a training data file
On the Model data setup page, to display the content of an uploaded file, click the file name.
Training data file statuses
Each training data file goes through the following statuses:
Queued for upload - The file is not yet uploaded.
Uploading - Textual is uploading the file.
Ready - The file is uploaded and is used for subsequent model training.
Model training cannot start until all of the currently uploaded files are Ready.
Last updated
Was this helpful?
