- Getting started
- Balance
- Clusters
- Concept drift
- Coverage
- Datasets
- General fields (previously entities)
- Labels (predictions, confidence levels, hierarchy, etc.)
- Models
- Streams
- Model Rating
- Projects
- Precision
- Recall
- Reviewed and unreviewed messages
- Sources
- Taxonomies
- Training
- True and false positive and negative predictions
- Validation
- Messages
- Administration
- Manage sources and datasets
- Understanding the data structure and permissions
- Create a data source in the GUI
- Uploading a CSV file into a source
- Create a new dataset
- Multilingual sources and datasets
- Enabling sentiment on a dataset
- Amend a dataset's settings
- Delete messages via the UI
- Delete a dataset
- Delete a source
- Export a dataset
- Using Exchange Integrations
- Preparing data for .CSV upload
- Model training and maintenance
- Understanding labels, general fields and metadata
- Label hierarchy and best practice
- Defining your taxonomy objectives
- Analytics vs. automation use cases
- Turning your objectives into labels
- Building your taxonomy structure
- Taxonomy design best practice
- Importing your taxonomy
- Overview of the model training process
- Generative Annotation (NEW)
- Dastaset status
- Model training and annotating best practice
- Training with label sentiment analysis enabled
- Train
- Introduction to Refine
- Precision and recall explained
- Precision and recall
- How does Validation work?
- Understanding and improving model performance
- Why might a label have low average precision?
- Training using Check label and Missed label
- Training using Teach label (Refine)
- Training using Search (Refine)
- Understanding and increasing coverage
- Improving Balance and using Rebalance
- When to stop training your model
- Using general fields
- Generative extraction
- Using analytics and monitoring
- Automations and Communications Mining
- Licensing information
- FAQs and more
Training using Teach Label (Explore)
User permissions required: ‘View Sources’ AND ‘Review and annotate’.
Introduction to using 'Teach Label'
'Teach' is the second step in the Explore phase and its purpose is to show predictions for a label where the model is most confused if it applies or not. Like previous steps, we need to confirm if the prediction is correct or incorrect, and by doing so provide the model strong training signals. It is the most important label-specific training mode.
Key steps
- Select Teach from the top-left dropdown menu as shown
- Select the label you wish to train - the default selection in Teach mode is to show unreviewed messages
- You will be presented with a selection of messages where the model is most confused as to whether the selected label applied
or not - review the predictions and apply the label if they are correct, or apply other labels if they are incorrect
- Predictions will range outwards from ~50% for data with no sentiment and 66% for data with sentiment enabled
- Remember to apply all other labels that apply as well as the specific label you are focusing on
You should use this training mode as required to boost the number of training examples for each label to above 25, whereby the platform can then accurately estimate the performance of the label.
The number of examples required for each label to perform well will depend on a number of factors. In the 'Refine' phase we cover how to understand and improve the performance of each label.
The platform will regularly recommend using 'Teach Label' as a means of improving the performance of specific labels by providing more varied training examples that it can use to identify other instances in your dataset where the label should apply.
What do we do when there are insufficient 'Teach' examples?
We may find after Discover and Shuffle that some labels still have very few examples, and where ‘Teach Label’ mode doesn’t surface useful training examples. In this case, we suggest to use the following training modes to provide the platform with more examples to learn from:
Option 1 - 'Search'
Searching for terms or phrases in Explore works the same as searching in Discover. One of two key differences is that in Explore you must review and annotate search results individually, rather than in bulk. You can search in Explore by simply typing in your search term in the search box at the top left of the page.
However, too much Search can biasyour model which is something we want to avoid. Add no more than 10 examples per label in this training mode to avoid annotating bias. It's also important to allow the platform time to retrain before going back to ‘Teach’ mode.
For more information on how to use Search in Explore, click here.
Option 2 - 'Label'
Although training using 'Label' is not one of the main steps outlined in the Explore phase, it can still be useful in this phase of training. In Label mode, the platform shows you messages where that label is predicted in descending order of confidence (i.e. with the most confident predictions first and least confident at the bottom).
However, it's only useful to review predictions that are not high-confidence (90%+). This is because when the model is very confident (i.e. above 90%), then by confirming the prediction you are not telling the model any new information, it's already confident that the label applies. Look for less confident examples further down the page if needed. Although, if predictions have high confidences and are wrong, then it's important to apply the correct label(s), thereby rejecting the incorrect prediction(s).
Useful tips
- If for a label there are multiple different ways of saying the same thing (e.g. A, B or C), make sure that you give the platform training examples for each way of saying it. If you give it 30 examples of A, and only a few of B and C, the model will struggle to pick up future examples of B or C for that label.
- Adding a new label to a mature taxonomy may mean it’s not been applied to previously reviewed messages. This then requires going back and teaching the model on new labels, using the 'Missed label' function – see here for how.