Using named entities recognition
With Kili, you can do named entity recognition (NER) for text and PDF type assets.
You can select entities:
- at character level
- at token level by double clicking on the token (text only)
- across phrases
- across paragraphs
- overlapping a previous selection
To simplify annotation, we process tokens to remove extra line returns/extra spaces at the end of the token. No need to post-process them.
Plain text annotation example:
data:image/s3,"s3://crabby-images/52175/5217501e0a87bb2e93b608244f8538da3e54135a" alt="Plain text annotation example"
Plain text annotation example
If you want to rapidly select all occurrences of an entity in the text, right click on the entity and select "Propagate".
data:image/s3,"s3://crabby-images/5b3b8/5b3b8a7be1cb6442b928f8c410475032635beb2a" alt="ner-einstein-propagate-menu.png 430"
The "propagate" option
Result:
data:image/s3,"s3://crabby-images/dfa52/dfa52842289e8461644be158f006c7ea19f19f4c" alt="Plain text annotation after using the "propagate" option"
Plain text annotation after using the "propagate" option
PDF annotation example:
data:image/s3,"s3://crabby-images/afcaf/afcafa6a2b5c92f2e8c399b7994eb36e35b4ef5d" alt="PDF annotation example"
PDF annotation example
- You can copy the contents of a text entity by double-clicking on it in the job viewer. You'll be able to paste it somewhere else, without the need for manual text input.
- When finished with Named Entity Recognition, you can perform another related task called Named Entities Relation. For details, refer to Object/entity relation jobs.
Updated about 1 year ago