[AICORE-333] Retrieving / annotating enrichment metadata about a doc/blob - Nuxeo Issue Tracker

XML

Word

Printable

Details

Type: Epic
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: AI Nuxeo Services

Tags:
- icebox
- nxAI

Description

Prerequisite: Data returned from the prediction service is stored in the main datastore using the defined enrichment format.

These annotations are visible on the document to the user, and a reduced version from the metadata stored after AI service calls. They can be created either automatically from a call to an external API, or manually by the user.

Possible kinds of annotations :

Classification : List of String (labels), representing classes this document belongs to
Tagging :

Object Recognition in Image with a label and a box associated
Entity Recognition in Text with a label and word index associated

Transcription : String with a transcription of an audio stream or an image

When needed, we should create simple interfaces for these annotations. For classification and transcription this should require minimal interface, Tagging might need a more complex interface with annotation in the image or the text file.

Requirements:

Existence of a flag that defines if the tag was created by an human or automatically from an API call

In case of an annotation with multiple labels, changing one value will make it manual, even if other values where not changed by a human

An annotation is related to a blob (text, image, etc). If the original document changes, the annotation value is void and should be recalculated/reannotated

Nice to have:

Maybe a reference to the metadata used to create it, if this was automatic
An historic, who changed and what changed

Decision point:

Text classification is to be done on extracted text. Annotation should be done on the same text, but strange !
Image classification is potentially done on a small version of the original. Annotation to be done on the image to be used for classifciation

Attachments

Issue Links

is related to

AICORE-319 Review bulk Enrichments

Resolved

Activity

People

Assignee:

Unassigned

Reporter:

Gethin James

Participants:

Gethin James

Votes:

0 Vote for this issue

Watchers:

1 Start watching this issue

Dates

Created:

2018-03-28 09:13

Updated:

2020-07-08 14:11