-
Type: New Feature
-
Status: Resolved
-
Priority: Critical
-
Resolution: Done
-
Affects Version/s: None
-
Fix Version/s: None
-
Component/s: AI Nuxeo Services
-
Tags:
-
Team:AI
-
Sprint:AI 5 full - Deduplication, AI5 full - Deduplication, AI 5 S6 - Deduplication, AI 5 S7 - Deduplication
-
Story Points:5
Iterate over all documents pairs that are similar. Will apply an operation to each tuple of similar documents. The operation can be a custom one defined by the user.
This will require a custom scroller . The scroller gets lists of similars from a REST API(getSimilarTuples)
Acceptance criteria:
- provide an operation to trigger the Stream
- provide a default operation
- Consumes:
- list of documents
- xpath[s] as a context value
- distance
- source document
- Runs logging on provided results
- Consumes:
- provide Nuxeo stream solution that is able to resolve the documents and call provided operation
- Initial computation scroll through Insight API to get tuples sending source documents further down the stream
- Is referenced in