Uploaded image for project: 'Nuxeo Platform'
  1. Nuxeo Platform
  2. NXP-19586

Drive: Implement Elasticsearch based batched remote scan

    XMLWordPrintable

    Details

    • Type: Task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 8.3
    • Component/s: Nuxeo Drive

      Description

      The VCS implementation of the batched remote scan, see NXP-19482 and DocumentBackedFolderItem#getScrollBatch relies on a "homemade" implementation of a scroll search and doesn't allow to sort the documents by path because it relies on a queryAndFecth core query.
      Yet on the Drive side we would rather have the documents sorted by path to easily build the remote tree information in the local storage (need the FileSystemItem path).

      That's why we now rely on the Elasticsearch scroll API exposed by the ElasticSearchService, see NXP-19631, in the case of a synchronization root and if Elasticsearch is enabled of course.
      See ESSyncRootFolderItem#getScrollBatch.
      This way we use the native ES scroll which allows to return the documents sorted by path.

      Note: we consider that when scrolling through the descendants of a synchronization root its sub-tree must be indexed. This is most likely the case when performing the initial remote scan or in the case of a newly registered folder.

        Attachments

          Issue Links

            Activity

              People

              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: