Uploaded image for project: 'Nuxeo Platform'
  1. Nuxeo Platform
  2. NXP-31698

Prevent Base64-encoded images from being sent to Elasticsearch

    XMLWordPrintable

    Details

    • Release Notes Summary:
      Only text is indexed on HTML Note document
    • Backlog priority:
      750
    • Upgrade notes:
      Hide

      When using an HTML Note document, only the fulltext (extracted text) is submitted to elastic for indexation.

      Show
      When using an HTML Note document, only the fulltext (extracted text) is submitted to elastic for indexation.
    • Sprint:
      nxplatform #83, nxplatform #84
    • Story Points:
      5

      Description

      Currently, embedding the HTML text content of a Note document with an image results in its Base64-encoding being added to the document's note:note schema property - this field is always stored in Elasticsearch, resulting in a significant storage increase. This exposes a risk to the Elasticsearch cluster when multiple documents with embedded images are created.

      To avoid this, Base64 content should be filtered out before requests to the ES cluster are made for Document creation/indexing.

        Attachments

          Issue Links

            Activity

              People

              • Votes:
                0 Vote for this issue
                Watchers:
                4 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: