Nuxeo Platform / NXP-31539

Fix fulltext extraction for blobs fetched from a document dynamic facet

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 10.10
    • Fix Version/s: 10.10-HF71, 2023.0, 2021.32
    • Component/s: Core
    • Release Notes Summary:
      BlobsExtractor now also extracts blob paths added by dynamic facets
    • Backlog priority:
      900
    • Sprint:
      nxplatform #79

      Description

      Steps to reproduce:

      1. define a "custom" schema with a blob property
        <xs:element name="pdfprop" type="nxs:content" />
        
      2. define a "customFacet" facet
        <extension target="org.nuxeo.ecm.core.schema.TypeService" point="doctype">
          <facet name="customFacet">
            <schema name="custom" />
          </facet>
        </extension>
        
      3. dynamically add this facet to a document and set the blob property with a Blob value
      4. run a fulltext search for a word contained in the Blob value
      5. observe that the newly created document is not returned among the results

      BlobsExtractor#getBlobPaths seems to read only the document's static schemas, so the blob property from the dynamically attached facet is missed. As a result, FulltextExtractorWork does not extract the text from this blob, and the blob is not fulltext indexed.
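
The suspected cause can be illustrated with a small, self-contained model. This is only a sketch and not the Nuxeo implementation: the class `BlobPathSketch`, its methods, and the two lookup maps are hypothetical, and the `custom:pdfprop` path assumes the schema name is also used as its prefix. It shows how collecting blob paths from the type's schemas alone misses a blob property that comes from a facet added at runtime.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical model of the problem; none of these names come from Nuxeo.
class BlobPathSketch {

    // Schema name -> blob properties declared in that schema (assumed data).
    static final Map<String, List<String>> BLOB_PROPERTIES = Map.of(
            "file", List.of("content"),
            "custom", List.of("pdfprop"));

    // Facet name -> schemas carried by that facet (mirrors the repro steps).
    static final Map<String, List<String>> FACET_SCHEMAS = Map.of(
            "customFacet", List.of("custom"));

    // Builds "schema:property" paths for every blob property of the given schemas.
    static List<String> blobPaths(Set<String> schemas) {
        List<String> paths = new ArrayList<>();
        for (String schema : schemas) {
            for (String property : BLOB_PROPERTIES.getOrDefault(schema, List.of())) {
                paths.add(schema + ":" + property);
            }
        }
        return paths;
    }

    // Reported behavior: only the schemas of the document type are inspected.
    static List<String> staticOnly(List<String> typeSchemas) {
        return blobPaths(new LinkedHashSet<>(typeSchemas));
    }

    // Expected behavior: schemas of facets attached to the instance are included.
    static List<String> withFacets(List<String> typeSchemas, List<String> facets) {
        Set<String> schemas = new LinkedHashSet<>(typeSchemas);
        for (String facet : facets) {
            schemas.addAll(FACET_SCHEMAS.getOrDefault(facet, List.of()));
        }
        return blobPaths(schemas);
    }

    public static void main(String[] args) {
        List<String> typeSchemas = List.of("file");
        List<String> facets = List.of("customFacet");
        System.out.println(staticOnly(typeSchemas));         // [file:content]
        System.out.println(withFacets(typeSchemas, facets)); // [file:content, custom:pdfprop]
    }
}
```

In this model the blob at `custom:pdfprop` is only reachable through the second method, which is the kind of lookup the expected behavior below asks for.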

      Expected behavior: blobs from a dynamically attached facet are recognized as blob paths and are fulltext extracted and indexed.
