Uploaded image for project: 'Nuxeo Platform'
  1. Nuxeo Platform
  2. NXP-9331

Improve text converters to take into account paragraphs and headings

    XMLWordPrintable

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 5.5
    • Fix Version/s: 5.6-RC1, 5.6
    • Component/s: Convert

      Description

      For now the OOo2TextConverter doesn't take into account paragraphs and headings.
      Which means that the following content in an .odt document:

      This is a document with
      a new line and a paragraph.
      
      Paragraph 1 heading
      Content of the paragraph 1. Content of the paragraph 1. Content of the paragraph 1. Content of the paragraph 1.
      

      will be converted to the string:

      This is a document with a new line and a paragraph. Paragraph 1 heading Content of the paragraph 1. Content of the paragraph 1. Content of the paragraph 1. Content of the paragraph 1.
      

      The MSOffice2TextConverter does detect paragraphs but not headings, it could be nice to have an empty line before each hading to make the difference between a paragraph (= 1 new line) and a heading (= 2 new lines) when reading the converted text.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                ataillefer Antoine Taillefer
                Reporter:
                ataillefer Antoine Taillefer
                Participants:
              • Votes:
                0 Vote for this issue
                Watchers:
                0 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved:

                  Time Tracking

                  Estimated:
                  Original Estimate - 1 day Original Estimate - 1 day
                  1d
                  Remaining:
                  Remaining Estimate - 0 minutes
                  0m
                  Logged:
                  Time Spent - 1 day, 4 hours
                  1d 4h