Uploaded image for project: 'Nuxeo Platform'
  1. Nuxeo Platform
  2. NXP-27047

Improve resilience of Nuxeo in case of infrastructure failures, namely ES erratic response times

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Won't Fix
    • Affects Version/s: 9.10
    • Fix Version/s: None
    • Component/s: Audit, Elasticsearch
    • Environment:
      Drive <-> Nuxeo <-> ES (audit)

      Description

      When the audit is stored in ElasticSearch, if ES exhibits out of a sudden very large response times (1000+ times larger than usual), this can break the Nuxeo internal audit pipeline. Restoring the audit pipeline requires a Nuxeo restart.

      The request is to fix this behavior so that the pipeline can restore on its own without requiring a full Nuxeo node restart.

      The fact that Nuxeo audit can stop working has consequences e.g. with Drive clients synchronization mechanism, which can be stopped as a consequence, making this a very visible failure.

        Attachments

          Issue Links

            Activity

              People

              • Votes:
                1 Vote for this issue
                Watchers:
                6 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved:

                  PagerDuty

                  Error rendering 'com.pagerduty.jira-server-plugin:PagerDuty'. Please contact your Jira administrators.