Uploaded image for project: 'Nuxeo Platform'
  1. Nuxeo Platform
  2. NXP-28696

Reduce the default number of metrics published by Graphite and Datadog

    XMLWordPrintable

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 11.1
    • Component/s: Monitoring
    • Upgrade notes:
      Hide

      Reported metrics can now be filtered using a contribution, see common-base/nxserver/config/metrics-config.xml for the default configuration.

      Show
      Reported metrics can now be filtered using a contribution, see common-base/nxserver/config/metrics-config.xml for the default configuration.
    • Team:
      PLATFORM
    • Sprint:
      nxplatform 11.1.31, nxplatform 11.1.32
    • Story Points:
      0

      Description

      In order to reduce the number of metrics, only useful metrics should be published to Graphite and Datadog.

      Today a timer in our code (dropwizzard metrics) reports 16 metrics:

      • min
      • max
      • mean
      • stddev: standard deviation
      • p50: percentile 50%
      • p75
      • p95
      • p98
      • p99
      • p999
      • m1_rate throughput moving average on one minute
      • m5_rate 5 minutes
      • m15_rate 15 minutes
      • mean_rate
      • count number of timer
      • sum total cumulative time

      We have timers on cache, directory, repository api, elasticsearch api, work manager, stream computation.

      So reducing metrics per timer is very effective.

      We could disable by default the following metrics and reduce to 9 metrics per timer  (43% reduction)

      • p95
      • p99
      • p999
      • m5_rate
      • m15_rate
      • mean_rate
      • sum

      Other metrics should also be disabled by default like nuxeo.ActionService timers that can be profiled in a dev environment and should not be a major performance problem or related to the production environment.

       

       

       

        Attachments

          Activity

            People

            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - 0 minutes
                0m
                Remaining:
                Remaining Estimate - 0 minutes
                0m
                Logged:
                Time Spent - 2 hours
                2h

                  PagerDuty

                  Error rendering 'com.pagerduty.jira-server-plugin:PagerDuty'. Please contact your Jira administrators.