Nuxeo Platform / NXP-29642

Fix Random PG Kafka StreamWorkManagerTest.testFatWork

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 11.3, 2021.0
    • Component/s: Tests
    • Team: PLATFORM
    • Sprint: nxplatform #18
    • Story Points: 3

      Description

      Regression
      
      Run unit tests / Run postgresql unit tests / org.nuxeo.ecm.core.work.StreamWorkManagerTest.testFatWork
      Failing for the past 1 build (since failed build #637)
      Took 1 min 10 sec.
      Error Message
      
      scheduled or running expected:<2> but was:<1>
      
      Stacktrace
      
      java.lang.AssertionError: scheduled or running expected:<2> but was:<1>
      	at org.junit.Assert.fail(Assert.java:89)
      	at org.junit.Assert.failNotEquals(Assert.java:835)
      	at org.junit.Assert.assertEquals(Assert.java:647)
      	at org.nuxeo.ecm.core.work.StreamWorkManagerTest.assertMetrics(StreamWorkManagerTest.java:67)
      	at org.nuxeo.ecm.core.work.AbstractWorkManagerTest$MetricsTracker.assertDiff(AbstractWorkManagerTest.java:207)
      	at org.nuxeo.ecm.core.work.AbstractWorkManagerTest.before(AbstractWorkManagerTest.java:244)
      

      output

      2020-09-11 23:17:37,297 [ForkJoinPool.commonPool-worker-7] WARN  [ComputationPool] work/SleepWork: Timeout on wait for pool termination
      2020-09-11 23:17:37,300 [work/SleepWorkPool-02] ERROR [KafkaConsumer] [Consumer clientId=nuxeo-test-1599866237778-work-SleepWork-19, groupId=nuxeo-test-1599866237778-work-SleepWork] Failed to close coordinator
      org.apache.kafka.common.errors.InterruptException: java.lang.InterruptedException
      	at org.apache.kafka.clients.consumer.internals.ConsumerNetworkClient.maybeThrowInterruptException(ConsumerNetworkClient.java:517) ~[kafka-clients-2.5.0.jar:?]
      	at org.apache.kafka.clients.consumer.internals.ConsumerNetworkClient.poll(ConsumerNetworkClient.java:278) ~[kafka-clients-2.5.0.jar:?]
      	at org.apache.kafka.clients.consumer.internals.ConsumerNetworkClient.poll(ConsumerNetworkClient.java:233) ~[kafka-clients-2.5.0.jar:?]
      	at org.apache.kafka.clients.consumer.internals.ConsumerNetworkClient.poll(ConsumerNetworkClient.java:224) ~[kafka-clients-2.5.0.jar:?]
      	at org.apache.kafka.clients.consumer.internals.ConsumerNetworkClient.awaitPendingRequests(ConsumerNetworkClient.java:335) ~[kafka-clients-2.5.0.jar:?]
      	at org.apache.kafka.clients.consumer.internals.AbstractCoordinator.close(AbstractCoordinator.java:959) ~[kafka-clients-2.5.0.jar:?]
      	at org.apache.kafka.clients.consumer.internals.ConsumerCoordinator.close(ConsumerCoordinator.java:872) ~[kafka-clients-2.5.0.jar:?]
      	at org.apache.kafka.clients.consumer.KafkaConsumer.close(KafkaConsumer.java:2343) [kafka-clients-2.5.0.jar:?]
      	at org.apache.kafka.clients.consumer.KafkaConsumer.close(KafkaConsumer.java:2310) [kafka-clients-2.5.0.jar:?]
      	at org.apache.kafka.clients.consumer.KafkaConsumer.close(KafkaConsumer.java:2260) [kafka-clients-2.5.0.jar:?]
      	at org.nuxeo.lib.stream.log.kafka.KafkaLogTailer.close(KafkaLogTailer.java:442) [nuxeo-stream-11.3.36.jar:?]
      	at org.nuxeo.lib.stream.computation.log.ComputationRunner.closeTailer(ComputationRunner.java:262) [nuxeo-stream-11.3.36.jar:?]
      	at org.nuxeo.lib.stream.computation.log.ComputationRunner.run(ComputationRunner.java:228) [nuxeo-stream-11.3.36.jar:?]
      	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) [?:?]
      	at java.util.concurrent.FutureTask.run(FutureTask.java:264) [?:?]
      	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) [?:?]
      	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) [?:?]
      	at java.lang.Thread.run(Thread.java:834) [?:?]
      Caused by: java.lang.InterruptedException
      	... 18 more
      

      The same error is logged multiple times, then:

      2020-09-11 23:17:37,300 [work/SleepWorkPool-00] ERROR [KafkaConsumer] [Consumer clientId=nuxeo-test-1599866237778-work-SleepWork-17, groupId=nuxeo-test-1599866237778-work-SleepWork] Failed to close coordinator
      ...
      2020-09-11 23:17:37,322 [main] ERROR [StreamWorkManager] Not able to stop worker pool within the timeout.
      2020-09-11 23:17:37,322 [main] ERROR [StreamWorkManager$ComponentListener] Some processors are still active
      2020-09-11 23:18:37,784 [main] WARN  [StreamWorkManager] SleepWork timeout after: 60.00s, [SleepWork, 1, 1, 0, 0]
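
The log sequence above (a timeout waiting for pool termination, followed by an `InterruptedException` while closing) is consistent with a general shutdown pattern: a pool that does not stop in time gets interrupted, and a worker whose interrupt flag is set then fails in a blocking cleanup call. The following is a minimal sketch of that pattern using only the JDK. It is not Nuxeo or Kafka code, the class and method names are hypothetical, and it is not meant to identify the actual root cause of this ticket.

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

// Hypothetical illustration only: not Nuxeo or Kafka code.
class InterruptedCloseSketch {

    // Stand-in for a close() that must block briefly to finish pending requests.
    static void blockingClose() throws InterruptedException {
        Thread.sleep(50); // throws at once if the interrupt flag is already set
    }

    static List<String> run() throws InterruptedException {
        List<String> events = new CopyOnWriteArrayList<>();
        CountDownLatch started = new CountDownLatch(1);
        ExecutorService pool = Executors.newSingleThreadExecutor();

        pool.submit(() -> {
            try {
                started.countDown();
                Thread.sleep(10_000); // long-running work item
            } catch (InterruptedException e) {
                // Restore the flag so callers can see the interruption.
                Thread.currentThread().interrupt();
            } finally {
                try {
                    blockingClose();
                    events.add("closed cleanly");
                } catch (InterruptedException e) {
                    events.add("close failed: interrupted");
                }
            }
        });

        started.await();
        pool.shutdown(); // polite stop: running tasks are not interrupted
        if (!pool.awaitTermination(100, TimeUnit.MILLISECONDS)) {
            events.add("timeout waiting for pool termination");
            pool.shutdownNow(); // interrupts the worker thread
        }
        pool.awaitTermination(5, TimeUnit.SECONDS);
        return events;
    }

    public static void main(String[] args) throws InterruptedException {
        run().forEach(System.out::println);
    }
}
```

In this sketch the cleanup fails because the interrupt flag is still set when the blocking call starts. One common mitigation, offered here only as a general option, is to clear the flag with `Thread.interrupted()` before cleanup and restore it afterwards.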
      

      Seen in: https://jenkins.platform.dev.nuxeo.com/job/nuxeo/job/nuxeo/job/master/637/testReport/junit/org.nuxeo.ecm.core.work/StreamWorkManagerTest/Run_unit_tests___Run_postgresql_unit_tests___testFatWork/

    Time Tracking

    • Original Estimate: Not Specified
    • Remaining Estimate: 0m
    • Time Spent: 2h