[NXP-22640] Fix random UT failure on nuxeo-mqueue with Kafka impl - Nuxeo Issue Tracker

XML

Word

Printable

Details

Type: Bug
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 9.2
Component/s: Streams

Epic Link:
Async Infra - R&D #1
Tags:
Sprint:
nxcore 9.2.6
Story Points:
3

Description

For the kafka impl of mqueue some tests are failing randomly:

    org.nuxeo.ecm.platform.importer.mqueues.tests.computation.TestMQComputationManagerKafka.testStopAndResume
    org.nuxeo.ecm.platform.importer.mqueues.tests.computation.TestMQComputationManagerKafka.testComplexTopoManyRecords
    org.nuxeo.ecm.platform.importer.mqueues.tests.TestAutomationKafka.testBlobAndDocumentImport

These failures were related to long delay to first partition attributions, the results is that consumers believes that there is no more messages to read,
now we wait for partition attribution before taking in account read timeout.

Also on test infra kafka has a 24h retention policy but this does not removes old topics, Kafka create a folder per partition, full unit test creates around 700 partitions, after dozen of executions topic creation are very slow and buggy even if we wait for topic availability for zookeeper,
there is a lag from the broker and producer may simply believe that the new partition does not exists:

12:43:28 14:43:28,709 [kafka-producer-network-thread | producer-27] WARN  [NetworkClient$DefaultMetadataUpdater] Error while fetching metadata with correlation id 1 : {nuxeo-test-1499172208473-queueName=UNKNOWN_TOPIC_OR_PARTITION}

As work around a CI cleaning job as been setup https://qa.nuxeo.org/jenkins/job/System/job/cleanup-kafka/ to reset ZK and Kafka data.

Attachments

Issue Links

is related to

NXP-22397 Provides a Kafka impl of Computation with distributed load

Resolved

is required by

NXP-21542 Upgrade Kafka to 0.10.1.0

Resolved

Activity

People

Assignee:

Benoit Delbosc

Reporter:

Benoit Delbosc

Participants:

Anahide Tchertchian, Benoit Delbosc, Jenkins

Votes:

0 Vote for this issue

Watchers:

3 Start watching this issue

Dates

Created:

2017-06-30 09:23

Updated:

2017-08-25 13:59

Resolved:

2017-07-05 14:58

Time Tracking

Estimated:

Not Specified

Remaining:

Logged: