[NXP-30426] Use a bulk action for Mongo Read ACL propagation - Nuxeo Issue Tracker

XML

Word

Printable

Details

Type: Bug
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2021.11
Component/s: Core DBS

Release Notes Summary:
Mongo Read ACL propagation uses a bulk action.
Epic Link:
ACL change Propagation on Large Repositories
Tags:
- grooming
- nxplatform
Upgrade notes:

Hide

Read ACls are now updated with a BAF action

Show
Read ACls are now updated with a BAF action
Sprint:
nxplatform #42, nxplatform #46
Story Points:
8

Description

On DBS, when an ACL is updated for a doc a FindReadAclsWork Work is scheduled.

The FindReadAclsWork scrolls all the children of the doc, using a batch of 500 and scroll timeout of 1min

for each batch, it schedules a UpdateReadAclsWork with the 500 doc ids
the transaction is committed and started

The UpdateReadAclsWork updates ACL of the docs with a batch of 50 docs in 10 transactions.

All works are scheduled on the common queue with a retry of 1.

Problems with a large repository:

FindReadAclsWork potentially can match the entire repository and takes hours to complete, this is going to block other Works in the common queue.
FindReadAclsWork can fail during scroll because of MongoDBSocketTimeout or because the leader has changed which interrupts the query with a MongoQueryException. In this case, the retry will start the process again submitting duplicate UpdateReadAclsWork.
There is no way other than introspecting the common Work queue and doing thread dump to understand what is going on in this massive processing.
There is no status of the action, any Works involved can end up in the DLQ after a retry resulting in a partial ACL propagation.
The update of ACL could trigger other listeners ((not confirmed on local test with a stock Nuxeo but probably on prod),
This generates lots of cache invalidations (every 50 docs) loading the pub-sub topic

Also, event if tuned changing root ACL means touching all docs and reindexing *this will always be a heavy process that should be avoided on large repos*,
the project should be designed with role groups (like MemberRead MemberWrite ...) set at root levels from the beginning, you give access to user or group manipulating only the groups/user directories without having to update ACL.

Need to be checked but the indexing could be duplicated, the change at the root level trigger an indexing scroller that is going to update all children, it seems that when children's ACL is updated they are also reindexed.

Attachments

Issue Links

is related to

NXP-30540 Trace long running bulk command

Resolved

NXP-30541 Use a dedicated Work queue for ACL Propagation

Resolved

NXP-30805 Route long indexing command to the Bulk Service keeping WM indexing near realtime

Resolved

NXP-30757 Use a bulk action for Mongo Read ACL propagation on 10.10

Resolved

Is referenced in

PR for 10.10: #5021

PR for 2021: #264

mentioned in: Page Loading...

(1 Is referenced in, 1 mentioned in)

Activity

People

Assignee:

Guillaume Renard

Reporter:

Benoit Delbosc

Participants:

Benoit Delbosc, Guillaume Renard, Jenkins, Support Tech User, Thomas Pare

Votes:

0 Vote for this issue

Watchers:

8 Start watching this issue

Dates

Created:

2021-05-20 09:58

Updated:

2022-01-14 15:53

Resolved:

2021-10-16 11:40