-
Type: Bug
-
Status: Resolved
-
Priority: Blocker
-
Resolution: Won't Fix
-
Affects Version/s: 3.4.3
-
Fix Version/s: NoFixVersionApplicable
-
Component/s: Production Incident
-
Tags:
On May the 3th 11pm CET, several errors occurred when customers tried to do a git clone of their Studio project. The Gitty node was temporarily unavailable and a 503 error was returned. Here are the different errors found in the application logs.
- Workers ran on Gitty node
- Failed to execute async event null on listener segmentIOEventListener
- Unable to write stat entry for xxx-xxx-xxx-xxx on Gitty nodes
- Exception during work: HouseKeepingWorker(/triggerHouseKeepingListener:12823062651770.1770079632, Progress(?%, ?/0), null) on Gitty nodes
- Exception during projectRemovalListener sync listener execution, continuing to run other listeners
=> Those errors are investigated and fixed in the scope of https://jira.nuxeo.com/browse/NXS-6360
- Git repository not found on Gitty nodes
- Failed to clone remote repository for project: project1-habeo
- Failed to clone remote repository for project: brendan-phillips-haley
- Failed to clone remote repository for project: sonia-sherman-koch-and
- Failed to clone remote repository for project: joy-nichols-mccall-and
- Failed to clone remote repository for project: clinton-fitzgerald-webb
=> In rare cases, we have identified a bunch of files locked by jgit and it prevents the cleanup of those trial projects. Because of the exceptions, the cleanup ends in a timeout, the transaction is rolled back so those "corrupted" trials are kept in Connect and the errors will occur again when the next cleanup is triggered.
https://jira.nuxeo.com/browse/NXS-6374
- Exception during jiraSynchroListener sync listener execution, continuing to run other listeners
org.nuxeo.ecm.core.api.NuxeoException: Error while trying to delete Jira user with username = mtienda@geoit.com.mx Response code : 400 Response : {"errorMessages":["Cannot delete user, the user directory is read-only."],"errors":{}
-
org.nuxeo.ecm.core.api.NuxeoException: Error while trying to delete Jira user with username = mtavila.1@gmail.com Response code : 400 Response : {"errorMessages":["Cannot delete user, the user directory is read-only."],"errors":{}}
- Couldn't find StudioProject for ConnectProject with id content-now
- It needs to be investigated further but that error is not responsible for the unavailability of the Gitty node.
- Failed to initialize project xxx
org.eclipse.jgit.errors.TransportException: http://nos-prod-gitty-nuxeo-15-euw.nos-15.prod.local/nuxeo/git/uuhp-case-tracker.git: 503 Service Temporarily Unavailable
org.eclipse.jgit.errors.TransportException: http://nos-prod-gitty-nuxeo-15-euw.nos-15.prod.local/nuxeo/git/wvhp-case-tracker.git: 503 Service Temporarily Unavailable- Those errors are a consequence of the unavailability of the Gitty node
- com.nuxeo.studio.core.api.exception.VersioningException: state='SAFE',
dir='/var/lib/nuxeo/studio/workspaces/multitenant-apple/multitenant-apple-#346-cloned-2021-04-26-16-02-42/.git',
HEAD='ref: refs/heads/wip/mcedica-ext-nuxeo-com_4199f8/master' | Could not check remote branch existence- Same as above, it's a consequence of the unavailability of the Gitty node
A note book has been created in Datadog to track the metrics related to the different ELB: https://app.datadoghq.com/notebook/749358/arnaud-5-may-2021-11-01
- mentioned in
-
Page Loading...