Uploaded image for project: 'Nuxeo Platform'
  1. Nuxeo Platform
  2. NXP-30497

[CI/CD] Try to fix Kaniko Docker build failures

    Details

    • Tags:
    • Sprint:
      nxApps 2021 Cycle 6
    • Story Points:
      3

      Description

      Randomly, the Kaniko Docker build fails for an unexplained reason:

      [2021-06-09T06:07:03.853Z] INFO[5805] cmd: EXPOSE                                  
      [2021-06-09T06:07:03.853Z] INFO[5805] Adding exposed port: 8080/tcp                
      [2021-06-09T06:07:03.853Z] INFO[5805] ENTRYPOINT ["/docker-entrypoint.sh"]         
      [2021-06-09T06:07:03.853Z] INFO[5805] CMD ["nuxeoctl", "console"]                  
      [2021-06-09T06:07:03.853Z] INFO[5805] USER 900                                     
      [2021-06-09T06:07:03.853Z] INFO[5805] cmd: USER                                    
      [2021-06-09T06:09:55.403Z] time="2021-06-09T06:09:51Z" level=fatal msg="build failed: build failed: building [nuxeo]: kaniko build for [nuxeo]: waiting for pod to complete: context closed while waiting for condition"
      script returned exit code 1
      

      It often happens several times in a row.
      It could be due to a CPU/memory resource issue when the K8s node pool is overloaded.

      See the work done in the AI builder to increase the Kaniko resources: https://github.com/nuxeo/jx-ai-builders/blob/master/skaffold.yaml#L26

      Or/and we could try these options:

      useNewRun
      snapshotmode

      Let's also add tolerations to be tolerated by the Platform dedicated node pool.

       

        Attachments

          Activity

            People

            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                PagerDuty

                Error rendering 'com.pagerduty.jira-server-plugin:PagerDuty'. Please contact your Jira administrators.