Uploaded image for project: 'Nuxeo ECM Build/Test Environment'
  1. Nuxeo ECM Build/Test Environment
  2. NXBT-3787

Fix 2021 Hotfix Docker image build

    XMLWordPrintable

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None

      Description

      The nuxeo-hf job has been failing 3 times in a row at the "Build Docker" stage, when run with the following parameters:
      NUXEO_BRANCH=2021
      NUXEO_BUILD_VERSION=2021.48.7

      [2024-01-11T04:15:37.889Z] Installing nuxeo-2021-HF47-1.0.0
      [2024-01-11T04:15:37.889Z] Installing nuxeo-2021-HF48-1.0.0
      [2024-01-11T04:15:38.835Z] Dependency resolution:
      [2024-01-11T04:15:38.835Z]   Unchanged packages (48):       nuxeo-2021-HF03:1.0.0, nuxeo-2021-HF25:1.0.0, nuxeo-2021-HF47:1.0.0, nuxeo-2021-HF02:1.0.0, nuxeo-2021-HF24:1.0.0, nuxeo-2021-HF46:1.0.0, nuxeo-2021-HF01:1.0.0, nuxeo-2021-HF23:1.0.0, nuxeo-2021-HF45:1.0.0, nuxeo-2021-HF22:1.0.0, nuxeo-2021-HF44:1.0.0, nuxeo-2021-HF21:1.0.0, nuxeo-2021-HF43:1.0.0, nuxeo-2021-HF20:1.0.0, nuxeo-2021-HF42:1.0.0, nuxeo-2021-HF41:1.0.0, nuxeo-2021-HF40:1.0.0, nuxeo-2021-HF19:1.0.0, nuxeo-2021-HF18:1.0.0, nuxeo-2021-HF17:1.0.0, nuxeo-2021-HF39:1.0.0, nuxeo-2021-HF16:1.0.0, nuxeo-2021-HF38:1.0.0, nuxeo-2021-HF15:1.0.0, nuxeo-2021-HF37:1.0.0, nuxeo-2021-HF14:1.0.0, nuxeo-2021-HF36:1.0.0, nuxeo-2021-HF13:1.0.0, nuxeo-2021-HF35:1.0.0, nuxeo-2021-HF12:1.0.0, nuxeo-2021-HF34:1.0.0, nuxeo-2021-HF11:1.0.0, nuxeo-2021-HF33:1.0.0, nuxeo-2021-HF10:1.0.0, nuxeo-2021-HF32:1.0.0, nuxeo-2021-HF31:1.0.0, nuxeo-2021-HF30:1.0.0, nuxeo-2021-HF09:1.0.0, nuxeo-2021-HF08:1.0.0, nuxeo-2021-HF07:1.0.3, nuxeo-2021-HF29:1.0.0, nuxeo-2021-HF06:1.0.0, nuxeo-2021-HF28:1.0.0, nuxeo-2021-HF05:1.0.0, nuxeo-2021-HF27:1.0.0, nuxeo-2021-HF04:1.0.0, nuxeo-2021-HF26:1.0.0, nuxeo-2021-HF48:1.0.0
      [2024-01-11T04:15:38.835Z] Unsetting CLID
      [2024-01-11T04:15:38.835Z] Clean up package installation directories: /opt/nuxeo/server/packages/backup, /opt/nuxeo/server/packages/tmp
      [2024-01-11T04:15:39.097Z] 
      [2024-01-11T04:15:40.045Z] INFO[0490] RUN chown -R 900:0 $NUXEO_HOME   && chmod -R g+rwX $NUXEO_HOME 
      [2024-01-11T04:15:40.618Z] INFO[0490] Cmd: /bin/sh                                 
      [2024-01-11T04:15:40.619Z] INFO[0490] Args: [-c chown -R 900:0 $NUXEO_HOME   && chmod -R g+rwX $NUXEO_HOME] 
      [2024-01-11T04:15:40.619Z] INFO[0490] Util.Lookup returned: &{Uid:0 Gid:0 Username:root Name: HomeDir:/root} 
      [2024-01-11T04:15:40.619Z] INFO[0490] Performing slow lookup of group ids for root 
      [2024-01-11T04:15:40.619Z] INFO[0490] Running: [/bin/sh -c chown -R 900:0 $NUXEO_HOME   && chmod -R g+rwX $NUXEO_HOME] 
      [2024-01-11T04:15:41.567Z] INFO[0491] USER 900                                     
      [2024-01-11T04:15:41.567Z] INFO[0491] Cmd: USER                                    
      [2024-01-11T04:15:41.567Z] INFO[0491] Taking snapshot of full filesystem...        
      [2024-01-11T04:16:13.678Z] pod has failed
      

      Maybe, given the high number of hotfixes, the image is getting too large and the file system snapshotting, done in memory by Kaniko, gets the Kaniko pod OOMKilled by Kubernetes...
      We couldn't find any trace of 137 OOMKilled error, yet it's difficult to see once the pod is killed.

      Let's try to build the Hotfix Docker image with Buildx/Buildkit, like we do for the Nuxeo LTS Docker image, see NXP-31273.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                ataillefer Antoine Taillefer
                Reporter:
                ataillefer Antoine Taillefer
                Participants:
              • Votes:
                0 Vote for this issue
                Watchers:
                1 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: