Uploaded image for project: 'Nuxeo AI Core'
  1. Nuxeo AI Core
  2. AICORE-168

Fix training and evaluation dataset split failures

    XMLWordPrintable

    Details

    • Tags:
    • Sprint:
      nxAI Sprint 11.1.24, nxAI Sprint 11.1.25

      Description

      We need to investigate on the reasons why we could not have an evaluation dataset in some situations and what to do if its not the case (for now it does nothing apparently). We need to check anyway as well if the following stack trace in the client batch upload could be a consequence:

      org.nuxeo.client.spi.NuxeoClientException: Error during batch upload
      	at org.nuxeo.client.objects.upload.BatchUpload.upload(BatchUpload.java:203) ~[nuxeo-java-client-3.2.0.jar:3.2.0]
      	at org.nuxeo.ai.cloud.NuxeoCloudClient.uploadedDataset(NuxeoCloudClient.java:178) ~[nuxeo-ai-model-2.1.3-SNAPSHOT.jar:?]
      	at org.nuxeo.ai.bulk.DataSetUploadComputation.lambda$processRecord$0(DataSetUploadComputation.java:74) ~[nuxeo-ai-model-2.1.3-SNAPSHOT.jar:?]
      	at org.nuxeo.ai.bulk.DataSetUploadComputation.runInTransaction(DataSetUploadComputation.java:114) ~[nuxeo-ai-model-2.1.3-SNAPSHOT.jar:?]
      	at org.nuxeo.ai.bulk.DataSetUploadComputation.processRecord(DataSetUploadComputation.java:64) ~[nuxeo-ai-model-2.1.3-SNAPSHOT.jar:?]
      	at org.nuxeo.lib.stream.computation.log.ComputationRunner.lambda$processRecordWithRetry$10(ComputationRunner.java:366) ~[nuxeo-stream-10.10-HF17.jar:?]
      	at net.jodah.failsafe.Functions$10.call(Functions.java:252) [failsafe-1.1.0.jar:1.1.0]
      	at net.jodah.failsafe.SyncFailsafe.call(SyncFailsafe.java:145) [failsafe-1.1.0.jar:1.1.0]
      	at net.jodah.failsafe.SyncFailsafe.run(SyncFailsafe.java:81) [failsafe-1.1.0.jar:1.1.0]
      	at org.nuxeo.lib.stream.computation.log.ComputationRunner.processRecordWithRetry(ComputationRunner.java:366) [nuxeo-stream-10.10-HF17.jar:?]
      	at org.nuxeo.lib.stream.computation.log.ComputationRunner.processRecord(ComputationRunner.java:349) [nuxeo-stream-10.10-HF17.jar:?]
      	at org.nuxeo.lib.stream.computation.log.ComputationRunner.processLoop(ComputationRunner.java:239) [nuxeo-stream-10.10-HF17.jar:?]
      	at org.nuxeo.lib.stream.computation.log.ComputationRunner.run(ComputationRunner.java:184) [nuxeo-stream-10.10-HF17.jar:?]
      	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [?:1.8.0_232]
      	at java.util.concurrent.FutureTask.run(FutureTask.java:266) [?:1.8.0_232]
      	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [?:1.8.0_232]
      	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [?:1.8.0_232]
      	at java.lang.Thread.run(Thread.java:748) [?:1.8.0_232]
      Caused by: java.io.FileNotFoundException: /apps/nuxeo/tmp/nxbincache.5611589903397908033/81ab8936ad533951ec8d617ac9498b9e (No such file or directory)
      	at java.io.FileInputStream.open0(Native Method) ~[?:1.8.0_232]
      	at java.io.FileInputStream.open(FileInputStream.java:195) ~[?:1.8.0_232]
      	at java.io.FileInputStream.<init>(FileInputStream.java:138) ~[?:1.8.0_232]
      	at org.nuxeo.client.objects.blob.FileBlob.getStream(FileBlob.java:67) ~[nuxeo-java-client-3.2.0.jar:3.2.0]
      	at org.nuxeo.client.objects.upload.BatchUpload.upload(BatchUpload.java:184) ~[nuxeo-java-client-3.2.0.jar:3.2.0]
      	... 17 more
      

        Attachments

          Activity

            People

            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0 minutes
                0m
                Logged:
                Time Spent - 2 days, 1 hour
                2d 1h