Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Does the trasnfer mechanism accept any sort of regular expression?  E.g. transfer_input_files=*.txt
  • Can transfer mechanism accept manifest files?  E.g. a file that is a list of files?
  • What other options are there than holding a job?  I find myself not noticing, sometimes for hours, that a job is on hold.  Is there a way to make jobs fail instead of get held?  I assume others will make this mistake like me.
    • I see I can set periodic_remove = (JobStatus == 5) but HTCondor doesn't seem to think that is an error so if I have notification = Error I don't get any email.
  • condor_q truncates output so which leads to me getting the wrong jobid.  E.g. with condor_q I see my jobids as 138 and 139 instead of 1380 or 1398

    herapost-master krowe >condor_q


    -- Schedd: herapost-master.aoc.nrao.edu : <10.64.10.210:9618?... @ 07/23/20 16:36:28
    OWNER BATCH_NAME                SUBMITTED   DONE   RUN    IDLE   HOLD  TOTAL JOB
    krowe task01-25-parallel-dag   7/22 12:56      6      1      _      _      8 138
    krowe task13.dag+1397          7/23 15:40      _      _      _      1      1 139


    herapost-master krowe >condor_q -dag -nobatch


    -- Schedd: herapost-master.aoc.nrao.edu : <10.64.10.210:9618?... @ 07/23/20 16:37:22
     ID      OWNER/NODENAME      SUBMITTED     RUN_TIME ST PRI SIZE CMD
    1369.0   krowe              7/22 12:56   1+03:40:45 R  0    0.3 condor_dagman -p
    1380.0    |-task24          7/23 10:37   0+06:00:17 R  0   3906 task.sh task24.p
    1397.0   krowe              7/23 15:40   0+00:56:49 R  0    0.3 condor_dagman -p
    1398.0    |-task13          7/23 15:40   0+00:00:03 H  0    0.0 task.sh task13.p



...