Re: [BLAST_SHIFTS] Shift summary 08/01/2004 B (9-17)

From: Chris Crawford (chris2@lns.mit.edu)
Date: Mon Aug 02 2004 - 09:07:16 EDT


hi doug,
  when you killed those jobs, did you make a note somewhere of which
event it got stuck on? that would be real helpful for debugging so we
don't have to restart from the beginning.
--thanks, chris

Electronic Log Book wrote:
>
> Operator: hasell
>
> Everything ran very smoothly.
>
> Around 10:00 the HV tripped and HV crate R1 would not respond, so I had to reboot the HV in the middle of run 9662.
>
> Sam's daemon for allocating runs to the spuds and buds works very nicely. However, I noticed that 4 jobs (9623, 9626, 9631, and 9639) appeared to be stuck: after many hours they were still at the same position in crunching, so I killed them. Also, bud21, which had a job running on it, is not getting new jobs for some reason, even though there are jobs in the queue.
