|
Verfasst am: 03. 02. 2012 [12:06]
|
|
hhoffmann
Holger Hoffmann
Themenersteller
Dabei seit: 21.12.2011
Beiträge: 26
|
Hi,
I'm having this issue:
I have calculations on time series (climate data) for n sites and although the data has always the same structure, the duration of jobs is very different. So depending on 1.) The host machine 2.) The data, most of the jobs (~70%) complete in 6 h, but the rest may last up to 20 h.
Therefore I get these complain e-mails to adjust the requested wallclocktime. I cant solve it, because it would be very exhausting to make pre-runs for every site, nor do I know the machine it will run on.
Is this an issue or should I just delete these mails everyday? How do you handle this?
Kind regards,
Holger
|
|
Verfasst am: 03. 02. 2012 [12:25]
|
|
gerdes
Andreas Gerdes
Dabei seit: 14.09.2010
Beiträge: 49
|
Hi Holger,
in your case: just ignore the e-mails.
Best regards
Andreas
|
|
Verfasst am: 21. 02. 2012 [19:52]
|
|
hhoffmann
Holger Hoffmann
Themenersteller
Dabei seit: 21.12.2011
Beiträge: 26
|
Hi,
second wallclocktime-problem: I´m getting jobs aborted. E.g.:
Job: reRun3.o473621-492
--------------------------------------------------------------
Job ran on: lucky.rrzn.uni-hannover.de
Module for Mathworks Matlab, version 2011b loaded.
=>> PBS: job killed: walltime 144036 exceeded limit 144000
--------------------------------------------------------------
A 40 h - Wallclocktime-job, supposely.
The problem: It ran only <20 h, according to the shell:
qstat -t 473621[]
How come?
Kind regards,
Holger
[Dieser Beitrag wurde 1mal bearbeitet, zuletzt am 21.02.2012 um 19:53.]
|