climateprediction.net home page
Task 12956538

Task 12956538

Name hadam3p_eu_2qum_1980_1_007283257_0
Workunit 7480461
Created 8 Jun 2011, 5:13:12 UTC
Sent 8 Jun 2011, 5:13:33 UTC
Report deadline 20 May 2012, 10:33:33 UTC
Received 19 Jun 2011, 23:51:25 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 194 (0x000000C2) EXIT_ABORTED_BY_CLIENT
Computer ID 1097575
Run time 6 days 17 hours 30 min 22 sec
CPU time 1 days 5 hours 27 min 24 sec
Validate state Invalid
Credit 2,187.67
Device peak FLOPS 2.02 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
Got ack for job that's till active
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6036, selfPID=5068, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:56:24 (3860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:56:26 (3860): No heartbeat from core client for 30 sec - exiting
09:56:27 (3860): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11120, iMonCtr
=2
el crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
12:32:55 (5444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:32:57 (5444): No heartbeat from core client for 30 sec - exiting
12:32:58 (5444): No heartbeat from core client for 30 sec - exiting
12:32:59 (5444): No heartbeat from core client for 30 sec - exiting
12:33:00 (5444): No heartbeat from core client for 30 sec - exiting
12:33:01 (5444): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1072, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4136, selfPID=5624, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1252, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3476, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5368, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5288, selfPID=5344, iMonCtr=1
Model crash detected, will try to restart...

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...

zip error: Output file write failure (write error on zip file)
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
19 Jun 2011 22:00:36 1097575 12956538 hadam3p_eu_2qum_1980_1_007283257_0 126,816 74,765 0.5896
17 Jun 2011 06:27:54 1097575 12956538 hadam3p_eu_2qum_1980_1_007283257_0 115,296 29,322 0.2543
16 Jun 2011 03:59:27 1097575 12956538 hadam3p_eu_2qum_1980_1_007283257_0 103,776 415,316 4.0020
15 Jun 2011 07:24:08 1097575 12956538 hadam3p_eu_2qum_1980_1_007283257_0 92,256 369,992 4.0105
14 Jun 2011 17:35:44 1097575 12956538 hadam3p_eu_2qum_1980_1_007283257_0 80,736 324,534 4.0197
13 Jun 2011 20:26:50 1097575 12956538 hadam3p_eu_2qum_1980_1_007283257_0 69,216 278,864 4.0289
12 Jun 2011 22:55:43 1097575 12956538 hadam3p_eu_2qum_1980_1_007283257_0 57,696 232,630 4.0320
12 Jun 2011 04:01:34 1097575 12956538 hadam3p_eu_2qum_1980_1_007283257_0 46,176 186,304 4.0347
10 Jun 2011 17:57:07 1097575 12956538 hadam3p_eu_2qum_1980_1_007283257_0 34,656 138,448 3.9949
09 Jun 2011 21:57:29 1097575 12956538 hadam3p_eu_2qum_1980_1_007283257_0 23,136 91,808 3.9682
08 Jun 2011 21:14:38 1097575 12956538 hadam3p_eu_2qum_1980_1_007283257_0 11,616 46,192 3.9766


©2024 climateprediction.net