climateprediction.net home page
Task 15295167

Task 15295167

Name hadam3p_eu_2ixv_1982_1_008202879_0
Workunit 8358003
Created 21 Sep 2012, 13:06:38 UTC
Sent 21 Sep 2012, 13:14:56 UTC
Report deadline 3 Sep 2013, 18:34:56 UTC
Received 5 Dec 2012, 20:13:16 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -187 (0xFFFFFF45) ERR_RESULT_UPLOAD
Computer ID 1217407
Run time 4 days 12 hours 10 min 10 sec
CPU time 4 days 7 hours 19 min 47 sec
Validate state Invalid
Credit 1,988.94
Device peak FLOPS 2.17 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<message>
upload failure
</message>
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4100, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4448, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4228, selfPID=3468, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2320, selfPID=4112, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4548, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3400, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2296, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1896, selfPID=3580, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5076, selfPID=3564, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4224, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1872, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4864, selfPID=4356, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4184, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2072, iMonCtr=2
Model crash detected, will try to restart...
18:06:19 (3676): No heartbeat from core client for 30 sec - exiting
18:06:20 (3676): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2968, selfPID=4804, iMonCtr=1
Model crash detected, will try to restart...
16:37:27 (4052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3572, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4416, selfPID=3844, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4388, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2180, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4168, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3416, selfPID=3796, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4456, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3352, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/xaakm.pipe_dummy                                                            2048    
Leaving CPDN_Main::Monitor...

zip error: Output file write failure (write error on zip file)

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
06 Nov 2012 02:24:49 1217407 15295167 hadam3p_eu_2ixv_1982_1_008202879_0 115,296 339,378 2.9435
22 Oct 2012 20:24:43 1217407 15295167 hadam3p_eu_2ixv_1982_1_008202879_0 103,776 305,236 2.9413
20 Oct 2012 18:50:57 1217407 15295167 hadam3p_eu_2ixv_1982_1_008202879_0 92,256 270,872 2.9361
20 Oct 2012 09:36:36 1217407 15295167 hadam3p_eu_2ixv_1982_1_008202879_0 80,736 237,862 2.9462
20 Oct 2012 00:50:28 1217407 15295167 hadam3p_eu_2ixv_1982_1_008202879_0 69,216 204,738 2.9580
15 Oct 2012 19:25:20 1217407 15295167 hadam3p_eu_2ixv_1982_1_008202879_0 57,696 171,364 2.9701
14 Oct 2012 14:22:10 1217407 15295167 hadam3p_eu_2ixv_1982_1_008202879_0 46,176 137,338 2.9742
01 Oct 2012 19:57:03 1217407 15295167 hadam3p_eu_2ixv_1982_1_008202879_0 34,656 103,435 2.9846
30 Sep 2012 14:07:32 1217407 15295167 hadam3p_eu_2ixv_1982_1_008202879_0 23,136 68,959 2.9806
23 Sep 2012 17:54:14 1217407 15295167 hadam3p_eu_2ixv_1982_1_008202879_0 11,616 34,541 2.9736


©2024 cpdn.org