Task 13559508

Name hadcm3n_ya1w_1900_40_007526868_1
Workunit 7724343
Created 28 Oct 2011, 13:48:05 UTC
Sent 29 Oct 2011, 6:12:02 UTC
Report deadline 28 Jan 2012, 13:39:13 UTC
Received 25 Nov 2011, 5:27:03 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1294024
Run time 17 days 2 hours 56 min 14 sec
CPU time 7 days 11 hours 16 min 24 sec
Validate state Invalid
Credit 7,153.92
Device peak FLOPS 2.37 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (i686-pc-linux-gnu)
Stderr
<core_client_version>6.10.17</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2328, iMonCtr=1
Model crash detected, will try to restart...
15:02:06 (2328): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:02:07 (2328): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22521, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=22521, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
01:05:43 (2326): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:05:46 (2326): No heartbeat from core client for 30 sec - exiting
01:05:47 (2326): No heartbeat from core client for 30 sec - exiting
01:05:50 (2326): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=405, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
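The stderr text above repeats a small number of message types, so a tally makes the sequence easier to read. The following is a minimal sketch, not part of BOINC or CPDN: the label names, the `summarize_stderr` and `describe_exit_code` helpers, and the short excerpt are all introduced here for illustration. The exit-code helper assumes that the third number in the log line (-234) is the status minus 256, which matches the one line shown but is not confirmed by the page.

```python
import re
from collections import Counter

# A short excerpt copied from the stderr text above (not the full log).
STDERR_EXCERPT = """\
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2328, iMonCtr=1
Model crash detected, will try to restart...
15:02:06 (2328): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Sorry, too many model crashes! :-(
Called boinc_finish
"""

# Hypothetical labels mapped to patterns; the labels are ours, not BOINC's.
PATTERNS = {
    "suspend_request": re.compile(r"Suspend request from BOINC"),
    "quit_request": re.compile(r"Quit request from BOINC"),
    "model_crash": re.compile(r"^Model crash detected"),
    "heartbeat_lost": re.compile(r"No heartbeat from core client"),
    "gave_up": re.compile(r"too many model crashes"),
}


def summarize_stderr(text):
    """Count how many lines match each labelled pattern."""
    counts = Counter()
    for line in text.splitlines():
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
    return counts


def describe_exit_code(code):
    """Format a status as decimal, hex, and (assumed) status minus 256."""
    return f"{code} (0x{code:02x}, {code - 256})"


if __name__ == "__main__":
    for label, n in sorted(summarize_stderr(STDERR_EXCERPT).items()):
        print(f"{label}: {n}")
    print(describe_exit_code(22))  # prints "22 (0x16, -234)"
```

Run against the full stderr text instead of the excerpt, the same tally would show how many restart attempts preceded the "too many model crashes" message.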
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
25 Nov 2011 02:09:16  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  596,160   648,495         1.0878
24 Nov 2011 07:06:09  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  570,240   588,184         1.0315
23 Nov 2011 13:13:16  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  544,320   527,829         0.9697
22 Nov 2011 19:18:16  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  518,400   467,867         0.9025
22 Nov 2011 01:32:04  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  492,480   407,469         0.8274
21 Nov 2011 07:03:00  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  466,560   347,052         0.7439
20 Nov 2011 12:58:46  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  440,640   286,530         0.6503
19 Nov 2011 19:13:46  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  414,720   225,897         0.5447
19 Nov 2011 01:01:40  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  388,800   203,621         0.5237
18 Nov 2011 05:28:18  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  362,880   824,406         2.2718
17 Nov 2011 10:33:51  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  336,960   765,072         2.2705
16 Nov 2011 16:22:34  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  311,040   704,440         2.2648
15 Nov 2011 22:29:53  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  285,120   644,018         2.2588
15 Nov 2011 16:43:54  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  259,200   586,240         2.2617
15 Nov 2011 16:43:55  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  233,280   528,781         2.2667
15 Nov 2011 16:43:43  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  207,360   471,511         2.2739
15 Nov 2011 16:44:04  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  181,440   414,924         2.2868
15 Nov 2011 16:43:54  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  155,520   355,912         2.2885
15 Nov 2011 16:43:51  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  129,600   296,005         2.2840
15 Nov 2011 16:43:50  982003   13559508   hadcm3n_ya1w_1900_40_007526868_1  103,680   236,671         2.2827
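The Average (sec/TS) column in the trickle table appears to be the CPU Time (sec) value divided by the Timestep value, rounded to four decimal places. That relationship is inferred from the numbers, not stated by the page. The sketch below checks it on four rows copied from the table and also flags where the reported CPU time falls as the timestep rises (between timesteps 362,880 and 388,800); the page does not say why the counter drops there. The function names are hypothetical.

```python
# Four rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
ROWS = [
    (362880, 824406, 2.2718),
    (388800, 203621, 0.5237),
    (570240, 588184, 1.0315),
    (596160, 648495, 1.0878),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Assumed formula: CPU seconds per timestep, rounded to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


def cpu_time_drops(rows):
    """Return (previous_timestep, timestep) pairs where CPU time decreases."""
    ordered = sorted(rows)  # ascending by timestep
    return [
        (prev[0], cur[0])
        for prev, cur in zip(ordered, ordered[1:])
        if cur[1] < prev[1]
    ]


if __name__ == "__main__":
    for timestep, cpu_time, reported in ROWS:
        computed = average_sec_per_timestep(cpu_time, timestep)
        print(timestep, computed, reported)
    print(cpu_time_drops(ROWS))  # prints [(362880, 388800)]
```

Because the average is computed from the reported CPU time, it falls from about 2.27 to about 0.52 at the same point where the CPU time figure drops.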


©2024 climateprediction.net