Task 15802992

Name hadcm3n_n2ta_1880_40_008374409_0
Workunit 8525268
Created 29 May 2013, 21:30:31 UTC
Sent 29 May 2013, 21:38:08 UTC
Report deadline 29 Aug 2013, 5:05:19 UTC
Received 3 Oct 2013, 1:38:51 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1281494
Run time 13 days 23 hours 52 min 24 sec
CPU time 13 days 11 hours 0 min 59 sec
Validate state Invalid
Credit 8,087.04
Device peak FLOPS 3.04 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3812, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:10:35 (4312): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:10:36 (4312): No heartbeat from core client for 30 sec - exiting
21:10:37 (4312): No heartbeat from core client for 30 sec - exiting
21:10:38 (4312): No heartbeat from core client for 30 sec - exiting
21:10:39 (4312): No heartbeat from core client for 30 sec - exiting
21:10:40 (4312): No heartbeat from core client for 30 sec - exiting
21:10:41 (4312): No heartbeat from core client for 30 sec - exiting
21:10:42 (4312): No heartbeat from core client for 30 sec - exiting
21:10:43 (4312): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6196, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6196, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:35:38 (1388): No heartbeat from core client for 30 sec - exiting
11:35:39 (1388): No heartbeat from core client for 30 sec - exiting
11:35:40 (1388): No heartbeat from core client for 30 sec - exiting
11:35:41 (1388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7116, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5576, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6412, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5948, iMonCtr=1
Model crash detected, will try to restart...
15:45:21 (4908): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1904, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5060, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
17:23:36 (5292): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5208, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5304, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6392, iMonCtr=1
Model crash detected, will try to restart...
17:24:16 (5728): No heartbeat from core client for 30 sec - exiting
17:24:17 (5728): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5688, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2564, iMonCtr=1
Model crash detected, will try to restart...
13:49:41 (6100): No heartbeat from core client for 30 sec - exiting
13:49:42 (6100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6724, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1204, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5860, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5992, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5916, iMonCtr=1
Model crash detected, will try to restart...
15:20:29 (5324): No heartbeat from core client for 30 sec - exiting
15:20:30 (5324): No heartbeat from core client for 30 sec - exiting
15:20:31 (5324): No heartbeat from core client for 30 sec - exiting
15:20:32 (5324): No heartbeat from core client for 30 sec - exiting
15:20:33 (5324): No heartbeat from core client for 30 sec - exiting
15:20:34 (5324): No heartbeat from core client for 30 sec - exiting
15:20:35 (5324): No heartbeat from core client for 30 sec - exiting
15:20:37 (5324): No heartbeat from core client for 30 sec - exiting
15:20:38 (5324): No heartbeat from core client for 30 sec - exiting
15:20:39 (5324): No heartbeat from core client for 30 sec - exiting
15:20:40 (5324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1288, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048

Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
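
The stderr text above mixes several kinds of events (quit and suspend requests, missed heartbeats, restart attempts, and model crashes). A minimal Python sketch for tallying them is given here, assuming the log has been saved as plain text. The function name `summarize_stderr` and the category labels are hypothetical and chosen only for illustration; the message fragments are copied from the log itself. In the log above, six `Model crashed` lines precede the "too many model crashes" message, but the actual crash limit used by the CPDN controller is not stated in the source.

```python
import re
from collections import Counter

# Hypothetical category labels; the matched fragments are taken
# verbatim from the stderr text of this task.
PATTERNS = {
    "quit_request": re.compile(r"Quit request from BOINC"),
    "suspend_request": re.compile(r"Suspend request from BOINC"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "restart_attempt": re.compile(r"Model crash detected, will try to restart"),
    "model_crash": re.compile(r"^Model crashed:"),
}


def summarize_stderr(text: str) -> Counter:
    """Count how many log lines fall into each event category."""
    counts = Counter()
    for line in text.splitlines():
        stripped = line.strip()
        for label, pattern in PATTERNS.items():
            if pattern.search(stripped):
                counts[label] += 1
    return counts


# A short excerpt in the same format as the log above.
sample = """\
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:10:35 (4312): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Model crash detected, will try to restart...
Model crashed: ATM_DYN : INVALID THETA DETECTED.
"""

if __name__ == "__main__":
    for label, count in sorted(summarize_stderr(sample).items()):
        print(f"{label}: {count}")
```

Run against the full stderr text, a tally like this makes it easier to see whether the task ended because of repeated model crashes or because of client interruptions.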
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
01 Oct 2013 02:12:50  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  673,920   1,155,147       1.7141
27 Sep 2013 01:34:57  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  648,000   1,106,034       1.7068
21 Sep 2013 03:49:02  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  622,080   1,057,263       1.6996
15 Sep 2013 18:26:47  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  596,160   1,008,992       1.6925
14 Sep 2013 02:45:36  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  570,240   960,060         1.6836
09 Sep 2013 22:45:00  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  544,320   911,366         1.6743
02 Sep 2013 02:56:53  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  518,400   863,664         1.6660
31 Aug 2013 20:04:55  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  492,480   823,407         1.6720
30 Aug 2013 21:28:58  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  466,560   786,302         1.6853
26 Aug 2013 23:07:48  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  440,640   750,271         1.7027
25 Aug 2013 19:05:15  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  414,720   713,724         1.7210
24 Aug 2013 03:01:38  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  388,800   677,095         1.7415
21 Aug 2013 21:25:02  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  362,880   639,560         1.7625
14 Aug 2013 21:30:53  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  336,960   596,605         1.7706
14 Aug 2013 21:30:53  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  311,040   552,623         1.7767
14 Aug 2013 21:30:53  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  285,120   508,941         1.7850
14 Aug 2013 21:30:53  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  259,200   465,319         1.7952
14 Aug 2013 21:30:53  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  233,280   420,097         1.8008
25 Jul 2013 00:05:48  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  207,360   371,509         1.7916
23 Jul 2013 21:18:44  1281494  15802992   hadcm3n_n2ta_1880_40_008374409_0  181,440   325,745         1.7953
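
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. A small Python sketch that checks this against a few rows is given here; the helper names `parse_count` and `seconds_per_timestep` are hypothetical, and the relationship is inferred from the figures in the table rather than from any project documentation.

```python
def parse_count(text: str) -> int:
    """Convert a comma-grouped figure such as '673,920' to an int."""
    return int(text.replace(",", ""))


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds divided by the number of timesteps completed."""
    return cpu_time_sec / timestep


# (timestep, CPU time in seconds, reported average) copied from the table.
rows = [
    ("673,920", "1,155,147", 1.7141),
    ("518,400", "863,664", 1.6660),
    ("181,440", "325,745", 1.7953),
]

if __name__ == "__main__":
    for timestep_text, cpu_text, reported in rows:
        computed = seconds_per_timestep(parse_count(cpu_text), parse_count(timestep_text))
        print(f"timestep {timestep_text}: computed {computed:.4f}, reported {reported:.4f}")
```

The same division can be applied to any other row to spot trickles whose reported average does not match the CPU time and timestep figures.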


©2024 climateprediction.net