Task 13359638

Name hadcm3n_t3nr_1940_40_007447635_1
Workunit 7645138
Created 9 Sep 2011, 18:35:54 UTC
Sent 16 Sep 2011, 1:10:48 UTC
Report deadline 16 Dec 2011, 8:37:59 UTC
Received 21 Sep 2011, 2:15:59 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1122757
Run time 4 days 23 hours 29 min 34 sec
CPU time 4 days 22 hours 14 min 37 sec
Validate state Invalid
Credit 1,555.20
Device peak FLOPS 1.70 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
16:59:07 (3912): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:02:09 (3280): No heartbeat from core client for 30 sec - exiting
17:02:29 (3280): No heartbeat from core client for 30 sec - exiting
17:02:30 (3280): No heartbeat from core client for 30 sec - exiting
17:02:31 (3280): No heartbeat from core client for 30 sec - exiting
17:02:32 (3280): No heartbeat from core client for 30 sec - exiting
17:03:07 (3280): No heartbeat from core client for 30 sec - exiting
17:03:08 (3280): No heartbeat from core client for 30 sec - exiting
17:03:09 (3280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:23:28 (4436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:23:32 (4436): No heartbeat from core client for 30 sec - exiting
02:23:33 (4436): No heartbeat from core client for 30 sec - exiting
02:23:35 (4436): No heartbeat from core client for 30 sec - exiting
02:23:36 (4436): No heartbeat from core client for 30 sec - exiting
02:23:37 (4436): No heartbeat from core client for 30 sec - exiting
02:23:38 (4436): No heartbeat from core client for 30 sec - exiting
02:23:39 (4436): No heartbeat from core client for 30 sec - exiting
02:23:40 (4436): No heartbeat from core client for 30 sec - exiting
02:23:41 (4436): No heartbeat from core client for 30 sec - exiting
02:23:42 (4436): No heartbeat from core client for 30 sec - exiting
02:23:43 (4436): No heartbeat from core client for 30 sec - exiting
02:23:44 (4436): No heartbeat from core client for 30 sec - exiting
02:23:45 (4436): No heartbeat from core client for 30 sec - exiting
02:23:47 (4436): No heartbeat from core client for 30 sec - exiting
02:23:48 (4436): No heartbeat from core client for 30 sec - exiting
02:23:49 (4436): No heartbeat from core client for 30 sec - exiting
02:23:50 (4436): No heartbeat from core client for 30 sec - exiting
02:23:51 (4436): No heartbeat from core client for 30 sec - exiting
02:23:52 (4436): No heartbeat from core client for 30 sec - exiting
02:23:53 (4436): No heartbeat from core client for 30 sec - exiting
16:59:20 (3948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:00:21 (5868): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:00:26 (5868): No heartbeat from core client for 30 sec - exiting
17:00:27 (5868): No heartbeat from core client for 30 sec - exiting
17:00:28 (5868): No heartbeat from core client for 30 sec - exiting
17:00:29 (5868): No heartbeat from core client for 30 sec - exiting
17:00:30 (5868): No heartbeat from core client for 30 sec - exiting
17:00:31 (5868): No heartbeat from core client for 30 sec - exiting
17:00:32 (5868): No heartbeat from core client for 30 sec - exiting
17:00:34 (5868): No heartbeat from core client for 30 sec - exiting
17:00:35 (5868): No heartbeat from core client for 30 sec - exiting
17:00:36 (5868): No heartbeat from core client for 30 sec - exiting
17:09:18 (6556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:09:29 (6556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:00:34 (1244): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3032, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3032, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3032, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3032, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3032, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3032, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
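The stderr output above repeats the same heartbeat-timeout message under several process IDs. A minimal sketch of how those lines could be tallied per process, assuming the `HH:MM:SS (PID): message` layout seen in this log; the function name `count_heartbeat_timeouts` and the `SAMPLE` excerpt are illustrative, not part of BOINC or CPDN:

```python
import re
from collections import Counter

# Assumed line layout, taken from the log above: "HH:MM:SS (PID): No heartbeat ..."
LINE_RE = re.compile(r"^(\d{2}:\d{2}:\d{2}) \((\d+)\): No heartbeat from core client")


def count_heartbeat_timeouts(stderr_text):
    """Return a Counter mapping process ID -> number of heartbeat-timeout lines."""
    counts = Counter()
    for line in stderr_text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            counts[int(match.group(2))] += 1
    return counts


# A short excerpt copied from the stderr text above (not the full log).
SAMPLE = """\
16:59:07 (3912): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:02:09 (3280): No heartbeat from core client for 30 sec - exiting
17:02:29 (3280): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
"""

sample_counts = count_heartbeat_timeouts(SAMPLE)
for pid, n in sorted(sample_counts.items()):
    print(f"PID {pid}: {n} heartbeat timeout line(s)")
```

Lines that do not match the pattern, such as the CPDN Monitor and signal messages, are simply skipped.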
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
20 Sep 2011 17:56:14  1122757  13359638   hadcm3n_t3nr_1940_40_007447635_1  129,600   396,008         3.0556
19 Sep 2011 18:20:27  1122757  13359638   hadcm3n_t3nr_1940_40_007447635_1  103,680   315,503         3.0430
18 Sep 2011 19:47:14  1122757  13359638   hadcm3n_t3nr_1940_40_007447635_1  77,760    235,750         3.0318
17 Sep 2011 22:08:05  1122757  13359638   hadcm3n_t3nr_1940_40_007447635_1  51,840    158,630         3.0600
17 Sep 2011 00:05:53  1122757  13359638   hadcm3n_t3nr_1940_40_007447635_1  25,920    79,398          3.0632
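The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. A small sketch that checks this against the five trickles listed above; the values are copied from the table, while the names `TRICKLES` and `seconds_per_timestep` are illustrative assumptions:

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table above.
TRICKLES = [
    (129_600, 396_008, 3.0556),
    (103_680, 315_503, 3.0430),
    (77_760, 235_750, 3.0318),
    (51_840, 158_630, 3.0600),
    (25_920, 79_398, 3.0632),
]


def seconds_per_timestep(cpu_seconds, timestep):
    """Average CPU seconds spent per model timestep so far."""
    return cpu_seconds / timestep


# Recompute each average to four decimal places, as shown in the table.
computed = [round(seconds_per_timestep(cpu, ts), 4) for ts, cpu, _ in TRICKLES]
reported = [avg for _, _, avg in TRICKLES]

for (ts, cpu, avg), value in zip(TRICKLES, computed):
    print(f"timestep {ts:>7}: computed {value:.4f}, reported {avg:.4f}")
```

Under that assumption the recomputed values agree with the reported column to four decimal places for every row.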

