climateprediction.net home page
Task 13536189

Task 13536189

Name hadcm3n_yiv3_1900_40_007515260_1
Workunit 7712735
Created 28 Oct 2011, 12:41:08 UTC
Sent 24 Nov 2011, 0:37:15 UTC
Report deadline 23 Feb 2012, 8:04:26 UTC
Received 5 Dec 2011, 22:15:01 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1043044
Run time 7 days 14 hours 56 min 41 sec
CPU time 6 days 8 hours 54 min 11 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 2.42 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.60</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
03:42:51 (5000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:55:04 (6928): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:55:05 (6928): No heartbeat from core client for 30 sec - exiting
03:55:06 (6928): No heartbeat from core client for 30 sec - exiting
03:55:07 (6928): No heartbeat from core client for 30 sec - exiting
03:55:08 (6928): No heartbeat from core client for 30 sec - exiting
05:54:19 (6632): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:38:44 (6640): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:38:45 (6640): No heartbeat from core client for 30 sec - exiting
20:38:46 (6640): No heartbeat from core client for 30 sec - exiting
20:38:47 (6640): No heartbeat from core client for 30 sec - exiting
20:38:48 (6640): No heartbeat from core client for 30 sec - exiting
20:38:49 (6640): No heartbeat from core client for 30 sec - exiting
20:38:50 (6640): No heartbeat from core client for 30 sec - exiting
03:08:47 (6944): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:08:48 (6944): No heartbeat from core client for 30 sec - exiting
03:59:02 (6768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:24:17 (7552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
02:11:42 (4948): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:11:43 (4948): No heartbeat from core client for 30 sec - exiting
02:11:44 (4948): No heartbeat from core client for 30 sec - exiting
02:12:39 (5812): No heartbeat from core client for 30 sec - exiting
02:12:40 (5812): No heartbeat from core client for 30 sec - exiting
02:12:41 (5812): No heartbeat from core client for 30 sec - exiting
02:12:42 (5812): No heartbeat from core client for 30 sec - exiting
02:12:43 (5812): No heartbeat from core client for 30 sec - exiting
02:12:44 (5812): No heartbeat from core client for 30 sec - exiting
02:12:45 (5812): No heartbeat from core client for 30 sec - exiting
02:12:46 (5812): No heartbeat from core client for 30 sec - exiting
02:12:47 (5812): No heartbeat from core client for 30 sec - exiting
02:12:48 (5812): No heartbeat from core client for 30 sec - exiting
02:12:49 (5812): No heartbeat from core client for 30 sec - exiting
02:12:50 (5812): No heartbeat from core client for 30 sec - exiting
02:12:51 (5812): No heartbeat from core client for 30 sec - exiting
02:12:52 (5812): No heartbeat from core client for 30 sec - exiting
02:12:53 (5812): No heartbeat from core client for 30 sec - exiting
02:12:54 (5812): No heartbeat from core client for 30 sec - exiting
02:12:55 (5812): No heartbeat from core client for 30 sec - exiting
02:12:56 (5812): No heartbeat from core client for 30 sec - exiting
02:12:58 (5812): No heartbeat from core client for 30 sec - exiting
02:12:59 (5812): No heartbeat from core client for 30 sec - exiting
02:13:00 (5812): No heartbeat from core client for 30 sec - exiting
02:13:01 (5812): No heartbeat from core client for 30 sec - exiting
02:13:02 (5812): No heartbeat from core client for 30 sec - exiting
02:13:03 (5812): No heartbeat from core client for 30 sec - exiting
02:13:04 (5812): No heartbeat from core client for 30 sec - exiting
02:13:05 (5812): No heartbeat from core client for 30 sec - exiting
02:13:06 (5812): No heartbeat from core client for 30 sec - exiting
02:13:07 (5812): No heartbeat from core client for 30 sec - exiting
02:13:08 (5812): No heartbeat from core client for 30 sec - exiting
02:13:09 (5812): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7516, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7516, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7516, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7516, iMonCtr=1
Model crash detected, will try to restart...
20:36:29 (7516): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=352, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=352, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
01 Dec 2011 10:06:43 1043044 13536189 hadcm3n_yiv3_1900_40_007515260_1 259,200 529,014 2.0409
30 Nov 2011 07:06:08 1043044 13536189 hadcm3n_yiv3_1900_40_007515260_1 233,280 482,232 2.0672
29 Nov 2011 16:02:57 1043044 13536189 hadcm3n_yiv3_1900_40_007515260_1 207,360 432,724 2.0868
29 Nov 2011 01:03:31 1043044 13536189 hadcm3n_yiv3_1900_40_007515260_1 181,440 384,580 2.1196
28 Nov 2011 09:50:56 1043044 13536189 hadcm3n_yiv3_1900_40_007515260_1 155,520 331,740 2.1331
27 Nov 2011 17:18:04 1043044 13536189 hadcm3n_yiv3_1900_40_007515260_1 129,600 278,936 2.1523
26 Nov 2011 20:06:22 1043044 13536189 hadcm3n_yiv3_1900_40_007515260_1 103,680 225,195 2.1720
26 Nov 2011 04:13:29 1043044 13536189 hadcm3n_yiv3_1900_40_007515260_1 77,760 170,443 2.1919
25 Nov 2011 12:01:50 1043044 13536189 hadcm3n_yiv3_1900_40_007515260_1 51,840 114,426 2.2073
24 Nov 2011 19:20:51 1043044 13536189 hadcm3n_yiv3_1900_40_007515260_1 25,920 57,071 2.2018


©2024 cpdn.org