climateprediction.net home page
Task 13126642

Task 13126642

Name hadcm3n_ym0f_1900_40_007361385_1
Workunit 7558815
Created 6 Jul 2011, 15:18:53 UTC
Sent 7 Jul 2011, 14:53:33 UTC
Report deadline 6 Oct 2011, 22:20:44 UTC
Received 25 Oct 2011, 16:03:34 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1053567
Run time 17 days 18 hours 53 min 22 sec
CPU time 17 days 12 hours 33 min 55 sec
Validate state Invalid
Credit 8,709.12
Device peak FLOPS 2.74 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5416, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6816, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=224, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2308, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1416, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=736, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
19:49:03 (5600): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:23:23 (964): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:23:24 (964): No heartbeat from core client for 30 sec - exiting
14:23:25 (964): No heartbeat from core client for 30 sec - exiting
14:25:00 (5064): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:25:01 (5064): No heartbeat from core client for 30 sec - exiting
14:25:02 (5064): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=652, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2228, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1444, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4436, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
14:47:10 (4680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:47:11 (4680): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5236, iMonCtr=1
Model crash detected, will try to restart...
15:24:02 (5236): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=360, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
03 Oct 2011 09:29:58 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 725,760 1,509,815 2.0803
02 Oct 2011 10:32:11 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 699,840 1,476,325 2.1095
30 Sep 2011 06:59:23 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 673,920 1,443,642 2.1422
29 Sep 2011 22:06:09 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 648,000 1,411,059 2.1776
29 Sep 2011 12:44:14 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 622,080 1,378,499 2.2160
29 Sep 2011 03:38:53 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 596,160 1,345,920 2.2576
28 Sep 2011 18:30:08 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 570,240 1,313,309 2.3031
27 Sep 2011 14:07:58 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 544,320 1,280,395 2.3523
26 Sep 2011 14:58:52 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 518,400 1,246,228 2.4040
22 Sep 2011 14:03:10 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 492,480 1,213,946 2.4650
21 Sep 2011 14:51:29 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 466,560 1,179,550 2.5282
20 Sep 2011 17:56:21 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 440,640 1,144,844 2.5981
10 Aug 2011 07:08:14 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 414,720 542,485 1.3081
08 Aug 2011 07:46:33 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 388,800 507,897 1.3063
02 Aug 2011 14:17:01 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 362,880 473,898 1.3059
01 Aug 2011 09:54:24 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 336,960 439,941 1.3056
30 Jul 2011 14:16:59 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 311,040 405,577 1.3039
26 Jul 2011 15:27:43 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 285,120 371,985 1.3047
26 Jul 2011 06:10:33 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 259,200 339,033 1.3080
25 Jul 2011 22:48:53 1053567 13126642 hadcm3n_ym0f_1900_40_007361385_1 233,280 304,602 1.3057


©2024 cpdn.org