climateprediction.net home page
Task 14278414

Task 14278414

Name hadcm3n_yeib_1940_40_007831357_0
Workunit 7986469
Created 17 Mar 2012, 15:57:36 UTC
Sent 17 Mar 2012, 15:57:58 UTC
Report deadline 16 Jun 2012, 23:25:09 UTC
Received 26 Jul 2012, 15:57:02 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1196049
Run time 16 days 19 hours 29 min 44 sec
CPU time 13 days 13 hours 55 min 41 sec
Validate state Invalid
Credit 4,665.60
Device peak FLOPS 1.53 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:06:26 (1612): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:06:27 (1612): No heartbeat from core client for 30 sec - exiting
14:06:28 (1612): No heartbeat from core client for 30 sec - exiting
15:28:31 (5200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:28:32 (5200): No heartbeat from core client for 30 sec - exiting
16:04:18 (2624): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:04:19 (2624): No heartbeat from core client for 30 sec - exiting
16:27:29 (7364): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:27:30 (7364): No heartbeat from core client for 30 sec - exiting
16:27:31 (7364): No heartbeat from core client for 30 sec - exiting
16:27:32 (7364): No heartbeat from core client for 30 sec - exiting
16:27:33 (7364): No heartbeat from core client for 30 sec - exiting
16:27:34 (7364): No heartbeat from core client for 30 sec - exiting
18:33:02 (7800): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:33:04 (7800): No heartbeat from core client for 30 sec - exiting
18:33:05 (7800): No heartbeat from core client for 30 sec - exiting
18:33:06 (7800): No heartbeat from core client for 30 sec - exiting
18:33:07 (7800): No heartbeat from core client for 30 sec - exiting
18:33:08 (7800): No heartbeat from core client for 30 sec - exiting
18:33:09 (7800): No heartbeat from core client for 30 sec - exiting
18:33:10 (7800): No heartbeat from core client for 30 sec - exiting
18:33:11 (7800): No heartbeat from core client for 30 sec - exiting
18:33:12 (7800): No heartbeat from core client for 30 sec - exiting
18:33:13 (7800): No heartbeat from core client for 30 sec - exiting
18:33:14 (7800): No heartbeat from core client for 30 sec - exiting
18:33:15 (7800): No heartbeat from core client for 30 sec - exiting
20:32:31 (7612): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:32:32 (7612): No heartbeat from core client for 30 sec - exiting
20:36:32 (6700): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:36:34 (6700): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
21:15:49 (4564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:23:12 (5588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:23:13 (5588): No heartbeat from core client for 30 sec - exiting
21:23:14 (5588): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
09:05:28 (5196): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:05:30 (5196): No heartbeat from core client for 30 sec - exiting
09:05:31 (5196): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Hold Restart file rename failed on atmos_restart.hold
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2228, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2228, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2228, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2228, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2228, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2228, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
24 Jul 2012 16:57:25 1196049 14278414 hadcm3n_yeib_1940_40_007831357_0 388,800 1,167,153 3.0019
23 Jul 2012 14:08:56 1196049 14278414 hadcm3n_yeib_1940_40_007831357_0 362,880 1,090,897 3.0062
21 Jul 2012 14:52:24 1196049 14278414 hadcm3n_yeib_1940_40_007831357_0 336,960 1,014,564 3.0109
20 Jul 2012 12:37:47 1196049 14278414 hadcm3n_yeib_1940_40_007831357_0 311,040 938,582 3.0176
19 Jul 2012 10:01:30 1196049 14278414 hadcm3n_yeib_1940_40_007831357_0 285,120 862,506 3.0251
18 Jul 2012 05:48:33 1196049 14278414 hadcm3n_yeib_1940_40_007831357_0 259,200 786,257 3.0334
16 Jul 2012 20:37:21 1196049 14278414 hadcm3n_yeib_1940_40_007831357_0 233,280 708,158 3.0357
15 Jul 2012 19:21:52 1196049 14278414 hadcm3n_yeib_1940_40_007831357_0 207,360 630,320 3.0397
15 Jul 2012 19:21:52 1196049 14278414 hadcm3n_yeib_1940_40_007831357_0 181,440 553,786 3.0522
15 Jul 2012 19:21:52 1196049 14278414 hadcm3n_yeib_1940_40_007831357_0 155,520 477,285 3.0690
15 Jul 2012 19:21:52 1196049 14278414 hadcm3n_yeib_1940_40_007831357_0 129,600 400,761 3.0923
15 Jul 2012 19:21:52 1196049 14278414 hadcm3n_yeib_1940_40_007831357_0 103,680 324,127 3.1262
15 Jul 2012 19:21:52 1196049 14278414 hadcm3n_yeib_1940_40_007831357_0 77,760 245,691 3.1596
07 Jul 2012 17:16:08 1196049 14278414 hadcm3n_yeib_1940_40_007831357_0 51,840 167,179 3.2249
06 Jul 2012 09:39:21 1196049 14278414 hadcm3n_yeib_1940_40_007831357_0 25,920 78,809 3.0405


©2024 climateprediction.net