Task 15765091

Name hadcm3n_3lt1_1940_40_008260310_3
Workunit 8415434
Created 7 May 2013, 17:18:09 UTC
Sent 7 May 2013, 17:19:11 UTC
Report deadline 7 Aug 2013, 0:46:22 UTC
Received 1 Jun 2013, 22:21:52 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1114586
Run time 5 days 8 hours 5 min 52 sec
CPU time 5 days 5 hours 21 min 46 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 2.69 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
02:51:36 (11764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:51:37 (11764): No heartbeat from core client for 30 sec - exiting
02:51:38 (11764): No heartbeat from core client for 30 sec - exiting
02:51:39 (11764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
02:54:46 (9420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6180, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
02:36:07 (3372): No heartbeat from core client for 30 sec - exiting
02:36:08 (3372): No heartbeat from core client for 30 sec - exiting
02:36:09 (3372): No heartbeat from core client for 30 sec - exiting
02:36:10 (3372): No heartbeat from core client for 30 sec - exiting
02:36:11 (3372): No heartbeat from core client for 30 sec - exiting
02:36:12 (3372): No heartbeat from core client for 30 sec - exiting
02:36:13 (3372): No heartbeat from core client for 30 sec - exiting
02:36:14 (3372): No heartbeat from core client for 30 sec - exiting
02:36:15 (3372): No heartbeat from core client for 30 sec - exiting
02:36:16 (3372): No heartbeat from core client for 30 sec - exiting
02:36:17 (3372): No heartbeat from core client for 30 sec - exiting
02:36:18 (3372): No heartbeat from core client for 30 sec - exiting
02:36:19 (3372): No heartbeat from core client for 30 sec - exiting
02:36:20 (3372): No heartbeat from core client for 30 sec - exiting
02:36:21 (3372): No heartbeat from core client for 30 sec - exiting
02:36:22 (3372): No heartbeat from core client for 30 sec - exiting
02:36:23 (3372): No heartbeat from core client for 30 sec - exiting
02:36:24 (3372): No heartbeat from core client for 30 sec - exiting
02:36:25 (3372): No heartbeat from core client for 30 sec - exiting
02:36:26 (3372): No heartbeat from core client for 30 sec - exiting
02:36:27 (3372): No heartbeat from core client for 30 sec - exiting
02:36:28 (3372): No heartbeat from core client for 30 sec - exiting
02:36:29 (3372): No heartbeat from core client for 30 sec - exiting
02:36:30 (3372): No heartbeat from core client for 30 sec - exiting
02:36:31 (3372): No heartbeat from core client for 30 sec - exiting
02:36:32 (3372): No heartbeat from core client for 30 sec - exiting
02:36:33 (3372): No heartbeat from core client for 30 sec - exiting
02:36:34 (3372): No heartbeat from core client for 30 sec - exiting
02:36:35 (3372): No heartbeat from core client for 30 sec - exiting
02:36:36 (3372): No heartbeat from core client for 30 sec - exiting
02:36:37 (3372): No heartbeat from core client for 30 sec - exiting
02:36:38 (3372): No heartbeat from core client for 30 sec - exiting
02:36:39 (3372): No heartbeat from core client for 30 sec - exiting
02:36:40 (3372): No heartbeat from core client for 30 sec - exiting
02:36:41 (3372): No heartbeat from core client for 30 sec - exiting
02:36:42 (3372): No heartbeat from core client for 30 sec - exiting
02:36:43 (3372): No heartbeat from core client for 30 sec - exiting
02:36:44 (3372): No heartbeat from core client for 30 sec - exiting
02:36:45 (3372): No heartbeat from core client for 30 sec - exiting
02:36:46 (3372): No heartbeat from core client for 30 sec - exiting
02:36:47 (3372): No heartbeat from core client for 30 sec - exiting
02:36:48 (3372): No heartbeat from core client for 30 sec - exiting
02:36:49 (3372): No heartbeat from core client for 30 sec - exiting
02:36:50 (3372): No heartbeat from core client for 30 sec - exiting
02:36:51 (3372): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
00:46:32 (5560): No heartbeat from core client for 30 sec - exiting
00:46:33 (5560): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3436, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3436, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5608, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5028, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5028, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5028, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)        Host ID   Result ID   Result Name                        Timestep   CPU Time (sec)   Average (sec/TS)
28 May 2013 08:28:39   1114586   15765091    hadcm3n_3lt1_1940_40_008260310_3   259,200    427,591          1.6497
23 May 2013 05:57:06   1114586   15765091    hadcm3n_3lt1_1940_40_008260310_3   233,280    386,118          1.6552
22 May 2013 17:25:21   1114586   15765091    hadcm3n_3lt1_1940_40_008260310_3   207,360    344,269          1.6602
22 May 2013 04:01:38   1114586   15765091    hadcm3n_3lt1_1940_40_008260310_3   181,440    301,398          1.6611
21 May 2013 14:00:00   1114586   15765091    hadcm3n_3lt1_1940_40_008260310_3   155,520    255,943          1.6457
20 May 2013 16:04:11   1114586   15765091    hadcm3n_3lt1_1940_40_008260310_3   129,600    212,965          1.6432
20 May 2013 03:18:58   1114586   15765091    hadcm3n_3lt1_1940_40_008260310_3   103,680    171,279          1.6520
19 May 2013 09:42:29   1114586   15765091    hadcm3n_3lt1_1940_40_008260310_3   77,760     128,084          1.6472
18 May 2013 19:56:25   1114586   15765091    hadcm3n_3lt1_1940_40_008260310_3   51,840     84,605           1.6320
15 May 2013 08:10:21   1114586   15765091    hadcm3n_3lt1_1940_40_008260310_3   25,920     41,694           1.6086
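
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The following is a minimal sketch that checks this reading against a few rows copied from the table. The function name `average_sec_per_timestep` and the `TRICKLES` list are illustrative assumptions, not part of BOINC or the CPDN software.

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (259200, 427591, 1.6497),
    (233280, 386118, 1.6552),
    (207360, 344269, 1.6602),
    (25920, 41694, 1.6086),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Return CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: the page derives its average this way; the source
    does not state the formula explicitly.
    """
    return round(cpu_time_sec / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in TRICKLES:
        computed = average_sec_per_timestep(cpu_time_sec, timestep)
        print(f"timestep={timestep} computed={computed} reported={reported}")
```

For the rows listed, the computed value agrees with the reported column, which supports reading the average as a running figure over the whole task rather than a per-trickle rate.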