climateprediction.net home page
Task 15718837

Task 15718837

Name hadcm3n_zhlr_1920_40_008316212_3
Workunit 8467347
Created 9 Apr 2013, 13:49:20 UTC
Sent 9 Apr 2013, 13:49:36 UTC
Report deadline 9 Jul 2013, 21:16:47 UTC
Received 7 May 2013, 15:49:13 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1192595
Run time 14 days 2 hours 39 min 27 sec
CPU time 11 days 16 hours 34 min 13 sec
Validate state Invalid
Credit 5,909.76
Device peak FLOPS 2.08 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:46:42 (752): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:46:43 (752): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3360, iMonCtr=1
Model crash detected, will try to restart...
07:55:14 (3516): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:45:36 (2440): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
09:45:38 (2440): No heartbeat from core client for 30 sec - exiting
09:45:39 (2440): No heartbeat from core client for 30 sec - exiting
09:45:41 (2440): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:13:47 (3660): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
06:13:50 (3660): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
12:18:50 (5008): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
07:28:45 (3392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:36:31 (4020): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:24:53 (4196): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:24:56 (4196): No heartbeat from core client for 30 sec - exiting
09:27:12 (1776): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:28:29 (3212): No heartbeat from core client for 30 sec - exiting
09:28:30 (3212): No heartbeat from core client for 30 sec - exiting
09:28:31 (3212): No heartbeat from core client for 30 sec - exiting
09:28:32 (3212): No heartbeat from core client for 30 sec - exiting
09:28:33 (3212): No heartbeat from core client for 30 sec - exiting
09:28:34 (3212): No heartbeat from core client for 30 sec - exiting
09:28:35 (3212): No heartbeat from core client for 30 sec - exiting
09:28:36 (3212): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:28:37 (3212): No heartbeat from core client for 30 sec - exiting
09:29:38 (3628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:30:16 (5052): No heartbeat from core client for 30 sec - exiting
09:30:17 (5052): No heartbeat from core client for 30 sec - exiting
09:30:18 (5052): No heartbeat from core client for 30 sec - exiting
09:30:19 (5052): No heartbeat from core client for 30 sec - exiting
09:30:20 (5052): No heartbeat from core client for 30 sec - exiting
09:30:21 (5052): No heartbeat from core client for 30 sec - exiting
09:30:22 (5052): No heartbeat from core client for 30 sec - exiting
09:30:23 (5052): No heartbeat from core client for 30 sec - exiting
09:30:24 (5052): No heartbeat from core client for 30 sec - exiting
09:30:25 (5052): No heartbeat from core client for 30 sec - exiting
09:30:26 (5052): No heartbeat from core client for 30 sec - exiting
09:30:27 (5052): No heartbeat from core client for 30 sec - exiting
09:30:28 (5052): No heartbeat from core client for 30 sec - exiting
09:30:29 (5052): No heartbeat from core client for 30 sec - exiting
09:30:30 (5052): No heartbeat from core client for 30 sec - exiting
09:30:31 (5052): No heartbeat from core client for 30 sec - exiting
09:30:32 (5052): No heartbeat from core client for 30 sec - exiting
09:30:33 (5052): No heartbeat from core client for 30 sec - exiting
09:30:34 (5052): No heartbeat from core client for 30 sec - exiting
09:30:35 (5052): No heartbeat from core client for 30 sec - exiting
09:30:36 (5052): No heartbeat from core client for 30 sec - exiting
09:30:37 (5052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:31:39 (1520): No heartbeat from core client for 30 sec - exiting
09:31:40 (1520): No heartbeat from core client for 30 sec - exiting
09:31:41 (1520): No heartbeat from core client for 30 sec - exiting
09:31:42 (1520): No heartbeat from core client for 30 sec - exiting
09:31:43 (1520): No heartbeat from core client for 30 sec - exiting
09:31:44 (1520): No heartbeat from core client for 30 sec - exiting
09:31:45 (1520): No heartbeat from core client for 30 sec - exiting
09:31:46 (1520): No heartbeat from core client for 30 sec - exiting
09:31:47 (1520): No heartbeat from core client for 30 sec - exiting
09:31:48 (1520): No heartbeat from core client for 30 sec - exiting
09:31:49 (1520): No heartbeat from core client for 30 sec - exiting
09:31:50 (1520): No heartbeat from core client for 30 sec - exiting
09:31:51 (1520): No heartbeat from core client for 30 sec - exiting
09:31:52 (1520): No heartbeat from core client for 30 sec - exiting
09:31:53 (1520): No heartbeat from core client for 30 sec - exiting
09:31:54 (1520): No heartbeat from core client for 30 sec - exiting
09:31:55 (1520): No heartbeat from core client for 30 sec - exiting
09:31:56 (1520): No heartbeat from core client for 30 sec - exiting
09:31:57 (1520): No heartbeat from core client for 30 sec - exiting
09:31:58 (1520): No heartbeat from core client for 30 sec - exiting
09:31:59 (1520): No heartbeat from core client for 30 sec - exiting
09:32:00 (1520): No heartbeat from core client for 30 sec - exiting
09:32:01 (1520): No heartbeat from core client for 30 sec - exiting
09:32:02 (1520): No heartbeat from core client for 30 sec - exiting
09:32:03 (1520): No heartbeat from core client for 30 sec - exiting
09:32:04 (1520): No heartbeat from core client for 30 sec - exiting
09:32:05 (1520): No heartbeat from core client for 30 sec - exiting
09:32:06 (1520): No heartbeat from core client for 30 sec - exiting
09:32:07 (1520): No heartbeat from core client for 30 sec - exiting
09:32:08 (1520): No heartbeat from core client for 30 sec - exiting
09:32:09 (1520): No heartbeat from core client for 30 sec - exiting
09:32:10 (1520): No heartbeat from core client for 30 sec - exiting
09:32:11 (1520): No heartbeat from core client for 30 sec - exiting
09:32:12 (1520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:33:56 (1556): No heartbeat from core client for 30 sec - exiting
09:33:57 (1556): No heartbeat from core client for 30 sec - exiting
09:33:58 (1556): No heartbeat from core client for 30 sec - exiting
09:33:59 (1556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3440, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3440, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3852, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3852, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3852, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3852, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 May 2013 01:38:47 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 492,480 970,936 1.9715
30 Apr 2013 06:04:33 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 466,560 921,431 1.9749
27 Apr 2013 11:33:47 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 440,640 871,203 1.9771
26 Apr 2013 09:33:12 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 414,720 817,097 1.9702
25 Apr 2013 10:57:52 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 388,800 760,895 1.9570
24 Apr 2013 12:51:27 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 362,880 703,858 1.9396
23 Apr 2013 14:45:35 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 336,960 651,040 1.9321
22 Apr 2013 15:20:45 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 311,040 600,970 1.9321
21 Apr 2013 05:56:18 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 285,120 553,041 1.9397
19 Apr 2013 22:05:05 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 259,200 504,830 1.9476
19 Apr 2013 07:28:11 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 233,280 453,433 1.9437
18 Apr 2013 16:57:10 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 207,360 402,300 1.9401
17 Apr 2013 20:22:46 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 181,440 351,481 1.9372
17 Apr 2013 05:50:56 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 155,520 300,476 1.9321
16 Apr 2013 15:25:51 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 129,600 249,781 1.9273
15 Apr 2013 07:09:49 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 103,680 200,160 1.9306
14 Apr 2013 17:23:05 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 77,760 151,046 1.9425
14 Apr 2013 03:24:40 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 51,840 102,078 1.9691
10 Apr 2013 17:26:24 1192595 15718837 hadcm3n_zhlr_1920_40_008316212_3 25,920 53,041 2.0463


©2024 cpdn.org