Task 15836312

Name: hadcm3n_n31j_1960_40_008389986_2
Workunit: 8540845
Created: 9 Jun 2013, 9:04:05 UTC
Sent: 9 Jun 2013, 9:22:27 UTC
Report deadline: 8 Sep 2013, 16:49:38 UTC
Received: 14 Aug 2013, 20:09:03 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 22 (0x00000016) Unknown error code
Computer ID: 1228459
Run time: 12 days 9 hours 15 min 35 sec
CPU time: 12 days 0 hours 55 min 36 sec
Validate state: Invalid
Credit: 9,953.28
Device peak FLOPS: 3.30 GFLOPS
Application version: UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7868, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
09:27:03 (7516): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:03:32 (6924): No heartbeat from core client for 30 sec - exiting
09:03:33 (6924): No heartbeat from core client for 30 sec - exiting
09:03:34 (6924): No heartbeat from core client for 30 sec - exiting
09:03:35 (6924): No heartbeat from core client for 30 sec - exiting
09:03:36 (6924): No heartbeat from core client for 30 sec - exiting
09:03:37 (6924): No heartbeat from core client for 30 sec - exiting
09:03:38 (6924): No heartbeat from core client for 30 sec - exiting
09:03:39 (6924): No heartbeat from core client for 30 sec - exiting
09:03:40 (6924): No heartbeat from core client for 30 sec - exiting
09:03:41 (6924): No heartbeat from core client for 30 sec - exiting
09:03:42 (6924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
08:34:40 (6200): No heartbeat from core client for 30 sec - exiting
08:34:42 (6200): No heartbeat from core client for 30 sec - exiting
08:34:43 (6200): No heartbeat from core client for 30 sec - exiting
08:34:44 (6200): No heartbeat from core client for 30 sec - exiting
08:34:45 (6200): No heartbeat from core client for 30 sec - exiting
08:34:46 (6200): No heartbeat from core client for 30 sec - exiting
08:34:47 (6200): No heartbeat from core client for 30 sec - exiting
08:34:48 (6200): No heartbeat from core client for 30 sec - exiting
08:34:49 (6200): No heartbeat from core client for 30 sec - exiting
08:34:50 (6200): No heartbeat from core client for 30 sec - exiting
08:34:51 (6200): No heartbeat from core client for 30 sec - exiting
08:34:52 (6200): No heartbeat from core client for 30 sec - exiting
08:34:53 (6200): No heartbeat from core client for 30 sec - exiting
08:34:54 (6200): No heartbeat from core client for 30 sec - exiting
08:34:55 (6200): No heartbeat from core client for 30 sec - exiting
08:34:56 (6200): No heartbeat from core client for 30 sec - exiting
08:34:57 (6200): No heartbeat from core client for 30 sec - exiting
08:34:58 (6200): No heartbeat from core client for 30 sec - exiting
08:34:59 (6200): No heartbeat from core client for 30 sec - exiting
08:35:00 (6200): No heartbeat from core client for 30 sec - exiting
08:35:01 (6200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8344, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7808, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7808, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7808, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7808, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7808, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7808, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
14 Aug 2013 20:09:49 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 829,440 | 1,023,142 | 1.2335
14 Aug 2013 20:09:49 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 803,520 | 992,760 | 1.2355
14 Aug 2013 20:09:49 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 777,600 | 962,835 | 1.2382
14 Aug 2013 20:09:49 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 751,680 | 932,432 | 1.2405
14 Aug 2013 20:09:49 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 725,760 | 902,040 | 1.2429
14 Aug 2013 20:09:49 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 699,840 | 871,149 | 1.2448
14 Aug 2013 20:09:49 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 673,920 | 839,712 | 1.2460
14 Aug 2013 20:09:49 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 648,000 | 810,819 | 1.2513
14 Aug 2013 20:09:49 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 622,080 | 782,894 | 1.2585
14 Aug 2013 20:09:49 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 596,160 | 753,053 | 1.2632
14 Aug 2013 20:09:49 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 570,240 | 722,558 | 1.2671
25 Jul 2013 11:56:37 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 544,320 | 691,736 | 1.2708
23 Jul 2013 22:14:39 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 518,400 | 660,795 | 1.2747
23 Jul 2013 21:53:49 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 492,480 | 629,329 | 1.2779
23 Jul 2013 20:20:45 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 466,560 | 596,323 | 1.2781
08 Jul 2013 23:02:06 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 440,640 | 563,029 | 1.2778
07 Jul 2013 12:21:44 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 414,720 | 530,176 | 1.2784
06 Jul 2013 05:21:14 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 388,800 | 498,115 | 1.2812
04 Jul 2013 14:28:08 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 362,880 | 465,774 | 1.2835
02 Jul 2013 10:52:06 | 1228459 | 15836312 | hadcm3n_n31j_1960_40_008389986_2 | 336,960 | 433,178 | 1.2855
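
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, shown to four decimal places. A minimal sketch that checks this reading against two rows of the table; the function name is hypothetical and is not part of BOINC or the CPDN software, and the rounding rule is an assumption inferred from the listed values:

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """CPU seconds per model timestep, rounded to 4 places (assumed page format)."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec, listed_average) copied from the first and last table rows
rows = [
    (829_440, 1_023_142, 1.2335),
    (336_960, 433_178, 1.2855),
]

for timestep, cpu_time_sec, listed in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"timestep {timestep}: computed {computed:.4f}, listed {listed:.4f}")
```

For both rows checked here the computed value agrees with the listed one.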


©2024 climateprediction.net