climateprediction.net home page
Task 13406585

Task 13406585

Name hadcm3n_u2j3_1980_40_007458352_0
Workunit 7655855
Created 22 Sep 2011, 14:18:37 UTC
Sent 22 Sep 2011, 14:27:18 UTC
Report deadline 22 Dec 2011, 21:54:29 UTC
Received 12 Oct 2011, 4:52:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1122757
Run time 19 days 3 hours 49 min 15 sec
CPU time 18 days 22 hours 13 min 53 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 1.70 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
00:38:40 (5036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:38:49 (5036): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
02:23:55 (6048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:24:44 (6048): No heartbeat from core client for 30 sec - exiting
02:24:45 (6048): No heartbeat from core client for 30 sec - exiting
02:24:46 (6048): No heartbeat from core client for 30 sec - exiting
02:24:48 (6048): No heartbeat from core client for 30 sec - exiting
02:24:49 (6048): No heartbeat from core client for 30 sec - exiting
02:24:50 (6048): No heartbeat from core client for 30 sec - exiting
02:24:51 (6048): No heartbeat from core client for 30 sec - exiting
02:24:52 (6048): No heartbeat from core client for 30 sec - exiting
02:24:53 (6048): No heartbeat from core client for 30 sec - exiting
02:24:54 (6048): No heartbeat from core client for 30 sec - exiting
02:24:55 (6048): No heartbeat from core client for 30 sec - exiting
02:24:56 (6048): No heartbeat from core client for 30 sec - exiting
02:24:57 (6048): No heartbeat from core client for 30 sec - exiting
02:24:58 (6048): No heartbeat from core client for 30 sec - exiting
02:25:00 (6048): No heartbeat from core client for 30 sec - exiting
02:25:01 (6048): No heartbeat from core client for 30 sec - exiting
02:25:02 (6048): No heartbeat from core client for 30 sec - exiting
02:25:03 (6048): No heartbeat from core client for 30 sec - exiting
02:25:04 (6048): No heartbeat from core client for 30 sec - exiting
02:25:05 (6048): No heartbeat from core client for 30 sec - exiting
02:25:06 (6048): No heartbeat from core client for 30 sec - exiting
02:25:07 (6048): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:11:49 (4136): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:12:07 (4136): No heartbeat from core client for 30 sec - exiting
17:12:08 (4136): No heartbeat from core client for 30 sec - exiting
17:12:09 (4136): No heartbeat from core client for 30 sec - exiting
17:12:10 (4136): No heartbeat from core client for 30 sec - exiting
17:12:12 (4136): No heartbeat from core client for 30 sec - exiting
17:12:13 (4136): No heartbeat from core client for 30 sec - exiting
17:12:14 (4136): No heartbeat from core client for 30 sec - exiting
17:12:15 (4136): No heartbeat from core client for 30 sec - exiting
17:12:16 (4136): No heartbeat from core client for 30 sec - exiting
17:12:17 (4136): No heartbeat from core client for 30 sec - exiting
17:12:18 (4136): No heartbeat from core client for 30 sec - exiting
17:12:19 (4136): No heartbeat from core client for 30 sec - exiting
17:12:20 (4136): No heartbeat from core client for 30 sec - exiting
17:12:21 (4136): No heartbeat from core client for 30 sec - exiting
17:12:22 (4136): No heartbeat from core client for 30 sec - exiting
17:12:24 (4136): No heartbeat from core client for 30 sec - exiting
17:12:25 (4136): No heartbeat from core client for 30 sec - exiting
17:12:26 (4136): No heartbeat from core client for 30 sec - exiting
17:12:27 (4136): No heartbeat from core client for 30 sec - exiting
17:12:28 (4136): No heartbeat from core client for 30 sec - exiting
17:12:29 (4136): No heartbeat from core client for 30 sec - exiting
17:12:30 (4136): No heartbeat from core client for 30 sec - exiting
17:12:31 (4136): No heartbeat from core client for 30 sec - exiting
17:12:32 (4136): No heartbeat from core client for 30 sec - exiting
17:12:33 (4136): No heartbeat from core client for 30 sec - exiting
17:12:34 (4136): No heartbeat from core client for 30 sec - exiting
17:12:36 (4136): No heartbeat from core client for 30 sec - exiting
17:12:37 (4136): No heartbeat from core client for 30 sec - exiting
17:12:38 (4136): No heartbeat from core client for 30 sec - exiting
17:12:39 (4136): No heartbeat from core client for 30 sec - exiting
17:12:40 (4136): No heartbeat from core client for 30 sec - exiting
17:12:41 (4136): No heartbeat from core client for 30 sec - exiting
17:12:42 (4136): No heartbeat from core client for 30 sec - exiting
17:12:43 (4136): No heartbeat from core client for 30 sec - exiting
17:12:44 (4136): No heartbeat from core client for 30 sec - exiting
17:12:45 (4136): No heartbeat from core client for 30 sec - exiting
17:12:46 (4136): No heartbeat from core client for 30 sec - exiting
17:12:48 (4136): No heartbeat from core client for 30 sec - exiting
17:12:49 (4136): No heartbeat from core client for 30 sec - exiting
17:12:50 (4136): No heartbeat from core client for 30 sec - exiting
17:12:51 (4136): No heartbeat from core client for 30 sec - exiting
17:12:52 (4136): No heartbeat from core client for 30 sec - exiting
17:12:53 (4136): No heartbeat from core client for 30 sec - exiting
17:12:54 (4136): No heartbeat from core client for 30 sec - exiting
17:12:55 (4136): No heartbeat from core client for 30 sec - exiting
17:12:56 (4136): No heartbeat from core client for 30 sec - exiting
17:12:57 (4136): No heartbeat from core client for 30 sec - exiting
17:12:58 (4136): No heartbeat from core client for 30 sec - exiting
17:13:00 (4136): No heartbeat from core client for 30 sec - exiting
17:13:01 (4136): No heartbeat from core client for 30 sec - exiting
17:13:02 (4136): No heartbeat from core client for 30 sec - exiting
17:13:03 (4136): No heartbeat from core client for 30 sec - exiting
17:13:04 (4136): No heartbeat from core client for 30 sec - exiting
17:13:05 (4136): No heartbeat from core client for 30 sec - exiting
17:13:06 (4136): No heartbeat from core client for 30 sec - exiting
17:13:07 (4136): No heartbeat from core client for 30 sec - exiting
17:13:08 (4136): No heartbeat from core client for 30 sec - exiting
17:13:09 (4136): No heartbeat from core client for 30 sec - exiting
17:13:10 (4136): No heartbeat from core client for 30 sec - exiting
17:13:12 (4136): No heartbeat from core client for 30 sec - exiting
17:13:13 (4136): No heartbeat from core client for 30 sec - exiting
17:13:14 (4136): No heartbeat from core client for 30 sec - exiting
17:13:15 (4136): No heartbeat from core client for 30 sec - exiting
17:13:16 (4136): No heartbeat from core client for 30 sec - exiting
17:13:17 (4136): No heartbeat from core client for 30 sec - exiting
17:13:18 (4136): No heartbeat from core client for 30 sec - exiting
17:13:19 (4136): No heartbeat from core client for 30 sec - exiting
17:13:20 (4136): No heartbeat from core client for 30 sec - exiting
17:13:21 (4136): No heartbeat from core client for 30 sec - exiting
17:13:22 (4136): No heartbeat from core client for 30 sec - exiting
17:23:27 (4768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:24:42 (3324): No heartbeat from core client for 30 sec - exiting
17:24:43 (3324): No heartbeat from core client for 30 sec - exiting
17:24:44 (3324): No heartbeat from core client for 30 sec - exiting
17:24:45 (3324): No heartbeat from core client for 30 sec - exiting
17:24:46 (3324): No heartbeat from core client for 30 sec - exiting
17:24:47 (3324): No heartbeat from core client for 30 sec - exiting
17:24:49 (3324): No heartbeat from core client for 30 sec - exiting
17:24:50 (3324): No heartbeat from core client for 30 sec - exiting
17:24:51 (3324): No heartbeat from core client for 30 sec - exiting
17:24:52 (3324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
17:32:28 (5380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:32:29 (5380): No heartbeat from core client for 30 sec - exiting
17:32:30 (5380): No heartbeat from core client for 30 sec - exiting
17:32:31 (5380): No heartbeat from core client for 30 sec - exiting
17:32:32 (5380): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
11 Oct 2011 12:59:28 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 518,400 1,587,967 3.0632
10 Oct 2011 14:16:07 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 492,480 1,507,973 3.0620
09 Oct 2011 15:58:23 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 466,560 1,428,276 3.0613
08 Oct 2011 17:52:12 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 440,640 1,349,073 3.0616
07 Oct 2011 19:49:08 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 414,720 1,269,972 3.0622
06 Oct 2011 22:03:38 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 388,800 1,190,963 3.0632
05 Oct 2011 22:48:46 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 362,880 1,111,810 3.0639
04 Oct 2011 22:54:49 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 336,960 1,032,157 3.0631
03 Oct 2011 23:53:20 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 311,040 952,290 3.0616
03 Oct 2011 02:05:48 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 285,120 873,101 3.0622
02 Oct 2011 02:00:24 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 259,200 794,021 3.0634
01 Oct 2011 03:53:53 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 233,280 715,536 3.0673
30 Sep 2011 05:48:29 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 207,360 636,795 3.0710
29 Sep 2011 07:06:46 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 181,440 555,339 3.0607
28 Sep 2011 07:49:37 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 155,520 475,885 3.0600
27 Sep 2011 09:43:43 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 129,600 396,449 3.0590
26 Sep 2011 11:41:36 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 103,680 317,385 3.0612
25 Sep 2011 13:18:27 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 77,760 237,503 3.0543
24 Sep 2011 15:01:04 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 51,840 158,785 3.0630
23 Sep 2011 16:47:11 1122757 13406585 hadcm3n_u2j3_1980_40_007458352_0 25,920 79,307 3.0597


©2024 cpdn.org