climateprediction.net home page
Task 16077501

Task 16077501

Name hadcm3n_o4uh_1980_40_008386541_3
Workunit 8537400
Created 3 Nov 2013, 21:23:26 UTC
Sent 3 Nov 2013, 21:23:28 UTC
Report deadline 3 Feb 2014, 4:50:39 UTC
Received 31 Jan 2014, 16:11:21 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1260844
Run time 25 days 0 hours 48 min 51 sec
CPU time 24 days 0 hours 8 min 25 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 1.37 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.2.11</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code 193 (0xc1)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2164, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1240, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:53:54 (3892): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4004, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3344, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4636, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3956, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:06:43 (3488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:26:33 (2228): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1052, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2228, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4028, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3788, iMonCtr=1
Model crash detected, will try to restart...
16:25:25 (3340): No heartbeat from core client for 30 sec - exiting
16:25:26 (3340): No heartbeat from core client for 30 sec - exiting
16:25:27 (3340): No heartbeat from core client for 30 sec - exiting
16:25:28 (3340): No heartbeat from core client for 30 sec - exiting
16:25:29 (3340): No heartbeat from core client for 30 sec - exiting
16:25:30 (3340): No heartbeat from core client for 30 sec - exiting
16:25:31 (3340): No heartbeat from core client for 30 sec - exiting
16:25:32 (3340): No heartbeat from core client for 30 sec - exiting
16:25:33 (3340): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4588, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3152, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3380, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3216, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3056, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
17:30:08 (3068): No heartbeat from core client for 30 sec - exiting
17:30:09 (3068): No heartbeat from core client for 30 sec - exiting
17:30:10 (3068): No heartbeat from core client for 30 sec - exiting
17:30:11 (3068): No heartbeat from core client for 30 sec - exiting
17:30:12 (3068): No heartbeat from core client for 30 sec - exiting
17:30:13 (3068): No heartbeat from core client for 30 sec - exiting
17:30:14 (3068): No heartbeat from core client for 30 sec - exiting
17:30:15 (3068): No heartbeat from core client for 30 sec - exiting
17:30:16 (3068): No heartbeat from core client for 30 sec - exiting
17:30:17 (3068): No heartbeat from core client for 30 sec - exiting
17:30:18 (3068): No heartbeat from core client for 30 sec - exiting
17:30:19 (3068): No heartbeat from core client for 30 sec - exiting
17:30:20 (3068): No heartbeat from core client for 30 sec - exiting
17:30:21 (3068): No heartbeat from core client for 30 sec - exiting
17:30:22 (3068): No heartbeat from core client for 30 sec - exiting
17:30:23 (3068): No heartbeat from core client for 30 sec - exiting
17:30:24 (3068): No heartbeat from core client for 30 sec - exiting
17:30:25 (3068): No heartbeat from core client for 30 sec - exiting
17:30:26 (3068): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:34:36 (4036): No heartbeat from core client for 30 sec - exiting
15:34:37 (4036): No heartbeat from core client for 30 sec - exiting
15:34:38 (4036): No heartbeat from core client for 30 sec - exiting
15:34:39 (4036): No heartbeat from core client for 30 sec - exiting
15:34:40 (4036): No heartbeat from core client for 30 sec - exiting
15:34:41 (4036): No heartbeat from core client for 30 sec - exiting
15:34:42 (4036): No heartbeat from core client for 30 sec - exiting
15:34:43 (4036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:16:21 (3488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:54:14 (2120): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:25:48 (3600): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:57:10 (1284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:13:33 (1244): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
30 Jan 2014 16:12:00 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 518,400 2,074,090 4.0009
25 Jan 2014 15:03:34 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 492,480 1,962,662 3.9853
20 Jan 2014 20:22:18 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 466,560 1,856,016 3.9781
16 Jan 2014 17:12:14 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 440,640 1,749,173 3.9696
11 Jan 2014 21:15:42 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 414,720 1,639,274 3.9527
07 Jan 2014 17:45:30 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 388,800 1,534,432 3.9466
03 Jan 2014 14:32:21 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 362,880 1,423,257 3.9221
30 Dec 2013 14:53:45 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 336,960 1,310,736 3.8899
25 Dec 2013 21:20:59 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 311,040 1,204,698 3.8731
22 Dec 2013 20:17:49 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 285,120 1,099,821 3.8574
18 Dec 2013 21:37:07 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 259,200 995,368 3.8402
14 Dec 2013 19:52:34 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 233,280 889,158 3.8115
11 Dec 2013 20:08:38 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 207,360 781,455 3.7686
07 Dec 2013 18:22:44 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 181,440 685,754 3.7795
02 Dec 2013 18:35:03 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 155,520 585,563 3.7652
25 Nov 2013 21:22:36 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 129,600 486,197 3.7515
21 Nov 2013 16:59:04 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 103,680 387,278 3.7353
16 Nov 2013 18:45:42 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 77,760 289,741 3.7261
13 Nov 2013 15:17:56 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 51,840 194,848 3.7586
09 Nov 2013 12:32:08 1260844 16077501 hadcm3n_o4uh_1980_40_008386541_3 25,920 100,770 3.8877


©2024 climateprediction.net