climateprediction.net home page
Task 14105993

Task 14105993

Name hadcm3n_y99z_1900_40_007521708_4
Workunit 7719183
Created 17 Feb 2012, 19:30:09 UTC
Sent 17 Feb 2012, 19:30:13 UTC
Report deadline 19 May 2012, 2:57:24 UTC
Received 29 Mar 2012, 19:31:46 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 725427
Run time 13 days 8 hours 50 min 42 sec
CPU time 9 days 18 hours 10 min 54 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 2.18 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.6.36</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5116, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4704, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3592, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4344, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5624, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7776, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=908, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3404, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3404, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5616, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5024, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9432, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5992, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5388, iMonCtr=1
Model crash detected, will try to restart...
15:53:17 (2768): No heartbeat from core client for 30 sec - exiting
15:53:18 (2768): No heartbeat from core client for 30 sec - exiting
15:53:19 (2768): No heartbeat from core client for 30 sec - exiting
15:53:20 (2768): No heartbeat from core client for 30 sec - exiting
15:53:21 (2768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5380, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting,CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5572, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4600, iMonCtr=1
Model crash detected, will try to restart...
16:13:50 (4532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:13:51 (4532): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6492, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6596, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4164, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4396, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4396, iMonCtr=1
Model crash detected, will try to restart...
16:00:23 (6452): No heartbeat from core client for 30 sec - exiting
16:00:24 (6452): No heartbeat from core client for 30 sec - exiting
16:00:25 (6452): No heartbeat from core client for 30 sec - exiting
16:00:26 (6452): No heartbeat from core client for 30 sec - exiting
16:00:27 (6452): No heartbeat from core client for 30 sec - exiting
16:00:28 (6452): No heartbeat from core client for 30 sec - exiting
16:00:29 (6452): No heartbeat from core client for 30 sec - exiting
16:00:30 (6452): No heartbeat from core client for 30 sec - exiting
16:00:31 (6452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6940, iMonCtr=1
Model crash detected, will try to restart...
07:56:06 (5388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:00:24 (4916): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5276, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3108, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3864, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5208, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5984, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5236, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5236, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/y99zko.pja8c10
Error converting file to netcdf: dataout/y99zko.pia8c10
Error converting file to netcdf: dataout/y99zko.pfa8c10
Error converting file to netcdf: dataout/y99zka.pha8c10
Error converting file to netcdf: dataout/y99zka.pga8c10
Error converting file to netcdf: dataout/y99zka.pea8c10
Error converting file to netcdf: dataout/y99zka.pda8c10
16:33:17 (1488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5868, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3776, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
16:27:12 (5408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:52:59 (6116): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2500, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
29 Mar 2012 19:34:21 725427 14105993 hadcm3n_y99z_1900_40_007521708_4 259,200 843,035 3.2524
25 Mar 2012 16:27:33 725427 14105993 hadcm3n_y99z_1900_40_007521708_4 233,280 759,445 3.2555
23 Mar 2012 16:34:35 725427 14105993 hadcm3n_y99z_1900_40_007521708_4 207,360 671,660 3.2391
19 Mar 2012 17:01:29 725427 14105993 hadcm3n_y99z_1900_40_007521708_4 181,440 584,526 3.2216
17 Mar 2012 08:54:18 725427 14105993 hadcm3n_y99z_1900_40_007521708_4 155,520 502,801 3.2330
12 Mar 2012 20:23:23 725427 14105993 hadcm3n_y99z_1900_40_007521708_4 129,600 422,109 3.2570
09 Mar 2012 21:38:29 725427 14105993 hadcm3n_y99z_1900_40_007521708_4 103,680 335,072 3.2318
07 Mar 2012 17:58:52 725427 14105993 hadcm3n_y99z_1900_40_007521708_4 77,760 252,517 3.2474
02 Mar 2012 22:12:10 725427 14105993 hadcm3n_y99z_1900_40_007521708_4 51,840 166,939 3.2203
24 Feb 2012 18:22:31 725427 14105993 hadcm3n_y99z_1900_40_007521708_4 25,920 82,929 3.1994


©2024 climateprediction.net