Task 13663222

Name hadcm3n_o7jq_1980_40_007426166_2
Workunit 7623669
Created 26 Nov 2011, 8:43:30 UTC
Sent 26 Nov 2011, 8:50:43 UTC
Report deadline 25 Feb 2012, 16:17:54 UTC
Received 18 Mar 2012, 18:55:18 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1164195
Run time 21 days 7 hours 15 min 44 sec
CPU time 19 days 14 hours 11 min 29 sec
Validate state Invalid
Credit 12,441.60
Device peak FLOPS 2.50 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:04:31 (3776): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2552, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2732, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2732, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2732, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2520, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:17:12 (2796): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:06:44 (4488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1436, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3352, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4848, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3764, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4520, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/o7jqko.pjj6c10
Error converting file to netcdf: dataout/o7jqko.pij6c10
Error converting file to netcdf: dataout/o7jqko.pfj6c10
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5108, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4592, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
12:51:46 (3216): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:51:47 (3216): No heartbeat from core client for 30 sec - exiting
12:51:48 (3216): No heartbeat from core client for 30 sec - exiting
12:51:49 (3216): No heartbeat from core client for 30 sec - exiting
12:51:50 (3216): No heartbeat from core client for 30 sec - exiting
12:51:51 (3216): No heartbeat from core client for 30 sec - exiting
12:51:52 (3216): No heartbeat from core client for 30 sec - exiting
12:51:53 (3216): No heartbeat from core client for 30 sec - exiting
12:51:54 (3216): No heartbeat from core client for 30 sec - exiting
12:51:55 (3216): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4300, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1108, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:20:03 (4588): No heartbeat from core client for 30 sec - exiting
08:20:04 (4588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3912, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4176, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4864, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4996, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4980, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4692, iMonCtr=1
Model crash detected, will try to restart...
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4184, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4184, iMonCtr=1
Model crash detected, will try to restart...
08:21:14 (1252): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4920, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
07:53:42 (3388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:04:36 (3424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5060, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=780, iMonCtr=1
Model crash detected, will try to restart...
07:56:32 (4332): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3148, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4204, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4012, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3724, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
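The stderr output above repeats a handful of message types many times, so a tally per type is easier to read than the raw log. The following is a minimal sketch, assuming the log has been saved as plain text; the `PATTERNS` labels and the `tally_events` helper are hypothetical names chosen for illustration, and the substrings are copied from the messages shown above.

```python
from collections import Counter

# Substrings copied from the stderr messages above; the labels are made up.
PATTERNS = {
    "quit_request": "Quit request from BOINC",
    "heartbeat_lost": "No heartbeat from core client",
    "model_restart": "Model crash detected",
    "buffin_error": "BUFFIN: C I/O Error",
}


def tally_events(stderr_text: str) -> Counter:
    """Count how many log lines contain each known message substring."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, needle in PATTERNS.items():
            if needle in line:
                counts[label] += 1
    return counts


# A short excerpt of the log above, used as sample input.
sample = """CPDN Monitor - Quit request from BOINC...
10:04:31 (3776): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Model crash detected, will try to restart...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
"""

counts = tally_events(sample)
for label, n in sorted(counts.items()):
    print(f"{label}: {n}")
```

Running the same function over the full stderr text would show how often the model was restarted relative to the number of quit requests and lost heartbeats.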
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
18 Mar 2012 19:14:26 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 1,036,800 | 1,692,684 | 1.6326
13 Mar 2012 22:27:47 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 1,010,880 | 1,649,692 | 1.6319
13 Mar 2012 22:27:47 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 984,960 | 1,608,266 | 1.6328
09 Mar 2012 22:50:27 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 959,040 | 1,566,778 | 1.6337
09 Mar 2012 22:50:27 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 933,120 | 1,526,229 | 1.6356
09 Mar 2012 22:50:27 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 907,200 | 1,485,831 | 1.6378
09 Mar 2012 22:50:27 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 881,280 | 1,445,738 | 1.6405
26 Feb 2012 16:16:52 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 855,360 | 1,405,673 | 1.6434
22 Feb 2012 20:08:03 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 829,440 | 1,365,764 | 1.6466
21 Feb 2012 12:46:59 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 803,520 | 1,324,563 | 1.6485
21 Feb 2012 12:46:59 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 777,600 | 1,284,152 | 1.6514
21 Feb 2012 12:46:59 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 751,680 | 1,243,964 | 1.6549
21 Feb 2012 12:46:59 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 725,760 | 1,204,018 | 1.6590
09 Feb 2012 22:57:52 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 699,840 | 1,161,516 | 1.6597
06 Feb 2012 19:31:36 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 673,920 | 1,118,993 | 1.6604
06 Feb 2012 19:31:36 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 648,000 | 1,090,055 | 1.6822
30 Jan 2012 23:05:57 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 622,080 | 1,049,369 | 1.6869
30 Jan 2012 23:05:57 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 596,160 | 1,009,443 | 1.6932
30 Jan 2012 23:05:57 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 570,240 | 968,101 | 1.6977
30 Jan 2012 23:05:57 | 1164195 | 13663222 | hadcm3n_o7jq_1980_40_007426166_2 | 544,320 | 922,438 | 1.6947
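
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep reached; this is an inference from the numbers above, not something the page states. A small sketch that checks it against the two most recent trickles follows, where `seconds_per_timestep` is a hypothetical helper name and the row values are copied from the table.

```python
def seconds_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Cumulative CPU seconds spent per model timestep (assumed definition)."""
    return cpu_seconds / timestep


# (timestep, CPU time in seconds) for the two most recent trickles above.
rows = [
    (1_036_800, 1_692_684),
    (1_010_880, 1_649_692),
]

# Round to four decimals, the precision used in the table.
averages = [round(seconds_per_timestep(cpu, ts), 4) for ts, cpu in rows]
print(averages)
```

Under that assumption the two values come out as 1.6326 and 1.6319, matching the reported column for those rows.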

