climateprediction.net home page
Task 16162213

Task 16162213

Name hadcm3n_og0a_1900_40_008475869_1
Workunit 8626708
Created 27 Dec 2013, 18:49:26 UTC
Sent 27 Dec 2013, 18:49:40 UTC
Report deadline 29 Mar 2014, 2:16:51 UTC
Received 2 Feb 2014, 15:29:30 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1184169
Run time 6 days 18 hours 2 min 44 sec
CPU time 6 days 2 hours 2 min 17 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 3.67 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.2.33</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code 193 (0xc1)
</message>
<stderr_txt>
21:27:14 (6268): No heartbeat from core client for 30 sec - exiting
21:27:15 (6268): No heartbeat from core client for 30 sec - exiting
21:27:16 (6268): No heartbeat from core client for 30 sec - exiting
21:27:17 (6268): No heartbeat from core client for 30 sec - exiting
21:27:18 (6268): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:32:49 (2676): No heartbeat from core client for 30 sec - exiting
15:32:50 (2676): No heartbeat from core client for 30 sec - exiting
15:32:51 (2676): No heartbeat from core client for 30 sec - exiting
15:32:52 (2676): No heartbeat from core client for 30 sec - exiting
15:32:53 (2676): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:14:59 (9008): No heartbeat from core client for 30 sec - exiting
16:15:00 (9008): No heartbeat from core client for 30 sec - exiting
16:15:01 (9008): No heartbeat from core client for 30 sec - exiting
16:15:02 (9008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
11:56:17 (7720): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8708, iMonCtr=1
Model crash detected, will try to restart...
15:07:09 (3980): No heartbeat from core client for 30 sec - exiting
15:07:10 (3980): No heartbeat from core client for 30 sec - exiting
15:07:11 (3980): No heartbeat from core client for 30 sec - exiting
15:07:12 (3980): No heartbeat from core client for 30 sec - exiting
15:07:13 (3980): No heartbeat from core client for 30 sec - exiting
15:07:14 (3980): No heartbeat from core client for 30 sec - exiting
15:07:15 (3980): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
21:11:19 (9088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:40:09 (7880): No heartbeat from core client for 30 sec - exiting
20:40:10 (7880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
10:20:17 (8092): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:12:07 (7580): No heartbeat from core client for 30 sec - exiting
21:12:08 (7580): No heartbeat from core client for 30 sec - exiting
21:12:09 (7580): No heartbeat from core client for 30 sec - exiting
21:12:10 (7580): No heartbeat from core client for 30 sec - exiting
21:12:11 (7580): No heartbeat from core client for 30 sec - exiting
21:12:12 (7580): No heartbeat from core client for 30 sec - exiting
21:12:13 (7580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:29:14 (8000): No heartbeat from core client for 30 sec - exiting
08:29:15 (8000): No heartbeat from core client for 30 sec - exiting
08:29:16 (8000): No heartbeat from core client for 30 sec - exiting
08:29:17 (8000): No heartbeat from core client for 30 sec - exiting
08:29:18 (8000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:02:59 (5340): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
20:24:59 (7880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/og0ako.pjb1c10
Error converting file to netcdf: dataout/og0ako.pib1c10
Error converting file to netcdf: dataout/og0ako.pfb1c10
Error converting file to netcdf: dataout/og0aka.phb1c10
Error converting file to netcdf: dataout/og0aka.pgb1c10
Error converting file to netcdf: dataout/og0aka.peb1c10
Error converting file to netcdf: dataout/og0aka.pdb1c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8444, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
01:05:55 (2664): No heartbeat from core client for 30 sec - exiting
01:05:56 (2664): No heartbeat from core client for 30 sec - exiting
01:05:57 (2664): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:57:48 (3452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
15:12:32 (3320): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
21:29:24 (5780): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:00:06 (8208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
20:55:00 (1192): No heartbeat from core client for 30 sec - exiting
20:55:01 (1192): No heartbeat from core client for 30 sec - exiting
20:55:02 (1192): No heartbeat from core client for 30 sec - exiting
20:55:03 (1192): No heartbeat from core client for 30 sec - exiting
20:55:04 (1192): No heartbeat from core client for 30 sec - exiting
20:55:05 (1192): No heartbeat from core client for 30 sec - exiting
20:55:06 (1192): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
19:15:24 (7116): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/og0ako.pjb5c10
Error converting file to netcdf: dataout/og0ako.pib5c10
Error converting file to netcdf: dataout/og0ako.pfb5c10
Error converting file to netcdf: dataout/og0aka.phb5c10
Error converting file to netcdf: dataout/og0aka.pgb5c10
Error converting file to netcdf: dataout/og0aka.peb5c10
Error converting file to netcdf: dataout/og0aka.pdb5c10
CPDN Monitor - Quit request from BOINC...
19:01:01 (13108): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
13:15:39 (8028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
18:50:46 (9412): No heartbeat from core client for 30 sec - exiting
18:50:47 (9412): No heartbeat from core client for 30 sec - exiting
18:50:48 (9412): No heartbeat from core client for 30 sec - exiting
18:50:49 (9412): No heartbeat from core client for 30 sec - exiting
18:50:50 (9412): No heartbeat from core client for 30 sec - exiting
18:50:51 (9412): No heartbeat from core client for 30 sec - exiting
18:50:52 (9412): No heartbeat from core client for 30 sec - exiting
18:50:53 (9412): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
23:06:36 (8916): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:18:10 (5744): No heartbeat from core client for 30 sec - exiting
19:18:11 (5744): No heartbeat from core client for 30 sec - exiting
19:18:12 (5744): No heartbeat from core client for 30 sec - exiting
19:18:13 (5744): No heartbeat from core client for 30 sec - exiting
19:18:14 (5744): No heartbeat from core client for 30 sec - exiting
19:18:15 (5744): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6824, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
23:20:26 (5644): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:21:04 (11008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:43:15 (1972): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
11:39:03 (3056): No heartbeat from core client for 30 sec - exiting
11:39:04 (3056): No heartbeat from core client for 30 sec - exiting
11:39:05 (3056): No heartbeat from core client for 30 sec - exiting
11:39:06 (3056): No heartbeat from core client for 30 sec - exiting
11:39:07 (3056): No heartbeat from core client for 30 sec - exiting
11:39:08 (3056): No heartbeat from core client for 30 sec - exiting
11:39:09 (3056): No heartbeat from core client for 30 sec - exiting
11:39:10 (3056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
01 Feb 2014 21:06:20 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 518,400 525,733 1.0141
31 Jan 2014 20:59:32 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 492,480 498,986 1.0132
27 Jan 2014 21:59:05 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 466,560 472,779 1.0133
25 Jan 2014 16:47:52 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 440,640 446,621 1.0136
25 Jan 2014 02:00:56 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 414,720 420,004 1.0127
24 Jan 2014 18:03:54 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 388,800 393,424 1.0119
20 Jan 2014 22:12:48 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 362,880 366,686 1.0105
19 Jan 2014 10:05:26 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 336,960 340,084 1.0093
18 Jan 2014 12:58:20 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 311,040 313,664 1.0084
07 Jan 2014 19:31:36 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 285,120 287,787 1.0094
06 Jan 2014 13:26:34 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 259,200 261,328 1.0082
05 Jan 2014 17:33:32 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 233,280 234,988 1.0073
05 Jan 2014 09:35:09 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 207,360 208,837 1.0071
04 Jan 2014 17:30:46 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 181,440 183,513 1.0114
04 Jan 2014 10:15:26 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 155,520 158,091 1.0165
01 Jan 2014 19:31:45 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 129,600 131,915 1.0179
01 Jan 2014 01:02:28 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 103,680 105,809 1.0205
31 Dec 2013 17:19:42 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 77,760 79,511 1.0225
30 Dec 2013 17:10:19 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 51,840 53,299 1.0281
28 Dec 2013 22:55:30 1184169 16162213 hadcm3n_og0a_1900_40_008475869_1 25,920 26,863 1.0364


©2024 cpdn.org