climateprediction.net (CPDN) home page
Task 15797282

Task 15797282

Name hadcm3n_o3cn_2140_40_008269493_4
Workunit 8424617
Created 26 May 2013, 19:36:50 UTC
Sent 26 May 2013, 19:37:03 UTC
Report deadline 26 Aug 2013, 3:04:14 UTC
Received 25 Jun 2013, 0:10:02 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1272463
Run time 26 days 9 hours 48 min 36 sec
CPU time 24 days 6 hours 8 min 49 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.13 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
09:09:46 (3292): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:16:31 (5468): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:34:33 (5920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:28:16 (4568): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
04:27:16 (19080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:26:16 (21492): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
12:13:52 (5408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:12:49 (3400): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
13:48:30 (5956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:47:28 (9956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	06:48:00 AM	No files match the supplied pattern.
MainError:	06:48:01 AM	No files match the supplied pattern.
06:30:16 (4524): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	03:30:19 AM	No files match the supplied pattern.
MainError:	03:30:20 AM	No files match the supplied pattern.
04:29:13 (5928): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	01:47:21 AM	No files match the supplied pattern.
MainError:	01:47:21 AM	No files match the supplied pattern.
22:28:10 (5936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	10:46:21 PM	No files match the supplied pattern.
MainError:	10:46:22 PM	No files match the supplied pattern.
04:27:08 (5512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	07:50:02 PM	No files match the supplied pattern.
MainError:	07:50:02 PM	No files match the supplied pattern.
09:26:09 (4836): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	04:36:02 PM	No files match the supplied pattern.
MainError:	04:36:03 PM	No files match the supplied pattern.
MainError:	02:06:08 PM	No files match the supplied pattern.
MainError:	02:06:08 PM	No files match the supplied pattern.
10:21:41 (8140): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	10:47:43 AM	No files match the supplied pattern.
MainError:	10:47:43 AM	No files match the supplied pattern.
MainError:	08:45:37 AM	No files match the supplied pattern.
MainError:	08:45:38 AM	No files match the supplied pattern.
05:23:04 (4248): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	06:28:24 AM	No files match the supplied pattern.
MainError:	06:28:25 AM	No files match the supplied pattern.
05:22:17 (5876): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Error converting file to netcdf: dataout/o3cnka.ph11c10
Error converting file to netcdf: dataout/o3cnka.pg11c10
Error converting file to netcdf: dataout/o3cnka.pe11c10
MainError:	04:54:07 AM	No files match the supplied pattern.
MainError:	04:54:07 AM	No files match the supplied pattern.
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
24 Jun 2013 04:58:12 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 777,600 2,130,541 2.7399
23 Jun 2013 06:30:47 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 751,680 2,056,793 2.7363
22 Jun 2013 08:46:26 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 725,760 1,984,524 2.7344
21 Jun 2013 10:49:18 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 699,840 1,913,088 2.7336
20 Jun 2013 14:05:15 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 673,920 1,842,722 2.7343
19 Jun 2013 16:35:42 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 648,000 1,770,077 2.7316
18 Jun 2013 19:52:58 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 622,080 1,699,399 2.7318
17 Jun 2013 22:48:35 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 596,160 1,627,940 2.7307
17 Jun 2013 01:48:06 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 570,240 1,557,470 2.7313
16 Jun 2013 03:33:33 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 544,320 1,484,333 2.7269
15 Jun 2013 06:48:56 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 518,400 1,413,532 2.7267
14 Jun 2013 10:13:13 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 492,480 1,343,313 2.7276
13 Jun 2013 13:35:43 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 466,560 1,273,119 2.7287
12 Jun 2013 08:15:47 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 440,640 1,202,017 2.7279
11 Jun 2013 10:54:09 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 414,720 1,130,768 2.7266
10 Jun 2013 14:20:01 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 388,800 1,060,233 2.7269
09 Jun 2013 06:38:52 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 362,880 988,368 2.7237
08 Jun 2013 09:18:53 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 336,960 916,965 2.7213
07 Jun 2013 12:27:04 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 311,040 846,156 2.7204
06 Jun 2013 15:42:28 1272463 15797282 hadcm3n_o3cn_2140_40_008269493_4 285,120 775,784 2.7209


©2024 cpdn.org