climateprediction.net home page
Task 15639132

Task 15639132

Name hadcm3n_o72l_2140_40_008268953_2
Workunit 8424077
Created 25 Feb 2013, 7:27:03 UTC
Sent 25 Feb 2013, 7:27:17 UTC
Report deadline 27 May 2013, 14:54:28 UTC
Received 27 Mar 2013, 22:08:31 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1061327
Run time 13 days 9 hours 8 min 41 sec
CPU time 7 days 17 hours 18 min 15 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.65 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.60</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8336, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:29:24 (43184): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:00:15 (46424): No heartbeat from core client for 30 sec - exiting
20:00:16 (46424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:00:17 (46424): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:43:41 (68232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	04:21:04 AM	No files match the supplied pattern.
MainError:	04:21:04 AM	No files match the supplied pattern.
MainError:	01:15:03 PM	No files match the supplied pattern.
MainError:	01:15:03 PM	No files match the supplied pattern.
MainError:	10:06:02 PM	No files match the supplied pattern.
MainError:	10:06:02 PM	No files match the supplied pattern.
MainError:	06:59:17 AM	No files match the supplied pattern.
MainError:	06:59:17 AM	No files match the supplied pattern.
MainError:	03:50:58 PM	No files match the supplied pattern.
MainError:	03:50:58 PM	No files match the supplied pattern.
MainError:	12:42:23 AM	No files match the supplied pattern.
MainError:	12:42:23 AM	No files match the supplied pattern.
MainError:	09:36:55 AM	No files match the supplied pattern.
MainError:	09:36:55 AM	No files match the supplied pattern.
MainError:	06:28:49 PM	No files match the supplied pattern.
MainError:	06:28:49 PM	No files match the supplied pattern.
MainError:	03:20:35 AM	No files match the supplied pattern.
MainError:	03:20:35 AM	No files match the supplied pattern.
MainError:	12:14:43 AM	No files match the supplied pattern.
MainError:	12:14:43 AM	No files match the supplied pattern.
Error converting file to netcdf: dataout/o72lka.ph11c10
Error converting file to netcdf: dataout/o72lka.pg11c10
Error converting file to netcdf: dataout/o72lka.pe11c10
MainError:	09:08:03 PM	No files match the supplied pattern.
MainError:	09:08:03 PM	No files match the supplied pattern.
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16

Model crashed: STWORK  : I/O error - PP fixed length header                                                                                                                                                                                                                    tmp/pipe_dummy                                                                  2048    
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
27 Mar 2013 21:12:32 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 777,600 1,145,935 1.4737
27 Mar 2013 12:18:43 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 751,680 1,113,953 1.4820
27 Mar 2013 03:25:24 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 725,760 1,081,948 1.4908
26 Mar 2013 18:38:09 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 699,840 1,050,055 1.5004
26 Mar 2013 09:41:28 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 673,920 1,018,158 1.5108
26 Mar 2013 00:42:59 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 648,000 986,131 1.5218
25 Mar 2013 15:51:23 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 622,080 954,261 1.5340
25 Mar 2013 07:01:49 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 596,160 922,378 1.5472
24 Mar 2013 22:09:11 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 570,240 890,420 1.5615
24 Mar 2013 13:18:27 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 544,320 858,578 1.5773
24 Mar 2013 04:25:42 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 518,400 826,583 1.5945
23 Mar 2013 19:39:58 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 492,480 794,727 1.6137
23 Mar 2013 10:39:02 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 466,560 762,870 1.6351
23 Mar 2013 01:45:13 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 440,640 730,873 1.6587
22 Mar 2013 16:55:50 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 414,720 698,962 1.6854
22 Mar 2013 08:03:43 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 388,800 667,085 1.7158
18 Mar 2013 07:28:20 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 362,880 635,115 1.7502
17 Mar 2013 22:35:41 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 336,960 603,100 1.7898
17 Mar 2013 13:45:16 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 311,040 571,283 1.8367
17 Mar 2013 04:48:11 1061327 15639132 hadcm3n_o72l_2140_40_008268953_2 285,120 539,318 1.8915


©2024 climateprediction.net