climateprediction.net home page
Task 12821012

Task 12821012

Name hadcm3n_p2a9_1900_40_007220545_0
Workunit 7418785
Created 26 Apr 2011, 15:20:32 UTC
Sent 2 May 2011, 19:51:14 UTC
Report deadline 2 Aug 2011, 3:18:25 UTC
Received 24 May 2011, 15:42:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 841115
Run time 4 days 14 hours 52 min 10 sec
CPU time 4 days 18 hours 12 min 35 sec
Validate state Invalid
Credit 1,244.16
Device peak FLOPS 1.33 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/p2a9ko.pja2c10
Error converting file to netcdf: dataout/p2a9ko.pia2c10
Error converting file to netcdf: dataout/p2a9ko.pfa2c10
Error converting file to netcdf: dataout/p2a9ka.pha2c10
Error converting file to netcdf: dataout/p2a9ka.pga2c10
Error converting file to netcdf: dataout/p2a9ka.pea2c10
Error converting file to netcdf: dataout/p2a9ka.pda2c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:03:55 (3980): No heartbeat from core client for 30 sec - exiting
12:03:56 (3980): No heartbeat from core client for 30 sec - exiting
12:03:57 (3980): No heartbeat from core client for 30 sec - exiting
12:03:58 (3980): No heartbeat from core client for 30 sec - exiting
12:03:59 (3980): No heartbeat from core client for 30 sec - exiting
12:04:00 (3980): No heartbeat from core client for 30 sec - exiting
12:04:01 (3980): No heartbeat from core client for 30 sec - exiting
12:04:02 (3980): No heartbeat from core client for 30 sec - exiting
12:04:03 (3980): No heartbeat from core client for 30 sec - exiting
12:04:04 (3980): No heartbeat from core client for 30 sec - exiting
12:04:06 (3980): No heartbeat from core client for 30 sec - exiting
12:04:07 (3980): No heartbeat from core client for 30 sec - exiting
12:04:08 (3980): No heartbeat from core client for 30 sec - exiting
12:04:09 (3980): No heartbeat from core client for 30 sec - exiting
12:04:10 (3980): No heartbeat from core client for 30 sec - exiting
12:04:11 (3980): No heartbeat from core client for 30 sec - exiting
12:04:12 (3980): No heartbeat from core client for 30 sec - exiting
12:04:13 (3980): No heartbeat from core client for 30 sec - exiting
12:04:14 (3980): No heartbeat from core client for 30 sec - exiting
12:04:15 (3980): No heartbeat from core client for 30 sec - exiting
12:04:17 (3980): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
19:58:14 (2996): No heartbeat from core client for 30 sec - exiting
19:58:15 (2996): No heartbeat from core client for 30 sec - exiting
19:58:16 (2996): No heartbeat from core client for 30 sec - exiting
19:58:18 (2996): No heartbeat from core client for 30 sec - exiting
19:58:19 (2996): No heartbeat from core client for 30 sec - exiting
19:58:20 (2996): No heartbeat from core client for 30 sec - exiting
19:58:21 (2996): No heartbeat from core client for 30 sec - exiting
19:58:22 (2996): No heartbeat from core client for 30 sec - exiting
19:58:23 (2996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
23:02:32 (3240): No heartbeat from core client for 30 sec - exiting
23:02:33 (3240): No heartbeat from core client for 30 sec - exiting
23:02:34 (3240): No heartbeat from core client for 30 sec - exiting
23:02:35 (3240): No heartbeat from core client for 30 sec - exiting
23:02:36 (3240): No heartbeat from core client for 30 sec - exiting
23:02:38 (3240): No heartbeat from core client for 30 sec - exiting
23:02:39 (3240): No heartbeat from core client for 30 sec - exiting
23:02:40 (3240): No heartbeat from core client for 30 sec - exiting
23:02:41 (3240): No heartbeat from core client for 30 sec - exiting
23:02:42 (3240): No heartbeat from core client for 30 sec - exiting
23:02:43 (3240): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3468, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3468, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3468, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3468, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3468, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2252, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 May 2011 11:11:07 841115 12821012 hadcm3n_p2a9_1900_40_007220545_0 103,680 337,021 3.2506
15 May 2011 08:27:06 841115 12821012 hadcm3n_p2a9_1900_40_007220545_0 77,760 253,872 3.2648
11 May 2011 18:29:27 841115 12821012 hadcm3n_p2a9_1900_40_007220545_0 51,840 169,729 3.2741
07 May 2011 18:22:25 841115 12821012 hadcm3n_p2a9_1900_40_007220545_0 25,920 84,622 3.2647


©2024 cpdn.org