climateprediction.net home page
Task 13103914

Task 13103914

Name hadcm3n_yd8t_1900_40_007350023_1
Workunit 7547453
Created 6 Jul 2011, 14:02:58 UTC
Sent 17 Jul 2011, 6:34:52 UTC
Report deadline 16 Oct 2011, 14:02:03 UTC
Received 26 Aug 2011, 7:50:38 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1311971
Run time 7 days 16 hours 40 min 17 sec
CPU time 7 days 15 hours 43 min 36 sec
Validate state Invalid
Credit 4,665.60
Device peak FLOPS 2.54 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    
22:59:44 (1360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITDUMP: BAD BUFFOUT OF DATA                                                                                                                                                                                                                                   tmp/pipe_dummy                                                                  2048    
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITHEAD: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
23:06:08 (4724): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
BUFFOUT: C I/O Error - Return code = 32

Model crashed: WRITHEAD: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
forrtl: There is not enough space on the disk.

23:17:06 (3096): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
forrtl: There is not enough space on the disk.

23:22:40 (3632): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3172, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4144, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4144, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4144, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4144, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4144, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
26 Aug 2011 00:36:44 1070959 13103914 hadcm3n_yd8t_1900_40_007350023_1 388,800 636,388 1.6368
25 Aug 2011 12:48:59 1070959 13103914 hadcm3n_yd8t_1900_40_007350023_1 362,880 593,814 1.6364
25 Aug 2011 01:09:54 1070959 13103914 hadcm3n_yd8t_1900_40_007350023_1 336,960 552,016 1.6382
24 Aug 2011 13:36:32 1070959 13103914 hadcm3n_yd8t_1900_40_007350023_1 311,040 510,742 1.6420
24 Aug 2011 02:07:25 1070959 13103914 hadcm3n_yd8t_1900_40_007350023_1 285,120 469,457 1.6465
23 Aug 2011 14:38:43 1070959 13103914 hadcm3n_yd8t_1900_40_007350023_1 259,200 428,186 1.6520
23 Aug 2011 03:12:54 1070959 13103914 hadcm3n_yd8t_1900_40_007350023_1 233,280 386,930 1.6587
22 Aug 2011 15:41:33 1070959 13103914 hadcm3n_yd8t_1900_40_007350023_1 207,360 345,777 1.6675
22 Aug 2011 03:22:10 1070959 13103914 hadcm3n_yd8t_1900_40_007350023_1 181,440 303,799 1.6744
21 Aug 2011 15:48:06 1070959 13103914 hadcm3n_yd8t_1900_40_007350023_1 155,520 262,236 1.6862
03 Aug 2011 02:33:33 1070959 13103914 hadcm3n_yd8t_1900_40_007350023_1 129,600 218,772 1.6881
02 Aug 2011 14:22:28 1070959 13103914 hadcm3n_yd8t_1900_40_007350023_1 103,680 174,982 1.6877
25 Jul 2011 17:36:37 1070959 13103914 hadcm3n_yd8t_1900_40_007350023_1 77,760 131,262 1.6880
25 Jul 2011 17:15:54 1070959 13103914 hadcm3n_yd8t_1900_40_007350023_1 51,840 87,586 1.6895
25 Jul 2011 16:26:35 1070959 13103914 hadcm3n_yd8t_1900_40_007350023_1 25,920 43,884 1.6931


©2024 cpdn.org