climateprediction.net home page
Task 15917906

Task 15917906

Name hadcm3n_49i1_1980_40_008324300_4
Workunit 8475435
Created 14 Aug 2013, 15:50:17 UTC
Sent 14 Aug 2013, 19:42:15 UTC
Report deadline 14 Nov 2013, 3:09:26 UTC
Received 12 Sep 2013, 9:51:39 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1286177
Run time 6 days 8 hours 1 min 32 sec
CPU time 6 days 5 hours 2 min 46 sec
Validate state Invalid
Credit 2,799.36
Device peak FLOPS 2.65 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5140, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4296, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:04:43 (3236): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
10 Sep 2013 15:16:16 1286177 15917906 hadcm3n_49i1_1980_40_008324300_4 233,280 480,978 2.0618
09 Sep 2013 02:31:40 1286177 15917906 hadcm3n_49i1_1980_40_008324300_4 207,360 423,942 2.0445
06 Sep 2013 22:46:38 1286177 15917906 hadcm3n_49i1_1980_40_008324300_4 181,440 366,733 2.0212
05 Sep 2013 16:56:15 1286177 15917906 hadcm3n_49i1_1980_40_008324300_4 155,520 310,262 1.9950
03 Sep 2013 17:59:38 1286177 15917906 hadcm3n_49i1_1980_40_008324300_4 129,600 256,667 1.9805
02 Sep 2013 18:49:27 1286177 15917906 hadcm3n_49i1_1980_40_008324300_4 103,680 205,947 1.9864
29 Aug 2013 12:26:33 1286177 15917906 hadcm3n_49i1_1980_40_008324300_4 77,760 153,892 1.9791
26 Aug 2013 20:01:45 1286177 15917906 hadcm3n_49i1_1980_40_008324300_4 51,840 101,821 1.9641
20 Aug 2013 16:45:33 1286177 15917906 hadcm3n_49i1_1980_40_008324300_4 25,920 51,338 1.9806


©2024 cpdn.org