climateprediction.net home page
Task 13568059

Name hadcm3n_ydsk_1900_40_007525012_2
Workunit 7722487
Created 30 Oct 2011, 12:48:17 UTC
Sent 30 Oct 2011, 12:56:16 UTC
Report deadline 29 Jan 2012, 20:23:27 UTC
Received 8 Nov 2011, 15:58:20 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1137201
Run time 8 days 11 hours 42 min 40 sec
CPU time 7 days 22 hours 14 min 35 sec
Validate state Invalid
Credit 4,043.52
Device peak FLOPS 2.78 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:20:07 (3708): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Ocean Restart file copy failed on ydskko.dab0c20
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=284, iMonCtr=1
Model crash detected, will try to restart...
15:44:48 (284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
08 Nov 2011 14:22:20  1137201  13568059   hadcm3n_ydsk_1900_40_007525012_2  336,960   684,091         2.0302
07 Nov 2011 21:07:49  1137201  13568059   hadcm3n_ydsk_1900_40_007525012_2  311,040   630,930         2.0285
07 Nov 2011 03:26:44  1137201  13568059   hadcm3n_ydsk_1900_40_007525012_2  285,120   578,832         2.0301
06 Nov 2011 12:13:52  1137201  13568059   hadcm3n_ydsk_1900_40_007525012_2  259,200   526,892         2.0328
05 Nov 2011 20:56:40  1137201  13568059   hadcm3n_ydsk_1900_40_007525012_2  233,280   474,858         2.0356
05 Nov 2011 05:28:58  1137201  13568059   hadcm3n_ydsk_1900_40_007525012_2  207,360   423,176         2.0408
04 Nov 2011 10:40:19  1137201  13568059   hadcm3n_ydsk_1900_40_007525012_2  181,440   370,177         2.0402
03 Nov 2011 18:57:37  1137201  13568059   hadcm3n_ydsk_1900_40_007525012_2  155,520   318,579         2.0485
03 Nov 2011 02:00:43  1137201  13568059   hadcm3n_ydsk_1900_40_007525012_2  129,600   265,328         2.0473
02 Nov 2011 10:26:38  1137201  13568059   hadcm3n_ydsk_1900_40_007525012_2  103,680   213,037         2.0548
01 Nov 2011 17:49:37  1137201  13568059   hadcm3n_ydsk_1900_40_007525012_2  77,760    161,352         2.0750
31 Oct 2011 23:15:31  1137201  13568059   hadcm3n_ydsk_1900_40_007525012_2  51,840    107,712         2.0778
31 Oct 2011 19:33:50  1137201  13568059   hadcm3n_ydsk_1900_40_007525012_2  25,920    52,413          2.0221
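
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep reached, rounded to four decimal places. The page itself does not state this; it is an assumption that the listed figures happen to fit. A minimal Python sketch of that arithmetic follows, using two rows copied from the table. The function name `average_sec_per_timestep` is hypothetical and is not part of BOINC or the CPDN software.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: this mirrors how the Average (sec/TS) column is derived.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, cumulative CPU time in seconds), copied from the trickle table
rows = [
    (336_960, 684_091),  # 08 Nov 2011 14:22:20
    (25_920, 52_413),    # 31 Oct 2011 19:33:50
]

for timestep, cpu_time_sec in rows:
    print(timestep, average_sec_per_timestep(cpu_time_sec, timestep))
```

For these two rows the computed values agree with the table's 2.0302 and 2.0221.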