climateprediction.net home page
Task 13624641

Task 13624641

Name hadcm3n_ylc8_1940_40_007538576_2
Workunit 7735808
Created 9 Nov 2011, 21:22:28 UTC
Sent 9 Nov 2011, 21:27:43 UTC
Report deadline 9 Feb 2012, 4:54:54 UTC
Received 17 Nov 2011, 22:56:43 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1176884
Run time 1 days 3 hours 23 min 58 sec
CPU time 20 hours 41 min 14 sec
Validate state Invalid
Credit 622.08
Device peak FLOPS 4.12 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:35:47 (5380): No heartbeat from core client for 30 sec - exiting
07:35:48 (5380): No heartbeat from core client for 30 sec - exiting
07:35:49 (5380): No heartbeat from core client for 30 sec - exiting
07:35:50 (5380): No heartbeat from core client for 30 sec - exiting
07:35:51 (5380): No heartbeat from core client for 30 sec - exiting
07:35:52 (5380): No heartbeat from core client for 30 sec - exiting
07:35:53 (5380): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:30:20 (6064): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:31:45 (2488): Can't acquire lockfile (32) - waiting 35s
16:31:58 (8060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:41:57 (6080): No heartbeat from core client for 30 sec - exiting
19:41:58 (6080): No heartbeat from core client for 30 sec - exiting
19:41:59 (6080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:59:11 (7104): No heartbeat from core client for 30 sec - exiting
00:59:12 (7104): No heartbeat from core client for 30 sec - exiting
00:59:13 (7104): No heartbeat from core client for 30 sec - exiting
00:59:14 (7104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:45:00 (6780): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:12:39 (1964): No heartbeat from core client for 30 sec - exiting
02:12:40 (1964): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6132, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6132, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6132, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7704, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7704, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7704, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
15 Nov 2011 22:29:55 1176884 13624641 hadcm3n_ylc8_1940_40_007538576_2 51,840 52,897 1.0204
15 Nov 2011 22:29:55 1176884 13624641 hadcm3n_ylc8_1940_40_007538576_2 25,920 26,856 1.0361


©2024 cpdn.org