climateprediction.net home page
Task 15522312

Task 15522312

Name hadcm3n_o37a_2060_40_008239521_4
Workunit 8394645
Created 3 Jan 2013, 12:53:00 UTC
Sent 3 Jan 2013, 12:53:04 UTC
Report deadline 4 Apr 2013, 20:20:15 UTC
Received 13 Jan 2013, 19:04:58 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1197348
Run time 5 days 15 hours 55 min 55 sec
CPU time 4 days 12 hours 16 min 56 sec
Validate state Invalid
Credit 5,598.72
Device peak FLOPS 3.37 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
13:04:49 (3384): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:04:50 (3384): No heartbeat from core client for 30 sec - exiting
15:47:21 (10896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:47:22 (10896): No heartbeat from core client for 30 sec - exiting
15:53:42 (8704): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:22:43 (5828): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:47:10 (8240): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:47:53 (8240): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1132, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:45:21 (9684): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:45:48 (9684): No heartbeat from core client for 30 sec - exiting
13:47:48 (7516): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:48:58 (8392): No heartbeat from core client for 30 sec - exiting
13:49:01 (8392): No heartbeat from core client for 30 sec - exiting
13:49:02 (8392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6728, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3456, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3456, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3456, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3456, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3456, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
12 Jan 2013 17:21:08 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 466,560 376,788 0.8076
12 Jan 2013 09:19:20 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 440,640 354,233 0.8039
11 Jan 2013 23:02:14 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 414,720 332,573 0.8019
11 Jan 2013 16:05:47 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 388,800 311,219 0.8005
11 Jan 2013 09:34:25 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 362,880 290,591 0.8008
09 Jan 2013 06:14:39 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 336,960 269,778 0.8006
08 Jan 2013 23:23:00 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 311,040 250,890 0.8066
08 Jan 2013 15:05:40 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 285,120 228,357 0.8009
08 Jan 2013 08:42:03 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 259,200 207,478 0.8005
07 Jan 2013 05:15:18 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 233,280 186,530 0.7996
06 Jan 2013 04:10:06 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 207,360 165,433 0.7978
05 Jan 2013 20:46:54 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 181,440 143,707 0.7920
05 Jan 2013 13:45:31 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 155,520 122,782 0.7895
05 Jan 2013 07:10:26 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 129,600 101,814 0.7856
04 Jan 2013 14:38:33 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 103,680 80,933 0.7806
04 Jan 2013 08:10:13 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 77,760 60,095 0.7728
04 Jan 2013 01:48:35 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 51,840 39,853 0.7688
03 Jan 2013 19:45:07 1197348 15522312 hadcm3n_o37a_2060_40_008239521_4 25,920 19,951 0.7697


©2024 cpdn.org