Task 13106490

Name hadcm3n_ye8k_1900_40_007351310_1
Workunit 7548740
Created 6 Jul 2011, 14:12:40 UTC
Sent 16 Jul 2011, 10:02:39 UTC
Report deadline 15 Oct 2011, 17:29:50 UTC
Received 30 Aug 2011, 14:47:36 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1149100
Run time 6 days 23 hours 31 min 56 sec
CPU time 6 days 23 hours 31 min 56 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 2.42 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
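The run time and exit status above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch; the helper name parse_run_time is hypothetical and assumes the "D days H hours M min S sec" layout shown on this page.

```python
import re

def parse_run_time(text):
    """Convert 'D days H hours M min S sec' (layout assumed from this page) to seconds."""
    m = re.fullmatch(r"(\d+) days (\d+) hours (\d+) min (\d+) sec", text.strip())
    if m is None:
        raise ValueError("unexpected run time format: %r" % text)
    days, hours, minutes, seconds = (int(g) for g in m.groups())
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds

# Values copied from the task summary above.
RUN_TIME = "6 days 23 hours 31 min 56 sec"
EXIT_STATUS = 193

run_time_seconds = parse_run_time(RUN_TIME)
print("run time in seconds:", run_time_seconds)
# Format the decimal exit status the way the page shows it (0x000000C1).
print("exit status in hex: 0x%08X" % EXIT_STATUS)
```

The result, 603,116 seconds, is within a few seconds of the CPU time of 603,113 seconds reported in the last trickle.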
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4804, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=248, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:13:16 (5276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2868, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4768, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5312, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5152, iMonCtr=1
Model crash detected, will try to restart...
17:42:11 (3328): No heartbeat from core client for 30 sec - exiting
17:42:12 (3328): No heartbeat from core client for 30 sec - exiting
17:42:13 (3328): No heartbeat from core client for 30 sec - exiting
17:42:14 (3328): No heartbeat from core client for 30 sec - exiting
17:42:15 (3328): No heartbeat from core client for 30 sec - exiting
17:42:16 (3328): No heartbeat from core client for 30 sec - exiting
17:42:17 (3328): No heartbeat from core client for 30 sec - exiting
17:42:18 (3328): No heartbeat from core client for 30 sec - exiting
17:42:19 (3328): No heartbeat from core client for 30 sec - exiting
17:42:20 (3328): No heartbeat from core client for 30 sec - exiting
17:42:21 (3328): No heartbeat from core client for 30 sec - exiting
17:42:22 (3328): No heartbeat from core client for 30 sec - exiting
17:42:23 (3328): No heartbeat from core client for 30 sec - exiting
17:42:24 (3328): No heartbeat from core client for 30 sec - exiting
17:42:25 (3328): No heartbeat from core client for 30 sec - exiting
17:42:26 (3328): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5308, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3568, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4796, iMonCtr=1
Model crash detected, will try to restart...
09:50:44 (5416): No heartbeat from core client for 30 sec - exiting
09:50:45 (5416): No heartbeat from core client for 30 sec - exiting
09:50:46 (5416): No heartbeat from core client for 30 sec - exiting
09:50:47 (5416): No heartbeat from core client for 30 sec - exiting
09:50:48 (5416): No heartbeat from core client for 30 sec - exiting
09:50:49 (5416): No heartbeat from core client for 30 sec - exiting
09:50:50 (5416): No heartbeat from core client for 30 sec - exiting
09:50:51 (5416): No heartbeat from core client for 30 sec - exiting
09:50:52 (5416): No heartbeat from core client for 30 sec - exiting
09:50:53 (5416): No heartbeat from core client for 30 sec - exiting
09:50:54 (5416): No heartbeat from core client for 30 sec - exiting
09:50:55 (5416): No heartbeat from core client for 30 sec - exiting
09:50:56 (5416): No heartbeat from core client for 30 sec - exiting
09:50:57 (5416): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
11:43:41 (6012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4728, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
21:50:07 (4100): No heartbeat from core client for 30 sec - exiting
21:50:08 (4100): No heartbeat from core client for 30 sec - exiting
21:50:09 (4100): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:07:40 (4496): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5328, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5232, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7036, iMonCtr=1
Model crash detected, will try to restart...
14:00:11 (4908): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1340, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4320, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
22:48:04 (480): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:25:52 (3932): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:18:51 (4324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
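The stderr output above repeats a small number of message types. A rough way to summarize such a log is to tally lines by type. The sketch below is an illustration only: the regular expressions are derived from the message texts shown above, the category names are labels invented here, and SAMPLE is a short excerpt rather than the full log.

```python
import re
from collections import Counter

# (label, pattern) pairs; labels are hypothetical, patterns follow the log text above.
PATTERNS = [
    ("quit_request", re.compile(r"^CPDN Monitor - Quit request from BOINC")),
    ("monitor_no_heartbeat", re.compile(r"^CPDN Monitor - No 'heartbeat' from BOINC")),
    ("client_no_heartbeat", re.compile(r"^\d\d:\d\d:\d\d \(\d+\): No heartbeat from core client")),
    ("process_not_running", re.compile(r"^Controller:: CPDN process is not running")),
    ("model_crash", re.compile(r"^Model crash detected")),
    ("signal", re.compile(r"^Signal \d+ received")),
    ("finish", re.compile(r"^Called boinc_finish")),
]

def classify(line):
    """Return the label of the first pattern that matches, or 'other'."""
    for label, pattern in PATTERNS:
        if pattern.search(line):
            return label
    return "other"

def tally(text):
    """Count non-blank lines of text by message type."""
    return Counter(classify(line) for line in text.splitlines() if line.strip())

# Short excerpt of the stderr above, used only to exercise the functions.
SAMPLE = """
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4804, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
16:13:16 (5276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 11 received, exiting...
Called boinc_finish
"""

for label, count in sorted(tally(SAMPLE).items()):
    print(label, count)
```

To summarize the whole log, pass the full contents of the stderr_txt element to tally instead of SAMPLE.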
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
30 Aug 2011 14:50:40  1149100  13106490   hadcm3n_ye8k_1900_40_007351310_1  259,200   603,113         2.3268
24 Aug 2011 22:58:27  1149100  13106490   hadcm3n_ye8k_1900_40_007351310_1  233,280   543,658         2.3305
22 Aug 2011 07:18:03  1149100  13106490   hadcm3n_ye8k_1900_40_007351310_1  207,360   483,700         2.3327
17 Aug 2011 16:12:11  1149100  13106490   hadcm3n_ye8k_1900_40_007351310_1  181,440   423,296         2.3330
11 Aug 2011 10:23:38  1149100  13106490   hadcm3n_ye8k_1900_40_007351310_1  155,520   362,745         2.3325
07 Aug 2011 17:38:04  1149100  13106490   hadcm3n_ye8k_1900_40_007351310_1  129,600   302,291         2.3325
02 Aug 2011 19:42:53  1149100  13106490   hadcm3n_ye8k_1900_40_007351310_1  103,680   242,168         2.3357
30 Jul 2011 19:16:56  1149100  13106490   hadcm3n_ye8k_1900_40_007351310_1  77,760    181,227         2.3306
25 Jul 2011 22:22:31  1149100  13106490   hadcm3n_ye8k_1900_40_007351310_1  51,840    120,832         2.3309
25 Jul 2011 18:14:30  1149100  13106490   hadcm3n_ye8k_1900_40_007351310_1  25,920    60,302          2.3265
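
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. That reading is an inference from the numbers, not something the page states. A minimal sketch that checks it against the rows above (timestep, CPU seconds and average copied from the table):

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table, newest first.
TRICKLES = [
    (259200, 603113, 2.3268),
    (233280, 543658, 2.3305),
    (207360, 483700, 2.3327),
    (181440, 423296, 2.3330),
    (155520, 362745, 2.3325),
    (129600, 302291, 2.3325),
    (103680, 242168, 2.3357),
    (77760, 181227, 2.3306),
    (51840, 120832, 2.3309),
    (25920, 60302, 2.3265),
]

def average_matches(timestep, cpu_seconds, reported, tolerance=5e-5):
    """True if cpu_seconds / timestep agrees with the reported 4-decimal average."""
    return abs(cpu_seconds / timestep - reported) <= tolerance

# Check every row against the assumed formula.
all_match = all(average_matches(ts, cpu, avg) for ts, cpu, avg in TRICKLES)

# Timestep difference between consecutive trickles (rows are newest first).
steps = {newer[0] - older[0] for newer, older in zip(TRICKLES, TRICKLES[1:])}

print("all averages consistent:", all_match)
print("timestep increments seen:", sorted(steps))
```

The same rows also show that each trickle is sent after a fixed increment of 25,920 timesteps.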