climateprediction.net home page
Task 15615592

Task 15615592

Name hadcm3n_zj12_1880_40_008247883_3
Workunit 8403007
Created 20 Feb 2013, 17:55:18 UTC
Sent 20 Feb 2013, 17:55:34 UTC
Report deadline 23 May 2013, 1:22:45 UTC
Received 21 Apr 2013, 22:44:50 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1125445
Run time 20 days 0 hours 43 min 17 sec
CPU time 17 days 22 hours 11 min 36 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.68 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5808, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5200, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=636, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:41:19 (5180): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2184, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4340, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2864, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5932, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4608, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5004, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1384, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5088, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4988, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
17:26:14 (4704): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:26:15 (4704): No heartbeat from core client for 30 sec - exiting
17:26:16 (4704): No heartbeat from core client for 30 sec - exiting
17:26:18 (4704): No heartbeat from core client for 30 sec - exiting
17:26:19 (4704): No heartbeat from core client for 30 sec - exiting
17:26:20 (4704): No heartbeat from core client for 30 sec - exiting
17:26:21 (4704): No heartbeat from core client for 30 sec - exiting
17:26:22 (4704): No heartbeat from core client for 30 sec - exiting
17:26:23 (4704): No heartbeat from core client for 30 sec - exiting
17:26:24 (4704): No heartbeat from core client for 30 sec - exiting
17:26:25 (4704): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
CSuspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4988, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4716, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
21 Apr 2013 21:44:40 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 777,600 1,548,691 1.9916
21 Apr 2013 06:31:27 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 751,680 1,500,027 1.9956
20 Apr 2013 05:07:43 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 725,760 1,453,502 2.0027
18 Apr 2013 03:59:19 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 699,840 1,401,365 2.0024
15 Apr 2013 01:12:58 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 673,920 1,349,115 2.0019
14 Apr 2013 09:51:58 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 648,000 1,296,656 2.0010
13 Apr 2013 11:19:20 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 622,080 1,243,853 1.9995
07 Apr 2013 23:58:48 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 596,160 1,191,490 1.9986
07 Apr 2013 06:32:10 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 570,240 1,138,527 1.9966
06 Apr 2013 14:30:28 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 544,320 1,086,061 1.9953
05 Apr 2013 03:47:45 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 518,400 1,034,100 1.9948
02 Apr 2013 05:47:28 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 492,480 980,686 1.9913
31 Mar 2013 10:54:46 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 466,560 928,190 1.9894
29 Mar 2013 16:00:46 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 440,640 873,956 1.9834
28 Mar 2013 03:47:00 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 414,720 822,500 1.9833
25 Mar 2013 03:46:00 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 388,800 772,993 1.9882
24 Mar 2013 12:33:13 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 362,880 719,890 1.9838
22 Mar 2013 06:03:06 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 336,960 665,468 1.9749
17 Mar 2013 18:06:23 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 311,040 612,296 1.9685
16 Mar 2013 13:25:07 1125445 15615592 hadcm3n_zj12_1880_40_008247883_3 285,120 559,345 1.9618


©2024 climateprediction.net