climateprediction.net home page
Task 14042317

Task 14042317

Name hadcm3n_yb9u_1980_40_007743716_1
Workunit 7898825
Created 1 Feb 2012, 13:35:29 UTC
Sent 1 Feb 2012, 13:36:46 UTC
Report deadline 2 May 2012, 21:03:57 UTC
Received 24 Mar 2012, 16:07:29 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -2 (0xFFFFFFFE) Unknown error code
Computer ID 1230193
Run time 19 days 18 hours 5 min 47 sec
CPU time 13 days 13 hours 19 min 10 sec
Validate state Invalid
Credit 5,909.76
Device peak FLOPS 2.82 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
 - exit code -2 (0xfffffffe)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5184, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
forrtl: Not enough storage is available to process this command.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10032, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10032, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4144, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
15:32:34 (4376): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
forrtl: Not enough storage is available to process this command.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3664, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
forrtl: Insufficient system resources exist to complete the requested service.

Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8800, iMonCtr=1
Model crash detected, will try to restart...
18:39:29 (8800): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7288, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: C I/O Error ferror - Unit 30 - Return code = 32

Model crashed: READHEAD: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  2048    
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7288, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7288, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7288, iMonCtr=1
Model crash detected, will try to restart...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
18 Mar 2012 22:13:48 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 492,480 1,134,884 2.3044
17 Mar 2012 12:02:44 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 466,560 1,034,645 2.2176
16 Mar 2012 14:57:04 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 440,640 966,881 2.1943
15 Mar 2012 23:19:19 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 414,720 912,257 2.1997
15 Mar 2012 05:58:02 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 388,800 852,136 2.1917
13 Mar 2012 11:27:02 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 362,880 785,066 2.1634
12 Mar 2012 14:09:03 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 336,960 715,831 2.1244
11 Mar 2012 20:52:11 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 311,040 657,206 2.1129
20 Feb 2012 10:37:13 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 259,200 531,336 2.0499
19 Feb 2012 15:02:26 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 233,280 466,483 1.9997
18 Feb 2012 12:59:42 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 207,360 403,348 1.9452
16 Feb 2012 05:24:43 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 181,440 337,440 1.8598
14 Feb 2012 21:40:20 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 155,520 271,525 1.7459
13 Feb 2012 11:00:22 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 129,600 262,681 2.0269
11 Feb 2012 19:24:22 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 103,680 229,973 2.2181
08 Feb 2012 20:20:53 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 77,760 153,097 1.9688
04 Feb 2012 14:42:03 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 51,840 125,536 2.4216
03 Feb 2012 04:28:10 1068203 14042317 hadcm3n_yb9u_1980_40_007743716_1 25,920 62,358 2.4058


©2024 climateprediction.net