Task 16044458

Name hadcm3n_oedi_1900_40_008473753_0
Workunit 8624592
Created 27 Sep 2013, 10:23:48 UTC
Sent 28 Sep 2013, 17:43:10 UTC
Report deadline 29 Dec 2013, 1:10:21 UTC
Received 29 Oct 2013, 18:27:09 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1292666
Run time 6 days 18 hours 59 min 38 sec
CPU time 6 days 3 hours 57 min 14 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 3.64 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2020, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3736, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3740, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2816, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2816, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3596, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1908, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3304, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3304, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2884, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2884, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2884, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2884, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2436, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2436, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6716, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=660, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3816, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5464, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4060, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:57:51 (5544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:57:52 (5544): No heartbeat from core client for 30 sec - exiting
07:57:53 (5544): No heartbeat from core client for 30 sec - exiting
07:57:54 (5544): No heartbeat from core client for 30 sec - exiting
07:57:55 (5544): No heartbeat from core client for 30 sec - exiting
07:57:56 (5544): No heartbeat from core client for 30 sec - exiting
07:57:57 (5544): No heartbeat from core client for 30 sec - exiting
07:57:58 (5544): No heartbeat from core client for 30 sec - exiting
07:57:59 (5544): No heartbeat from core client for 30 sec - exiting
07:58:00 (5544): No heartbeat from core client for 30 sec - exiting
07:58:01 (5544): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5932, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6940, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2484, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3936, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/oediko.pjb7c10
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=976, selfPID=976, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x7788FF6B write attempt to address 0xFFFFFFF8

Engaging BOINC Windows Runtime Debugger...



Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77877383 read attempt to address 0xFFFFFFF8

Engaging BOINC Windows Runtime Debugger...


</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
29 Oct 2013 15:09:29 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 518,400 530,998 1.0243
28 Oct 2013 17:18:35 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 492,480 500,308 1.0159
25 Oct 2013 11:19:58 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 466,560 468,702 1.0046
24 Oct 2013 14:18:51 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 440,640 436,203 0.9899
23 Oct 2013 12:59:00 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 414,720 405,462 0.9777
22 Oct 2013 07:52:38 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 388,800 380,906 0.9797
21 Oct 2013 09:28:43 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 362,880 359,033 0.9894
20 Oct 2013 17:52:44 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 336,960 336,878 0.9998
17 Oct 2013 18:49:31 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 311,040 315,434 1.0141
16 Oct 2013 10:15:03 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 285,120 295,973 1.0381
15 Oct 2013 14:10:36 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 259,200 274,966 1.0608
14 Oct 2013 05:40:36 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 233,280 244,274 1.0471
12 Oct 2013 11:13:18 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 207,360 213,128 1.0278
10 Oct 2013 23:24:55 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 181,440 182,466 1.0057
10 Oct 2013 02:07:27 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 155,520 151,397 0.9735
09 Oct 2013 07:21:47 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 129,600 124,969 0.9643
02 Oct 2013 18:44:13 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 103,680 97,661 0.9419
01 Oct 2013 22:26:43 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 77,760 72,823 0.9365
01 Oct 2013 08:45:44 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 51,840 49,174 0.9486
30 Sep 2013 14:10:11 1292666 16044458 hadcm3n_oedi_1900_40_008473753_0 25,920 19,162 0.7393
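
The Average (sec/TS) column above appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against three rows copied from the table. It assumes "TS" means timestep; the names TRICKLES and average_sec_per_timestep are illustrative only and are not part of BOINC or the CPDN software.

```python
# Rows copied from the trickle table above:
# (timestep, CPU time in seconds, reported average in sec/TS).
TRICKLES = [
    (518_400, 530_998, 1.0243),  # 29 Oct 2013 15:09:29
    (259_200, 274_966, 1.0608),  # 15 Oct 2013 14:10:36
    (25_920, 19_162, 0.7393),    # 30 Sep 2013 14:10:11
]


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return cumulative CPU seconds per timestep, rounded to 4 places.

    Assumption: the table's last column is simply CPU time / timestep.
    """
    return round(cpu_time_sec / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in TRICKLES:
        computed = average_sec_per_timestep(cpu_time_sec, timestep)
        print(f"timestep={timestep:>7} computed={computed:.4f} reported={reported:.4f}")
```

Because the column is a cumulative average, a rise between consecutive trickles (for example from 0.9365 to 0.9419) means the most recent block of timesteps cost more CPU time per step than the run had averaged up to that point.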

