Task 16033063

Name hadcm3n_8141_1980_40_008459380_4
Workunit 8610236
Created 24 Sep 2013, 12:13:49 UTC
Sent 24 Sep 2013, 12:14:03 UTC
Report deadline 24 Dec 2013, 19:41:14 UTC
Received 15 Jan 2014, 8:41:04 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1225270
Run time 23 days 17 hours 45 min 21 sec
CPU time 12 days 6 hours 40 min 25 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 2.26 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.2.33</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPIDController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5856, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4528, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5000, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5000, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4344, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4164, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4796, iMonCtr=1
Model crash detected, will try to restart...
CCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CoCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4344, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4236, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4768, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4220, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4900, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3572, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4472, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4444, iMonCtr=1
MController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3376, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4764, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4272, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4420, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:31:59 (5516): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:32:01 (5516): No heartbeat from core client for 30 sec - exiting
14:32:02 (5516): No heartbeat from core client for 30 sec - exiting
14:32:03 (5516): No heartbeat from core client for 30 sec - exiting
14:32:04 (5516): No heartbeat from core client for 30 sec - exiting
14:59:54 (6044): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:59:55 (6044): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3912, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4452, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4280, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5316, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2564, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5036, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4880, iMonCtr=1
Model crash detected, will try to restart...
09:05:03 (3848): No heartbeat from core client for 30 sec - exiting
09:05:05 (3848): No heartbeat from core client for 30 sec - exiting
09:05:06 (3848): No heartbeat from core client for 30 sec - exiting
09:05:07 (3848): No heartbeat from core client for 30 sec - exiting
09:05:08 (3848): No heartbeat from core client for 30 sec - exiting
09:05:09 (3848): No heartbeat from core client for 30 sec - exiting
09:05:10 (3848): No heartbeat from core client for 30 sec - exiting
09:05:11 (3848): No heartbeat from core client for 30 sec - exiting
09:05:12 (3848): No heartbeat from core client for 30 sec - exiting
09:05:13 (3848): No heartbeat from core client for 30 sec - exiting
09:05:14 (3848): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6072, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3200, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2936, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4832, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3556, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4960, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4808, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5100, iMonCtr=1
Model crash detected, will try to restart...
CSuspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4840, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
09:09:45 (5032): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:15:43 (5048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4820, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=708, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3084, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4796, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5088, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5112, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4296, iMonCtr=1
Model crash detected, will try to restart...
C

Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77B57383 read attempt to address 0x40342CCB

Engaging BOINC Windows Runtime Debugger...



Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77457383 read attempt to address 0xFFFFFFF8

Engaging BOINC Windows Runtime Debugger...

Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_8141_1980_40_008459380/dataout/shmem_restart.day
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
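The stderr output above repeats a handful of message types (model restarts, suspend and quit requests, missed heartbeats, access violations). A minimal sketch for tallying them follows. The pattern names and regular expressions are assumptions derived only from the wording in this log, not from any official CPDN or BOINC message catalogue, and `count_events` is a hypothetical helper.

```python
import re
from collections import Counter

# Assumed patterns, based solely on the message text seen in this log.
PATTERNS = {
    "model_restart": re.compile(r"Model crash detected"),
    "suspend": re.compile(r"Suspend request from BOINC"),
    "quit": re.compile(r"Quit request from BOINC"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "access_violation": re.compile(r"Access Violation \(0xc0000005\)"),
}


def count_events(stderr_text: str) -> Counter:
    """Count how many lines match each assumed message pattern."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for name, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[name] += 1
    return counts


# A few lines copied from the log above, used as sample input.
sample = "\n".join([
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4528, iMonCtr=1",
    "Model crash detected, will try to restart...",
    "Suspended CPDN Monitor - Suspend request from BOINC...",
    "CPDN Monitor - Quit request from BOINC...",
    "14:31:59 (5516): No heartbeat from core client for 30 sec - exiting",
    "Reason: Access Violation (0xc0000005) at address 0x77B57383 read attempt to address 0x40342CCB",
])

counts = count_events(sample)
for name in PATTERNS:
    print(name, counts[name])
```

Substring search is used rather than anchored matching because several lines in this log carry stray leading characters (for example `CController::`), which an anchored pattern would miss.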
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
14 Jan 2014 15:17:56  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   518,400       1,057,926            2.0408
08 Jan 2014 13:37:25  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   492,480       1,002,731            2.0361
03 Jan 2014 12:36:40  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   466,560         945,796            2.0272
26 Dec 2013 11:49:30  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   440,640         892,425            2.0253
18 Dec 2013 14:59:16  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   414,720         834,958            2.0133
13 Dec 2013 10:28:40  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   388,800         779,783            2.0056
08 Dec 2013 11:10:56  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   362,880         723,505            1.9938
03 Dec 2013 09:28:31  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   336,960         667,300            1.9804
27 Nov 2013 10:31:19  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   311,040         612,608            1.9695
21 Nov 2013 11:54:48  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   285,120         558,797            1.9599
14 Nov 2013 16:46:10  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   259,200         506,855            1.9555
11 Nov 2013 16:26:38  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   233,280         457,451            1.9610
06 Nov 2013 13:24:20  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   207,360         408,356            1.9693
31 Oct 2013 16:19:23  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   181,440         358,585            1.9763
29 Oct 2013 08:37:48  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   155,520         308,858            1.9860
23 Oct 2013 12:33:29  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   129,600         258,764            1.9966
16 Oct 2013 16:35:00  1225270  16033063   hadcm3n_8141_1980_40_008459380_4   103,680         211,541            2.0403
11 Oct 2013 10:32:26  1225270  16033063   hadcm3n_8141_1980_40_008459380_4    77,760         158,646            2.0402
06 Oct 2013 16:53:24  1225270  16033063   hadcm3n_8141_1980_40_008459380_4    51,840         104,317            2.0123
30 Sep 2013 13:23:07  1225270  16033063   hadcm3n_8141_1980_40_008459380_4    25,920          53,908            2.0798
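
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The sketch below checks that reading against three rows copied from the table. The function name `avg_sec_per_timestep` is illustrative and not part of any CPDN tool.

```python
def avg_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds divided by completed timesteps, rounded to 4 places."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in sec, reported average) copied from the trickle table.
rows = [
    (518_400, 1_057_926, 2.0408),
    (259_200, 506_855, 1.9555),
    (25_920, 53_908, 2.0798),
]

results = []
for timestep, cpu_time, reported in rows:
    computed = avg_sec_per_timestep(cpu_time, timestep)
    results.append((timestep, computed, reported))
    print(timestep, computed, reported)
```

Each trickle arrives 25,920 timesteps after the previous one, so the same division can be applied to any row of the table.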

