climateprediction.net home page
Task 15781556

Task 15781556

Name hadcm3n_n11u_1960_40_008367338_1
Workunit 8518197
Created 13 May 2013, 10:38:09 UTC
Sent 13 May 2013, 10:38:16 UTC
Report deadline 12 Aug 2013, 18:05:27 UTC
Received 21 Jul 2013, 8:35:53 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1146914
Run time 18 days 11 hours 35 min 38 sec
CPU time 17 days 23 hours 4 min 34 sec
Validate state Invalid
Credit 9,331.20
Device peak FLOPS 2.36 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2052, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5340, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5760, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6052, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5596, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5968, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4632, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4632, iMonCtr=1
Model crash detected, will try to restart...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5148, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5860, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6036, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6036, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6036, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6036, iMonCtr=1
Model crash detected, will try to restart...
CBUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5140, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5932, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3684, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5888, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6088, iMonCtr=1
Model crash detected, will try to restart...
11:25:56 (2216): No heartbeat from core client for 30 sec - exiting
11:25:57 (2216): No heartbeat from core client for 30 sec - exiting
11:25:58 (2216): No heartbeat from core client for 30 sec - exiting
11:25:59 (2216): No heartbeat from core client for 30 sec - exiting
11:26:00 (2216): No heartbeat from core client for 30 sec - exiting
11:26:01 (2216): No heartbeat from core client for 30 sec - exiting
11:26:03 (2216): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:25:33 (5484): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:16:34 (5420): No heartbeat from core client for 30 sec - exiting
12:16:35 (5420): No heartbeat from core client for 30 sec - exiting
12:16:37 (5420): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77A5331F read attempt to address 0x00000004

Engaging BOINC Windows Runtime Debugger...

Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_n11u_1960_40_008367338/dataout/shmem_restart.day
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Jul 2013 20:54:56 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 777,600 1,551,868 1.9957
23 Jul 2013 19:41:15 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 751,680 1,499,431 1.9948
23 Jul 2013 19:16:55 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 725,760 1,446,504 1.9931
23 Jul 2013 19:16:54 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 699,840 1,395,136 1.9935
07 Jul 2013 07:35:48 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 673,920 1,341,821 1.9911
06 Jul 2013 04:55:07 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 648,000 1,290,258 1.9911
04 Jul 2013 14:24:23 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 622,080 1,238,540 1.9910
02 Jul 2013 12:04:53 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 596,160 1,188,452 1.9935
02 Jul 2013 10:02:16 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 570,240 1,135,590 1.9914
28 Jun 2013 03:36:14 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 544,320 1,081,065 1.9861
26 Jun 2013 06:48:42 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 518,400 1,028,583 1.9841
24 Jun 2013 09:20:10 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 492,480 975,889 1.9816
21 Jun 2013 11:24:33 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 466,560 922,363 1.9769
20 Jun 2013 06:52:08 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 440,640 865,728 1.9647
14 Jun 2013 06:51:46 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 414,720 809,831 1.9527
13 Jun 2013 04:52:21 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 388,800 758,294 1.9503
10 Jun 2013 05:41:35 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 362,880 707,688 1.9502
07 Jun 2013 08:20:20 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 336,960 655,514 1.9454
06 Jun 2013 06:10:00 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 311,040 603,031 1.9388
04 Jun 2013 03:37:06 1146914 15781556 hadcm3n_n11u_1960_40_008367338_1 285,120 552,513 1.9378


©2024 cpdn.org