climateprediction.net home page
Task 13026306

Task 13026306

Name hadcm3n_t5iy_1940_40_007316170_2
Workunit 7513600
Created 29 Jun 2011, 2:43:10 UTC
Sent 29 Jun 2011, 2:44:09 UTC
Report deadline 28 Sep 2011, 10:11:20 UTC
Received 18 Jul 2011, 17:27:15 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1132842
Run time 15 days 13 hours 5 min 39 sec
CPU time 7 days 12 hours 34 min 14 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 1.82 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:08:49 (4936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:42:02 (3580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:13:10 (1908): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=1
Model crash detected, will try to restart...
04:57:03 (4468): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:02:40 (5544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4516, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4516, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4516, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4516, iMonCtr=1
Model crash detected, will try to restart...
18:57:28 (4516): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1988, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1988, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1988, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1988, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1988, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1988, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3300, iMonCtr=1
Model crash detected, will try to restart...
11:02:48 (3300): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:18:29 (2520): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:22:25 (4152): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:24:05 (6128): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:25:04 (2808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:25:54 (4004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5668, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5668, iMonCtr=1
Model crash detected, will try to restart...


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77D63F79 read attempt to address 0xFFFFFFF8

Engaging BOINC Windows Runtime Debugger...

Signal 11 received, exiting...
Called boinc_finish


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77713A93 read attempt to address 0x00000000

Engaging BOINC Windows Runtime Debugger...

Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_t5iy_1940_40_007316170/dataout/shmem_restart.day
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
25 Jul 2011 17:31:43 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 518,400 649,094 1.2521
25 Jul 2011 17:31:43 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 492,480 621,180 1.2613
25 Jul 2011 17:31:43 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 466,560 686,354 1.4711
25 Jul 2011 14:50:38 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 440,640 608,088 1.3800
25 Jul 2011 14:14:54 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 414,720 527,498 1.2719
25 Jul 2011 14:14:54 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 388,800 761,544 1.9587
25 Jul 2011 14:14:54 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 362,880 685,574 1.8893
25 Jul 2011 14:14:54 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 336,960 609,084 1.8076
25 Jul 2011 14:14:54 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 311,040 530,743 1.7063
10 Jul 2011 07:06:09 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 285,120 675,695 2.3699
09 Jul 2011 09:35:38 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 259,200 599,019 2.3110
08 Jul 2011 14:23:50 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 233,280 520,566 2.2315
07 Jul 2011 16:09:55 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 207,360 531,127 2.5614
06 Jul 2011 01:18:11 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 181,440 502,308 2.7685
05 Jul 2011 17:15:44 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 155,520 424,420 2.7290
05 Jul 2011 17:15:44 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 129,600 381,239 2.9417
03 Jul 2011 03:48:54 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 103,680 306,141 2.9527
02 Jul 2011 04:33:16 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 77,760 230,276 2.9614
01 Jul 2011 03:51:29 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 51,840 153,303 2.9572
30 Jun 2011 03:02:17 1132842 13026306 hadcm3n_t5iy_1940_40_007316170_2 25,920 76,793 2.9627


©2024 climateprediction.net