Name | hadcm3n_o0xr_1900_40_007196546_0 |
Workunit | 7394826 |
Created | 28 Mar 2011, 13:58:35 UTC |
Sent | 2 Apr 2011, 11:03:59 UTC |
Report deadline | 2 Jul 2011, 18:31:10 UTC |
Received | 29 Jun 2011, 0:58:30 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1111934 |
Run time | 19 days 10 hours 31 min 59 sec |
CPU time | 17 days 2 hours 29 min 16 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 2.64 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.26</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5704, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3728, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5040, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CNo Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5732, selfPID=5732, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1208, iMonCtr=1 Model crash detected, will try to restart... CCPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5300, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5320, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:55:46 (4812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:19:23 (2956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:27:52 (6764): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:52:32 (5356): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:53:09 (4820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:54:22 (6112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:00:17 (4264): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 00:16:12 (5680): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77DF3232 read attempt to address 0x40CAA402 Engaging BOINC Windows Runtime Debugger... Cannot serialize file G:\Seti/projects/climateprediction.net/hadcm3n_o0xr_1900_40_007196546/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
27 Jun 2011 03:27:22 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 1,036,800 | 1,477,749 | 1.4253 |
13 Jun 2011 02:11:59 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 1,010,880 | 1,442,082 | 1.4266 |
12 Jun 2011 14:30:12 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 984,960 | 1,407,220 | 1.4287 |
12 Jun 2011 03:06:11 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 959,040 | 1,372,586 | 1.4312 |
07 Jun 2011 12:21:42 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 933,120 | 1,336,671 | 1.4325 |
06 Jun 2011 08:19:55 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 907,200 | 1,300,604 | 1.4336 |
06 Jun 2011 08:19:55 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 881,280 | 1,264,976 | 1.4354 |
06 Jun 2011 08:19:55 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 855,360 | 1,228,381 | 1.4361 |
03 Jun 2011 21:50:10 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 829,440 | 1,191,943 | 1.4370 |
03 Jun 2011 10:13:06 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 803,520 | 1,155,945 | 1.4386 |
02 Jun 2011 22:23:14 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 777,600 | 1,119,448 | 1.4396 |
01 Jun 2011 15:16:58 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 751,680 | 1,082,206 | 1.4397 |
01 Jun 2011 04:47:05 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 725,760 | 1,047,356 | 1.4431 |
30 May 2011 03:22:10 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 699,840 | 1,012,718 | 1.4471 |
28 May 2011 21:34:01 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 673,920 | 977,520 | 1.4505 |
28 May 2011 09:59:58 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 648,000 | 941,477 | 1.4529 |
27 May 2011 23:01:29 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 622,080 | 905,151 | 1.4550 |
26 May 2011 13:56:47 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 596,160 | 868,823 | 1.4574 |
26 May 2011 02:27:33 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 570,240 | 832,189 | 1.4594 |
23 May 2011 22:59:49 | 1111934 | 12734271 | hadcm3n_o0xr_1900_40_007196546_0 | 544,320 | 795,338 | 1.4612 |
©2024 cpdn.org