Name | hadcm3n_yd4u_1900_40_007349880_1 |
Workunit | 7547310 |
Created | 6 Jul 2011, 14:01:54 UTC |
Sent | 17 Jul 2011, 8:47:28 UTC |
Report deadline | 16 Oct 2011, 16:14:39 UTC |
Received | 2 Oct 2011, 18:27:37 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
Computer ID | 725427 |
Run time | 15 days 16 hours 40 min 26 sec |
CPU time | 9 days 21 hours 52 min 35 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 2.16 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.6.36</core_client_version> <![CDATA[ <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> 10:34:19 (5700): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:49:07 (4120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4960, iMonCtr=1 Model crash detected, will try to restart... 19:54:58 (6908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4220, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7856, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2944, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8288, iMonCtr=1 Model crash detected, will try to restart... 09:53:00 (4300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:54:39 (4000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:56:34 (6044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:58:16 (7320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7248, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 10:12:10 (5184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9092, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7932, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9588, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4940, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5456, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7620, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8840, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4632, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5156, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3684, iMonCtr=1 Model crash detected, will try to restart... 08:58:46 (5192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:01:13 (3424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7984, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9432, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7484, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6012, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5408, iMonCtr=1 Model crash detected, will try to restart... 23:21:11 (2976): No heartbeat from core client for 30 sec - exiting 23:21:12 (2976): No heartbeat from core client for 30 sec - exiting 23:21:13 (2976): No heartbeat from core client for 30 sec - exiting 23:21:14 (2976): No heartbeat from core client for 30 sec - exiting 23:21:15 (2976): No heartbeat from core client for 30 sec - exiting 23:21:16 (2976): No heartbeat from core client for 30 sec - exiting 23:21:17 (2976): No heartbeat from core client for 30 sec - exiting 23:21:18 (2976): No heartbeat from core client for 30 sec - exiting 23:21:19 (2976): No heartbeat from core client for 30 sec - exiting 23:21:20 (2976): No heartbeat from core client for 30 sec - exiting 23:21:21 (2976): No heartbeat from core client for 30 sec - exiting 23:21:22 (2976): No heartbeat from core client for 30 sec - exiting 23:21:23 (2976): No heartbeat from core client for 30 sec - exiting 23:21:24 (2976): No heartbeat from core client for 30 sec - exiting 23:21:25 (2976): No heartbeat from core client for 30 sec - exiting 23:21:26 (2976): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:22:35 (4208): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5832, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3736, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6712, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7828, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7828, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4548, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3864, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5100, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4872, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4872, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7964, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 08:37:30 (7300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6928, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2196, iMonCtr=1 Model crash detected, will try to restart... 
08:09:24 (6480): No heartbeat from core client for 30 sec - exiting 08:09:25 (6480): No heartbeat from core client for 30 sec - exiting 08:09:26 (6480): No heartbeat from core client for 30 sec - exiting 08:09:27 (6480): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x771B8755 read attempt to address 0x40A8460C Engaging BOINC Windows Runtime Debugger... Signal 11 received, exiting... Called boinc_finish ERROR: Invalid parameter detected in function (null). File: (null) Line: 0 ERROR: Expression: (null) </stderr_txt> ]]> |
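The exit status above appears both as a signed decimal (-1073741819) and as a hexadecimal code (0xC0000005). The two are the same 32-bit value read as signed and unsigned respectively. A minimal sketch of that conversion follows; the function name `exit_status_to_hex` is illustrative and not part of BOINC or CPDN.

```python
def exit_status_to_hex(exit_status: int) -> str:
    """Reinterpret a signed 32-bit exit status as an unsigned hex code.

    Hypothetical helper for illustration only.
    """
    # Masking with 0xFFFFFFFF maps a negative value to its 32-bit
    # two's-complement unsigned equivalent.
    return f"0x{exit_status & 0xFFFFFFFF:08X}"


if __name__ == "__main__":
    # Exit status reported for this task.
    print(exit_status_to_hex(-1073741819))
```

For the value reported here, the result matches the 0xC0000005 (STATUS_ACCESS_VIOLATION) code shown in the Exit status row and in the stderr message.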
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
01 Oct 2011 14:48:56 | 725427 | 13103628 | hadcm3n_yd4u_1900_40_007349880_1 | 259,200 | 835,955 | 3.2251 |
25 Sep 2011 19:19:47 | 725427 | 13103628 | hadcm3n_yd4u_1900_40_007349880_1 | 233,280 | 752,625 | 3.2263 |
21 Sep 2011 15:21:59 | 725427 | 13103628 | hadcm3n_yd4u_1900_40_007349880_1 | 207,360 | 672,976 | 3.2454 |
17 Sep 2011 16:21:02 | 725427 | 13103628 | hadcm3n_yd4u_1900_40_007349880_1 | 181,440 | 590,375 | 3.2538 |
11 Sep 2011 10:59:03 | 725427 | 13103628 | hadcm3n_yd4u_1900_40_007349880_1 | 155,520 | 505,302 | 3.2491 |
28 Aug 2011 20:16:57 | 725427 | 13103628 | hadcm3n_yd4u_1900_40_007349880_1 | 129,600 | 418,642 | 3.2303 |
26 Aug 2011 14:46:57 | 725427 | 13103628 | hadcm3n_yd4u_1900_40_007349880_1 | 103,680 | 335,710 | 3.2379 |
06 Aug 2011 18:31:26 | 725427 | 13103628 | hadcm3n_yd4u_1900_40_007349880_1 | 77,760 | 252,334 | 3.2450 |
31 Jul 2011 16:06:47 | 725427 | 13103628 | hadcm3n_yd4u_1900_40_007349880_1 | 51,840 | 169,797 | 3.2754 |
25 Jul 2011 19:37:50 | 725427 | 13103628 | hadcm3n_yd4u_1900_40_007349880_1 | 25,920 | 86,056 | 3.3201 |
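The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. That is an inference from the figures in the table, not something the page states. A short sketch that recomputes the column for three of the rows above, with the hypothetical helper name `average_sec_per_timestep`:

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places.

    Assumed formula; inferred from the trickle table, not documented there.
    """
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds) copied from three rows of the table.
trickles = [
    (259_200, 835_955),
    (233_280, 752_625),
    (25_920, 86_056),
]

if __name__ == "__main__":
    for timestep, cpu_time in trickles:
        print(timestep, average_sec_per_timestep(cpu_time, timestep))
```

Under this assumption, the recomputed values agree with the listed averages for these rows (3.2251, 3.2263 and 3.3201).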
©2024 cpdn.org