Name | hadcm3n_u5gf_1980_40_008049335_3 |
Workunit | 8204449 |
Created | 12 Jul 2012, 23:39:38 UTC |
Sent | 12 Jul 2012, 23:43:54 UTC |
Report deadline | 12 Oct 2012, 7:11:05 UTC |
Received | 15 Oct 2012, 19:50:09 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1218644 |
Run time | 20 days 21 hours 28 min 7 sec |
CPU time | 16 days 7 hours 25 min 3 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 3.00 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.25</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15876, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:49:22 (2868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 11:13:02 (5200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
11:13:03 (5200): No heartbeat from core client for 30 sec - exiting 11:13:04 (5200): No heartbeat from core client for 30 sec - exiting 11:13:05 (5200): No heartbeat from core client for 30 sec - exiting 11:13:06 (5200): No heartbeat from core client for 30 sec - exiting 11:13:07 (5200): No heartbeat from core client for 30 sec - exiting 11:13:08 (5200): No heartbeat from core client for 30 sec - exiting 11:13:57 (3424): No heartbeat from core client for 30 sec - exiting 11:13:58 (3424): No heartbeat from core client for 30 sec - exiting 11:13:59 (3424): No heartbeat from core client for 30 sec - exiting 11:14:00 (3424): No heartbeat from core client for 30 sec - exiting 11:14:01 (3424): No heartbeat from core client for 30 sec - exiting 11:14:02 (3424): No heartbeat from core client for 30 sec - exiting 11:14:03 (3424): No heartbeat from core client for 30 sec - exiting 11:14:04 (3424): No heartbeat from core client for 30 sec - exiting 11:14:05 (3424): No heartbeat from core client for 30 sec - exiting 11:14:06 (3424): No heartbeat from core client for 30 sec - exiting 11:14:07 (3424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:14:08 (3424): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 23:47:01 (4972): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4692, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4880, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4580, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4580, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 03:22:11 (5092): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:40:10 (4868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 00:00:53 (6672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
00:00:54 (6672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 18:09:52 (5116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:11:29 (4668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 03:29:16 (4620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:29:17 (4620): No heartbeat from core client for 30 sec - exiting 03:29:18 (4620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x7754FF2B write attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_u5gf_1980_40_008049335/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
15 Oct 2012 19:50:43 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 1,036,800 | 1,409,100 | 1.3591 |
14 Oct 2012 10:51:37 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 1,010,880 | 1,373,704 | 1.3589 |
11 Oct 2012 19:46:32 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 984,960 | 1,336,515 | 1.3569 |
07 Oct 2012 13:20:59 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 959,040 | 1,300,828 | 1.3564 |
23 Sep 2012 23:52:53 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 933,120 | 1,265,788 | 1.3565 |
23 Sep 2012 05:55:36 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 907,200 | 1,229,717 | 1.3555 |
22 Sep 2012 23:53:58 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 881,280 | 1,195,711 | 1.3568 |
18 Sep 2012 22:02:18 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 855,360 | 1,160,359 | 1.3566 |
16 Sep 2012 00:02:25 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 829,440 | 1,126,243 | 1.3578 |
12 Sep 2012 23:44:40 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 803,520 | 1,091,125 | 1.3579 |
10 Sep 2012 23:08:17 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 777,600 | 1,056,851 | 1.3591 |
10 Sep 2012 11:27:32 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 751,680 | 1,022,092 | 1.3597 |
07 Sep 2012 23:54:19 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 725,760 | 988,841 | 1.3625 |
04 Sep 2012 02:29:41 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 699,840 | 952,216 | 1.3606 |
03 Sep 2012 00:09:16 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 673,920 | 916,551 | 1.3600 |
02 Sep 2012 00:08:49 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 648,000 | 881,853 | 1.3609 |
31 Aug 2012 18:33:34 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 622,080 | 845,037 | 1.3584 |
30 Aug 2012 22:58:03 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 596,160 | 808,339 | 1.3559 |
28 Aug 2012 11:26:31 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 570,240 | 772,517 | 1.3547 |
26 Aug 2012 17:18:14 | 1218644 | 14909068 | hadcm3n_u5gf_1980_40_008049335_3 | 544,320 | 736,030 | 1.3522 |
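
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. That relationship is an assumption inferred from the listed rows, not something the page states. The sketch below checks it against three rows copied from the table; the function name `average_sec_per_timestep` is hypothetical and used only for illustration.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumes (not confirmed by the page) that the trickle table's
    'Average (sec/TS)' column is simply CPU time / timestep.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, listed average) copied from the table above.
rows = [
    (1_036_800, 1_409_100, 1.3591),  # 15 Oct 2012 19:50:43
    (1_010_880, 1_373_704, 1.3589),  # 14 Oct 2012 10:51:37
    (544_320, 736_030, 1.3522),      # 26 Aug 2012 17:18:14
]

for timestep, cpu_time_sec, listed in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"timestep={timestep:>9,}  computed={computed:.4f}  listed={listed:.4f}")
```

For the three rows checked, the computed value agrees with the listed average to four decimal places.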