Name | hadcm3n_n5lf_1880_40_008285136_2 |
Workunit | 8436271 |
Created | 24 Jan 2013, 5:01:42 UTC |
Sent | 24 Jan 2013, 5:01:53 UTC |
Report deadline | 25 Apr 2013, 12:29:04 UTC |
Received | 18 Feb 2013, 13:38:37 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1161576 |
Run time | 15 days 19 hours 52 min 2 sec |
CPU time | 14 days 16 hours 35 min 30 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 2.36 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.31</core_client_version>
<![CDATA[
<message>
- exit code 193 (0xc1)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3376, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3676, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4032, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4032, iMonCtr=1
Model crash detected, will try to restart...
02:58:18 (1288): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2692, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=128, iMonCtr=1
Model crash detected, will try to restart...
19:00:33 (3352): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:47:29 (2956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77E03AB3 read attempt to address 0x00000000
Engaging BOINC Windows Runtime Debugger...
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77E071F3 read attempt to address 0xFFFFFFF8
Engaging BOINC Windows Runtime Debugger...
Cannot serialize file B:\BOINC\Data/projects/climateprediction.net/hadcm3n_n5lf_1880_40_008285136/dataout/shmem_restart.day
Signal 11 received, exiting...
Called boinc_finish
</stderr_txt>
]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
18 Feb 2013 02:36:54 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 777,600 | 1,247,759 | 1.6046 |
17 Feb 2013 10:57:21 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 751,680 | 1,205,609 | 1.6039 |
16 Feb 2013 18:50:38 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 725,760 | 1,163,633 | 1.6033 |
16 Feb 2013 05:23:15 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 699,840 | 1,120,450 | 1.6010 |
15 Feb 2013 01:50:41 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 673,920 | 1,079,038 | 1.6011 |
14 Feb 2013 03:59:42 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 648,000 | 1,037,305 | 1.6008 |
12 Feb 2013 15:56:23 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 622,080 | 995,672 | 1.6006 |
12 Feb 2013 03:03:20 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 596,160 | 954,142 | 1.6005 |
11 Feb 2013 05:05:24 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 570,240 | 912,898 | 1.6009 |
10 Feb 2013 06:06:20 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 544,320 | 871,235 | 1.6006 |
09 Feb 2013 07:26:30 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 518,400 | 829,381 | 1.5999 |
08 Feb 2013 03:40:06 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 492,480 | 787,278 | 1.5986 |
07 Feb 2013 04:11:25 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 466,560 | 745,860 | 1.5986 |
06 Feb 2013 07:58:01 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 440,640 | 704,137 | 1.5980 |
05 Feb 2013 09:57:08 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 414,720 | 662,493 | 1.5974 |
04 Feb 2013 11:53:26 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 388,800 | 621,537 | 1.5986 |
03 Feb 2013 23:05:28 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 362,880 | 579,580 | 1.5972 |
03 Feb 2013 10:43:15 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 336,960 | 537,774 | 1.5960 |
02 Feb 2013 13:25:21 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 311,040 | 496,597 | 1.5966 |
01 Feb 2013 23:58:34 | 1161576 | 15556192 | hadcm3n_n5lf_1880_40_008285136_2 | 285,120 | 454,958 | 1.5957 |
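
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. That relationship is inferred from the figures in the table, not stated by the page itself. A minimal Python sketch that checks it against three rows copied from the table (the function name `seconds_per_timestep` is hypothetical):

```python
# Rows copied from the trickle table above:
# (timestep, cpu_time_sec, reported_average_sec_per_ts)
rows = [
    (777_600, 1_247_759, 1.6046),
    (751_680, 1_205_609, 1.6039),
    (285_120, 454_958, 1.5957),
]


def seconds_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds per timestep, rounded to 4 decimals.

    Assumption: this is how the Average (sec/TS) column is derived.
    """
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in rows:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    # Compare with a small tolerance instead of exact float equality.
    matches = abs(computed - reported) < 1e-9
    print(f"{timestep:>8} {computed:.4f} {reported:.4f} {matches}")
```

Consecutive rows in the table also differ by 25,920 timesteps, which suggests that a trickle is sent at that fixed interval.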