Name | hadcm3n_zlgb_1920_40_008256623_0 |
Workunit | 8411747 |
Created | 16 Dec 2012, 22:00:14 UTC |
Sent | 16 Dec 2012, 22:01:56 UTC |
Report deadline | 18 Mar 2013, 5:29:07 UTC |
Received | 20 Jan 2013, 19:27:13 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 943847 |
Run time | 30 days 15 hours 7 min 9 sec |
CPU time | 24 days 13 hours 33 min 10 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 2.23 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7028, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Ocean Restart file copy failed on zlgbko.dac6440 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:46:38 (752): No heartbeat from core client for 30 sec - exiting 20:46:39 (752): No heartbeat from core client for 30 sec - exiting 20:46:40 (752): No heartbeat from core client for 30 sec - exiting 20:46:41 (752): No heartbeat from core client for 30 sec - exiting 20:46:42 (752): No heartbeat from core client for 30 sec - exiting 20:46:43 (752): No heartbeat from core client for 30 sec - exiting 20:46:44 (752): No heartbeat from core client for 30 sec - exiting 20:46:45 (752): No heartbeat from core client for 30 sec - exiting 20:46:46 (752): No heartbeat from core client for 30 sec - exiting 20:46:47 (752): No heartbeat from core client for 30 sec - exiting 20:46:48 (752): No heartbeat from core client for 30 sec - exiting 20:46:49 (752): No heartbeat from core client for 30 sec - exiting 20:46:50 (752): No heartbeat from core client for 30 sec - exiting 20:46:51 (752): No heartbeat from core client for 30 sec - exiting 20:46:52 (752): No heartbeat from core client for 30 sec - exiting 20:46:53 (752): No heartbeat from core client for 30 sec - exiting 20:46:54 (752): No heartbeat from core client for 30 sec - exiting 20:46:55 (752): No heartbeat from core client for 30 sec - exiting 20:46:56 (752): No heartbeat from core client for 30 sec - exiting 20:46:57 (752): No heartbeat from core client for 30 sec - exiting 20:46:58 (752): No heartbeat from core client for 30 sec - exiting 20:46:59 (752): No heartbeat from core client for 30 sec - exiting 20:47:00 (752): No heartbeat from core client for 30 sec - exiting 20:47:01 (752): No heartbeat from core client for 30 sec - exiting 20:47:02 (752): No heartbeat from core client for 30 sec - exiting 20:47:03 (752): No heartbeat from core client for 30 sec - exiting 20:47:04 (752): No heartbeat from core client for 30 sec - exiting 20:47:05 (752): No heartbeat from core client for 30 sec - exiting 20:47:06 (752): No heartbeat from core client for 30 sec - exiting 20:47:07 (752): No heartbeat from core client for 30 sec - exiting 20:47:08 (752): No heartbeat from core client for 30 sec - exiting 20:47:09 (752): No heartbeat from core client for 30 sec - exiting 20:47:10 (752): No heartbeat from core client for 30 sec - exiting 20:47:11 (752): No heartbeat from core client for 30 sec - exiting 20:47:12 (752): No heartbeat from core client for 30 sec - exiting 20:47:13 (752): No heartbeat from core client for 30 sec - exiting 20:47:14 (752): No heartbeat from core client for 30 sec - exiting 20:47:15 (752): No heartbeat from core client for 30 sec - exiting 20:47:16 (752): No heartbeat from core client for 30 sec - exiting 20:47:17 (752): No heartbeat from core client for 30 sec - exiting 20:47:18 (752): No heartbeat from core client for 30 sec - exiting 20:47:19 (752): No heartbeat from core client for 30 sec - exiting 20:47:20 (752): No heartbeat from core client for 30 sec - exiting 20:47:21 (752): No heartbeat from core client for 30 sec - exiting 20:47:22 (752): No heartbeat from core client for 30 sec - exiting 20:47:23 (752): No heartbeat from core client for 30 sec - exiting 20:47:24 (752): No heartbeat from core client for 30 sec - exiting 20:47:25 (752): No heartbeat from core client for 30 sec - exiting 20:47:26 (752): No heartbeat from core client for 30 sec - exiting 20:47:27 (752): No heartbeat from core client for 30 sec - exiting 20:47:28 (752): No heartbeat from core client for 30 sec - exiting 20:47:29 (752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:50:09 (5464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 09:31:34 (4084): No heartbeat from core client for 30 sec - exiting 09:31:36 (4084): No heartbeat from core client for 30 sec - exiting 09:31:37 (4084): No heartbeat from core client for 30 sec - exiting 09:31:38 (4084): No heartbeat from core client for 30 sec - exiting 09:31:39 (4084): No heartbeat from core client for 30 sec - exiting 09:31:40 (4084): No heartbeat from core client for 30 sec - exiting 09:31:41 (4084): No heartbeat from core client for 30 sec - exiting 09:31:42 (4084): No heartbeat from core client for 30 sec - exiting 09:31:43 (4084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/zlgbko.pje1c10 Error converting file to netcdf: dataout/zlgbko.pie1c10 Error converting file to netcdf: dataout/zlgbko.pfe1c10 Error converting file to netcdf: dataout/zlgbka.phe1c10 Error converting file to netcdf: dataout/zlgbka.pge1c10 Error converting file to netcdf: dataout/zlgbka.pee1c10 Error converting file to netcdf: dataout/zlgbka.pde1c10 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:55:53 (1988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 11:39:13 (5612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:44:19 (5612): No heartbeat from core client for 30 sec - exiting 18:42:35 (6088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Ocean Restart file copy failed on zlgbko.daf0560 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:58:44 (2236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:11:26 (5308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77893541 read attempt to address 0x40A6C301 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77893541 read attempt to address 0x40A6C301 Engaging BOINC Windows Runtime Debugger... Cannot serialize file D:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_zlgb_1920_40_008256623/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
20 Jan 2013 10:49:59 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 1,036,800 | 2,121,647 | 2.0463 |
19 Jan 2013 14:50:33 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 1,010,880 | 2,063,997 | 2.0418 |
18 Jan 2013 17:32:23 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 984,960 | 2,004,522 | 2.0351 |
17 Jan 2013 11:20:04 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 959,040 | 1,945,954 | 2.0291 |
16 Jan 2013 02:16:07 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 933,120 | 1,887,971 | 2.0233 |
14 Jan 2013 22:46:05 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 907,200 | 1,830,341 | 2.0176 |
14 Jan 2013 01:37:00 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 881,280 | 1,772,613 | 2.0114 |
12 Jan 2013 22:52:33 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 855,360 | 1,715,512 | 2.0056 |
12 Jan 2013 02:53:12 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 829,440 | 1,660,271 | 2.0017 |
11 Jan 2013 09:44:27 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 803,520 | 1,606,498 | 1.9993 |
10 Jan 2013 13:35:21 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 777,600 | 1,552,555 | 1.9966 |
08 Jan 2013 18:49:41 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 751,680 | 1,501,690 | 1.9978 |
08 Jan 2013 05:21:19 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 725,760 | 1,453,268 | 2.0024 |
07 Jan 2013 16:21:55 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 699,840 | 1,409,987 | 2.0147 |
05 Jan 2013 10:56:16 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 648,000 | 1,313,482 | 2.0270 |
04 Jan 2013 16:59:52 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 622,080 | 1,262,428 | 2.0294 |
03 Jan 2013 22:07:33 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 596,160 | 1,211,272 | 2.0318 |
03 Jan 2013 01:19:17 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 570,240 | 1,159,448 | 2.0333 |
02 Jan 2013 05:49:28 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 544,320 | 1,107,271 | 2.0342 |
01 Jan 2013 08:51:22 | 943847 | 15481347 | hadcm3n_zlgb_1920_40_008256623_0 | 518,400 | 1,055,903 | 2.0368 |
©2024 cpdn.org