Name | hadcm3n_y89e_1940_40_007858324_3 |
Workunit | 8013436 |
Created | 6 Apr 2012, 0:05:59 UTC |
Sent | 6 Apr 2012, 0:06:12 UTC |
Report deadline | 6 Jul 2012, 7:33:23 UTC |
Received | 21 Apr 2012, 22:19:22 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1157799 |
Run time | 11 days 22 hours 34 min 57 sec |
CPU time | 11 days 13 hours 2 min 41 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 2.92 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:19:19 (4128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:35:58 (6752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:49:27 (736): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CNo Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8248, selfPID=8248, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 08:01:23 (10412): No heartbeat from core client for 30 sec - exiting 08:01:24 (10412): No heartbeat from core client for 30 sec - exiting No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11216, selfPID=11216, iMonCtr=1 BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/y89eko.pjf7c10 Error converting file to netcdf: dataout/y89eko.pif7c10 Error converting file to netcdf: dataout/y89eko.pff7c10 Error converting file to netcdf: dataout/y89eka.phf7c10 Error converting file to netcdf: dataout/y89eka.pgf7c10 Error converting file to netcdf: dataout/y89eka.pef7c10 Error converting file to netcdf: dataout/y89eka.pdf7c10 10:34:11 (6808): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
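
The Run time and CPU time fields above are given as day/hour/minute/second strings, which makes them awkward to compare with the trickle table's CPU time in seconds. As a minimal sketch (the helper name `duration_to_seconds` is hypothetical and not part of BOINC or CPDN), the two values can be converted to seconds like this:

```python
import re

# Seconds per unit, for the unit words that appear in this page's duration fields.
UNIT_SECONDS = {"days": 86_400, "hours": 3_600, "min": 60, "sec": 1}

def duration_to_seconds(text: str) -> int:
    """Convert a string such as '11 days 22 hours 34 min 57 sec' to seconds.

    Assumes the exact unit spellings used on this page; other spellings
    (for example a singular 'day') are not handled.
    """
    total = 0
    for value, unit in re.findall(r"(\d+)\s+(days|hours|min|sec)", text):
        total += int(value) * UNIT_SECONDS[unit]
    return total

run_time = duration_to_seconds("11 days 22 hours 34 min 57 sec")
cpu_time = duration_to_seconds("11 days 13 hours 2 min 41 sec")

print(f"run time: {run_time:,} s")
print(f"CPU time: {cpu_time:,} s")
print(f"CPU / run: {cpu_time / run_time:.1%}")
```

The CPU time comes out at 997,361 seconds, a few seconds more than the 997,354 seconds reported in the last trickle.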
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
21 Apr 2012 21:20:26 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 518,400 | 997,354 | 1.9239 |
21 Apr 2012 07:40:00 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 492,480 | 948,328 | 1.9256 |
20 Apr 2012 05:33:36 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 466,560 | 901,307 | 1.9318 |
19 Apr 2012 15:32:51 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 440,640 | 854,568 | 1.9394 |
19 Apr 2012 02:00:42 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 414,720 | 808,483 | 1.9495 |
18 Apr 2012 10:09:41 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 388,800 | 759,487 | 1.9534 |
17 Apr 2012 17:55:58 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 362,880 | 710,793 | 1.9588 |
17 Apr 2012 04:29:20 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 336,960 | 666,406 | 1.9777 |
16 Apr 2012 09:39:36 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 311,040 | 620,522 | 1.9950 |
13 Apr 2012 15:24:01 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 285,120 | 571,195 | 2.0033 |
13 Apr 2012 00:07:41 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 259,200 | 519,380 | 2.0038 |
12 Apr 2012 06:52:12 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 233,280 | 467,043 | 2.0021 |
11 Apr 2012 13:24:01 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 207,360 | 414,944 | 2.0011 |
10 Apr 2012 22:24:59 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 181,440 | 362,980 | 2.0006 |
10 Apr 2012 07:00:34 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 155,520 | 311,400 | 2.0023 |
09 Apr 2012 15:58:50 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 129,600 | 259,549 | 2.0027 |
09 Apr 2012 00:45:47 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 103,680 | 207,624 | 2.0025 |
08 Apr 2012 09:39:42 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 77,760 | 155,787 | 2.0034 |
07 Apr 2012 18:30:28 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 51,840 | 104,050 | 2.0071 |
07 Apr 2012 03:51:47 | 1157799 | 14368174 | hadcm3n_y89e_1940_40_007858324_3 | 25,920 | 52,325 | 2.0187 |
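
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against a few rows copied from the table; the function name `sec_per_timestep` is illustrative only, and the formula is inferred from the figures rather than taken from CPDN documentation.

```python
# (timestep, cpu_time_sec, reported_average) copied from rows of the trickle table.
trickles = [
    (518_400, 997_354, 1.9239),
    (492_480, 948_328, 1.9256),
    (51_840, 104_050, 2.0071),
    (25_920, 52_325, 2.0187),
]

def sec_per_timestep(timestep: int, cpu_time_sec: int) -> float:
    """Cumulative CPU seconds divided by cumulative model timesteps."""
    return cpu_time_sec / timestep

for timestep, cpu_time_sec, reported in trickles:
    computed = sec_per_timestep(timestep, cpu_time_sec)
    print(f"{timestep:>7,} TS  {cpu_time_sec:>7,} s  "
          f"computed {computed:.4f}  reported {reported:.4f}")
```

For these rows the computed value matches the reported one to four decimal places. The timestep count rises by 25,920 from one row to the next, which suggests one trickle per 25,920 timesteps.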
©2024 cpdn.org