Name | hadcm3n_t5kk_1940_40_007616561_0 |
Workunit | 7794691 |
Created | 21 Dec 2011, 20:40:24 UTC |
Sent | 21 Dec 2011, 20:41:06 UTC |
Report deadline | 22 Mar 2012, 4:08:17 UTC |
Received | 12 Jan 2012, 12:53:02 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 283705 |
Run time | 18 days 6 hours 38 min 11 sec |
CPU time | 18 days 6 hours 38 min 11 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 0.79 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>5.2.13</core_client_version> <message>The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Suspended CPDN Monitor - Suspend request from BOINC... Signal 4 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2096, iMonCtr=1 Model crash detected, will try to restart... 02:31:36 (2096): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:31:37 (2096): No heartbeat from core client for 30 sec - exiting 00:44:46 (4088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
13:51:13 (592): No heartbeat from core client for 30 sec - exiting 13:51:15 (592): No heartbeat from core client for 30 sec - exiting 13:51:17 (592): No heartbeat from core client for 30 sec - exiting 13:51:18 (592): No heartbeat from core client for 30 sec - exiting 13:51:20 (592): No heartbeat from core client for 30 sec - exiting 13:51:21 (592): No heartbeat from core client for 30 sec - exiting 13:51:22 (592): No heartbeat from core client for 30 sec - exiting 13:51:23 (592): No heartbeat from core client for 30 sec - exiting 13:51:24 (592): No heartbeat from core client for 30 sec - exiting 13:51:26 (592): No heartbeat from core client for 30 sec - exiting 13:51:27 (592): No heartbeat from core client for 30 sec - exiting 13:51:28 (592): No heartbeat from core client for 30 sec - exiting 13:51:29 (592): No heartbeat from core client for 30 sec - exiting 13:51:30 (592): No heartbeat from core client for 30 sec - exiting 13:51:31 (592): No heartbeat from core client for 30 sec - exiting 13:51:33 (592): No heartbeat from core client for 30 sec - exiting 13:51:34 (592): No heartbeat from core client for 30 sec - exiting 13:51:35 (592): No heartbeat from core client for 30 sec - exiting 13:51:36 (592): No heartbeat from core client for 30 sec - exiting 13:51:37 (592): No heartbeat from core client for 30 sec - exiting 13:51:38 (592): No heartbeat from core client for 30 sec - exiting 13:51:39 (592): No heartbeat from core client for 30 sec - exiting 13:51:40 (592): No heartbeat from core client for 30 sec - exiting 13:51:41 (592): No heartbeat from core client for 30 sec - exiting 13:51:42 (592): No heartbeat from core client for 30 sec - exiting 13:51:43 (592): No heartbeat from core client for 30 sec - exiting 13:51:44 (592): No heartbeat from core client for 30 sec - exiting 13:51:46 (592): No heartbeat from core client for 30 sec - exiting 13:51:47 (592): No heartbeat from core client for 30 sec - exiting 13:51:48 (592): No heartbeat from core client for 30 sec 
- exiting 13:51:49 (592): No heartbeat from core client for 30 sec - exiting 13:51:50 (592): No heartbeat from core client for 30 sec - exiting 13:51:55 (592): No heartbeat from core client for 30 sec - exiting 13:51:56 (592): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... zip error: Could not create output file (was replacing the original zip file) cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_t5kk_1940_40_007616561/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_t5kk_1940_40_007616561/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_t5kk_1940_40_007616561/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_t5kk_1940_40_007616561/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_t5kk_1940_40_007616561/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_t5kk_1940_40_007616561/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_t5kk_1940_40_007616561/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_t5kk_1940_40_007616561/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\Program 
Files\BOINC/projects/climateprediction.net/hadcm3n_t5kk_1940_40_007616561/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_t5kk_1940_40_007616561/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_t5kk_1940_40_007616561/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_t5kk_1940_40_007616561/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
12 Jan 2012 11:54:05 | 283705 | 13804240 | hadcm3n_t5kk_1940_40_007616561_0 | 259,200 | 1,579,347 | 6.0932 |
10 Jan 2012 09:13:24 | 283705 | 13804240 | hadcm3n_t5kk_1940_40_007616561_0 | 233,280 | 1,409,482 | 6.0420 |
08 Jan 2012 07:24:36 | 283705 | 13804240 | hadcm3n_t5kk_1940_40_007616561_0 | 207,360 | 1,239,168 | 5.9759 |
06 Jan 2012 05:57:27 | 283705 | 13804240 | hadcm3n_t5kk_1940_40_007616561_0 | 181,440 | 1,069,171 | 5.8927 |
04 Jan 2012 01:09:13 | 283705 | 13804240 | hadcm3n_t5kk_1940_40_007616561_0 | 155,520 | 899,454 | 5.7835 |
01 Jan 2012 23:42:41 | 283705 | 13804240 | hadcm3n_t5kk_1940_40_007616561_0 | 129,600 | 730,540 | 5.6369 |
30 Dec 2011 22:48:33 | 283705 | 13804240 | hadcm3n_t5kk_1940_40_007616561_0 | 103,680 | 561,990 | 5.4204 |
28 Dec 2011 01:38:04 | 283705 | 13804240 | hadcm3n_t5kk_1940_40_007616561_0 | 77,760 | 392,514 | 5.0478 |
25 Dec 2011 23:38:19 | 283705 | 13804240 | hadcm3n_t5kk_1940_40_007616561_0 | 51,840 | 222,843 | 4.2987 |
23 Dec 2011 22:22:49 | 283705 | 13804240 | hadcm3n_t5kk_1940_40_007616561_0 | 25,920 | 53,309 | 2.0567 |
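
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. That relationship is inferred from the rows above, not documented on this page, so the following is a minimal sketch under that assumption. The function name `seconds_per_timestep` and the `trickles` list are hypothetical names introduced for illustration; the numbers are copied from the table.

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Average CPU seconds per model timestep, rounded to 4 decimals.

    Assumes (inferred from the table, not from project documentation)
    that both inputs are cumulative totals since the start of the run.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec, reported_average) copied from the trickle table
trickles = [
    (259_200, 1_579_347, 6.0932),
    (233_280, 1_409_482, 6.0420),
    (51_840, 222_843, 4.2987),
    (25_920, 53_309, 2.0567),
]

for timestep, cpu_time_sec, reported in trickles:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>8,} TS  computed={computed:.4f}  reported={reported:.4f}")
```

Read this way, the rising average (from about 2.06 to about 6.09 sec/TS) means later timesteps cost more CPU time than early ones on this host. The table alone does not say why.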