Name | hadcm3n_y9sg_1940_40_007433566_2 |
Workunit | 7631069 |
Created | 31 Aug 2011, 23:52:17 UTC |
Sent | 31 Aug 2011, 23:56:15 UTC |
Report deadline | 1 Dec 2011, 7:23:26 UTC |
Received | 15 Sep 2011, 3:17:30 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1160749 |
Run time | 5 days 16 hours 42 min 37 sec |
CPU time | 5 days 10 hours 28 min 3 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 2.63 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.33</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:22:07 (400): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
07:22:08 (400): No heartbeat from core client for 30 sec - exiting 07:22:09 (400): No heartbeat from core client for 30 sec - exiting 07:22:10 (400): No heartbeat from core client for 30 sec - exiting 07:22:11 (400): No heartbeat from core client for 30 sec - exiting 07:22:12 (400): No heartbeat from core client for 30 sec - exiting 07:22:13 (400): No heartbeat from core client for 30 sec - exiting 07:22:14 (400): No heartbeat from core client for 30 sec - exiting 07:22:16 (400): No heartbeat from core client for 30 sec - exiting 07:22:17 (400): No heartbeat from core client for 30 sec - exiting 07:22:18 (400): No heartbeat from core client for 30 sec - exiting 07:22:19 (400): No heartbeat from core client for 30 sec - exiting 07:22:20 (400): No heartbeat from core client for 30 sec - exiting 07:22:21 (400): No heartbeat from core client for 30 sec - exiting 07:22:22 (400): No heartbeat from core client for 30 sec - exiting 07:22:23 (400): No heartbeat from core client for 30 sec - exiting 07:22:24 (400): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:37:42 (7536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Ocean Restart file copy failed on y9sgko.dae1cp0 Ocean Restart file copy failed on y9sgko.dae1cq0 Ocean Restart file copy failed on y9sgko.dae1cr0 Ocean Restart file copy failed on y9sgko.dae1cs0 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Ocean Restart file copy failed on y9sgko.dae3b30 Ocean Restart file copy failed on y9sgko.dae3b40 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Ocean Restart file copy failed on y9sgko.dae58d0 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Ocean Restart file copy failed on y9sgko.dae5ah0 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold Ocean Restart file copy failed on y9sgko.dae8bo0 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 23:24:53 (8172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:24:55 (8172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1400, selfPID=1400, iMonCtr=1 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_y9sg_1940_40_007433566/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_y9sg_1940_40_007433566/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_y9sg_1940_40_007433566/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_y9sg_1940_40_007433566/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_y9sg_1940_40_007433566/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_y9sg_1940_40_007433566/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_y9sg_1940_40_007433566/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_y9sg_1940_40_007433566/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_y9sg_1940_40_007433566/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_y9sg_1940_40_007433566/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O 
error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_y9sg_1940_40_007433566/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_y9sg_1940_40_007433566/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
12 Sep 2011 02:07:02 | 1160749 | 13319980 | hadcm3n_y9sg_1940_40_007433566_2 | 259,200 | 469,749 | 1.8123 |
11 Sep 2011 10:59:00 | 1160749 | 13319980 | hadcm3n_y9sg_1940_40_007433566_2 | 233,280 | 422,077 | 1.8093 |
10 Sep 2011 17:19:52 | 1160749 | 13319980 | hadcm3n_y9sg_1940_40_007433566_2 | 207,360 | 374,482 | 1.8060 |
10 Sep 2011 00:39:18 | 1160749 | 13319980 | hadcm3n_y9sg_1940_40_007433566_2 | 181,440 | 327,117 | 1.8029 |
09 Sep 2011 09:38:16 | 1160749 | 13319980 | hadcm3n_y9sg_1940_40_007433566_2 | 155,520 | 279,097 | 1.7946 |
08 Sep 2011 19:05:47 | 1160749 | 13319980 | hadcm3n_y9sg_1940_40_007433566_2 | 129,600 | 231,151 | 1.7836 |
08 Sep 2011 04:52:53 | 1160749 | 13319980 | hadcm3n_y9sg_1940_40_007433566_2 | 103,680 | 184,307 | 1.7777 |
07 Sep 2011 15:41:05 | 1160749 | 13319980 | hadcm3n_y9sg_1940_40_007433566_2 | 77,760 | 138,456 | 1.7806 |
07 Sep 2011 01:51:36 | 1160749 | 13319980 | hadcm3n_y9sg_1940_40_007433566_2 | 51,840 | 92,533 | 1.7850 |
06 Sep 2011 12:41:35 | 1160749 | 13319980 | hadcm3n_y9sg_1940_40_007433566_2 | 25,920 | 46,908 | 1.8097 |
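
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. That relationship is an assumption inferred from the rows above, not something the page states. A minimal sketch that recomputes it for three of the rows (the function name `sec_per_timestep` is hypothetical):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
trickles = [
    (259_200, 469_749, 1.8123),
    (233_280, 422_077, 1.8093),
    (25_920, 46_908, 1.8097),
]


def sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in trickles:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    # Print the recomputed value next to the value reported in the table.
    print(f"{timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

Under that assumption, the recomputed values agree with the reported column to four decimal places for these rows.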