Name | hadcm3n_yhpl_1940_40_007834476_2 |
Workunit | 7989588 |
Created | 20 Mar 2012, 21:20:02 UTC |
Sent | 20 Mar 2012, 21:20:07 UTC |
Report deadline | 20 Jun 2012, 4:47:18 UTC |
Received | 8 Apr 2012, 20:53:06 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1109026 |
Run time | 10 days 7 hours 15 min 35 sec |
CPU time | 9 days 6 hours 0 min 43 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 2.51 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... 19:29:45 (4256): No heartbeat from core client for 30 sec - exiting 19:29:46 (4256): No heartbeat from core client for 30 sec - exiting 19:29:47 (4256): No heartbeat from core client for 30 sec - exiting 19:29:48 (4256): No heartbeat from core client for 30 sec - exiting 19:29:49 (4256): No heartbeat from core client for 30 sec - exiting 19:29:50 (4256): No heartbeat from core client for 30 sec - exiting 19:29:51 (4256): No heartbeat from core client for 30 sec - exiting 19:29:52 (4256): No heartbeat from core client for 30 sec - exiting 19:29:53 (4256): No heartbeat from core client for 30 sec - exiting 19:29:54 (4256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:29:55 (4256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 02:21:34 (4008): No heartbeat from core client for 30 sec - exiting 02:21:35 (4008): No heartbeat from core client for 30 sec - exiting 02:21:36 (4008): No heartbeat from core client for 30 sec - exiting 02:21:37 (4008): No heartbeat from core client for 30 sec - exiting 02:21:38 (4008): No heartbeat from core client for 30 sec - exiting 02:21:39 (4008): No heartbeat from core client for 30 sec - exiting 02:21:40 (4008): No heartbeat from core client for 30 sec - exiting 02:21:41 (4008): No heartbeat from core client for 30 sec - exiting 02:21:42 (4008): No heartbeat from core client for 30 sec - exiting 02:21:43 (4008): No heartbeat from core client for 30 sec - exiting 02:21:44 (4008): No heartbeat from core client for 30 sec - exiting 02:21:45 (4008): No heartbeat from core client for 30 sec - exiting 02:21:46 (4008): No heartbeat from core client for 30 sec - exiting 02:21:47 (4008): No heartbeat from core client for 30 sec - exiting 02:21:48 (4008): No heartbeat from core client for 30 sec - exiting 02:21:49 (4008): No heartbeat from core client for 30 sec - exiting 02:21:50 (4008): No heartbeat from core client for 30 sec - exiting 02:21:51 (4008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:18:20 (4340): No heartbeat from core client for 30 sec - exiting 21:18:21 (4340): No heartbeat from core client for 30 sec - exiting 21:18:22 (4340): No heartbeat from core client for 30 sec - exiting 21:18:23 (4340): No heartbeat from core client for 30 sec - exiting 21:18:24 (4340): No heartbeat from core client for 30 sec - exiting 21:18:26 (4340): No heartbeat from core client for 30 sec - exiting 21:18:27 (4340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 01:01:32 (2432): No heartbeat from core client for 30 sec - exiting 01:01:33 (2432): No heartbeat from core client for 30 sec - exiting 01:01:34 (2432): No heartbeat from core client for 30 sec - exiting 01:01:35 (2432): No heartbeat from core client for 30 sec - exiting 01:01:36 (2432): No heartbeat from core client for 30 sec - exiting 01:01:37 (2432): No heartbeat from core client for 30 sec - exiting 01:01:38 (2432): No heartbeat from core client for 30 sec - exiting 01:01:40 (2432): No heartbeat from core client for 30 sec - exiting 01:01:41 (2432): No heartbeat from core client for 30 sec - exiting 01:01:42 (2432): No heartbeat from core client for 30 sec - exiting 01:01:43 (2432): No heartbeat from core client for 30 sec - exiting 01:01:44 (2432): No heartbeat from core client for 30 sec - exiting 01:01:45 (2432): No heartbeat from core client for 30 sec - exiting 01:01:46 (2432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:01:47 (2432): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 22:29:05 (2796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 23:24:19 (1196): No heartbeat from core client for 30 sec - exiting 23:24:20 (1196): No heartbeat from core client for 30 sec - exiting 23:24:21 (1196): No heartbeat from core client for 30 sec - exiting 23:24:22 (1196): No heartbeat from core client for 30 sec - exiting 23:24:23 (1196): No heartbeat from core client for 30 sec - exiting 23:24:24 (1196): No heartbeat from core client for 30 sec - exiting 23:24:25 (1196): No heartbeat from core client for 30 sec - exiting 23:24:26 (1196): No heartbeat from core client for 30 sec - exiting 23:24:27 (1196): No heartbeat from core client for 30 sec - exiting 23:24:28 (1196): No heartbeat from core client for 30 sec - exiting 23:24:29 (1196): No heartbeat from core client for 30 sec - exiting 23:24:30 (1196): No heartbeat from core client for 30 sec - exiting 23:24:31 (1196): No heartbeat from core client for 30 sec - exiting 23:24:32 (1196): No heartbeat from core client for 30 sec - exiting 23:24:33 (1196): No heartbeat from core client for 30 sec - exiting 23:24:34 (1196): No heartbeat from core client for 30 sec - exiting 23:24:35 (1196): No heartbeat from core client for 30 sec - exiting 23:24:36 (1196): No heartbeat from core client for 30 sec - exiting 23:24:37 (1196): No heartbeat from core client for 30 sec - exiting 23:24:38 (1196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 23:00:12 (3144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 04:10:31 (316): No heartbeat from core client for 30 sec - exiting 04:10:32 (316): No heartbeat from core client for 30 sec - exiting 04:10:33 (316): No heartbeat from core client for 30 sec - exiting 04:10:34 (316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:10:35 (316): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 04:13:50 (3384): No heartbeat from core client for 30 sec - exiting 04:13:51 (3384): No heartbeat from core client for 30 sec - exiting 04:13:52 (3384): No heartbeat from core client for 30 sec - exiting 04:13:53 (3384): No heartbeat from core client for 30 sec - exiting 04:13:54 (3384): No heartbeat from core client for 30 sec - exiting 04:13:55 (3384): No heartbeat from core client for 30 sec - exiting 04:13:56 (3384): No heartbeat from core client for 30 sec - exiting 04:13:57 (3384): No heartbeat from core client for 30 sec - exiting 04:13:58 (3384): No heartbeat from core client for 30 sec - exiting 04:13:59 (3384): No heartbeat from core client for 30 sec - exiting 04:14:00 (3384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=308, selfPID=308, iMonCtr=1 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yhpl_1940_40_007834476/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yhpl_1940_40_007834476/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yhpl_1940_40_007834476/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yhpl_1940_40_007834476/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yhpl_1940_40_007834476/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yhpl_1940_40_007834476/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yhpl_1940_40_007834476/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yhpl_1940_40_007834476/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yhpl_1940_40_007834476/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yhpl_1940_40_007834476/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yhpl_1940_40_007834476/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yhpl_1940_40_007834476/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
08 Apr 2012 20:55:09 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 518,400 | 799,321 | 1.5419 |
08 Apr 2012 00:45:07 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 492,480 | 759,351 | 1.5419 |
06 Apr 2012 08:25:46 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 466,560 | 719,339 | 1.5418 |
05 Apr 2012 05:52:46 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 440,640 | 680,654 | 1.5447 |
04 Apr 2012 07:40:52 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 414,720 | 641,836 | 1.5476 |
03 Apr 2012 06:21:39 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 388,800 | 602,348 | 1.5492 |
02 Apr 2012 03:07:45 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 362,880 | 562,421 | 1.5499 |
01 Apr 2012 15:51:54 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 336,960 | 521,117 | 1.5465 |
31 Mar 2012 11:22:35 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 311,040 | 480,828 | 1.5459 |
30 Mar 2012 07:09:34 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 285,120 | 440,400 | 1.5446 |
29 Mar 2012 06:02:43 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 259,200 | 400,974 | 1.5470 |
27 Mar 2012 14:51:40 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 233,280 | 360,933 | 1.5472 |
26 Mar 2012 13:52:40 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 207,360 | 321,437 | 1.5501 |
25 Mar 2012 20:57:34 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 181,440 | 282,094 | 1.5548 |
25 Mar 2012 09:09:55 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 155,520 | 242,400 | 1.5586 |
24 Mar 2012 13:36:25 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 129,600 | 202,616 | 1.5634 |
23 Mar 2012 08:40:31 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 103,680 | 162,328 | 1.5657 |
22 Mar 2012 17:46:59 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 77,760 | 122,152 | 1.5709 |
21 Mar 2012 23:30:47 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 51,840 | 81,672 | 1.5755 |
21 Mar 2012 11:48:31 | 1109026 | 14298890 | hadcm3n_yhpl_1940_40_007834476_2 | 25,920 | 41,265 | 1.5920 |
©2024 cpdn.org