Name | hadcm3n_o10i_1940_40_007693482_2 |
Workunit | 7848590 |
Created | 23 Jan 2012, 18:17:28 UTC |
Sent | 23 Jan 2012, 18:19:01 UTC |
Report deadline | 24 Apr 2012, 1:46:12 UTC |
Received | 7 Feb 2012, 14:10:18 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1028748 |
Run time | 6 days 9 hours 9 min 42 sec |
CPU time | 5 days 8 hours 53 min 11 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 2.25 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 05:29:38 (6228): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:20:14 (5016): No heartbeat from core client for 30 sec - exiting 16:20:15 (5016): No heartbeat from core client for 30 sec - exiting 16:20:16 (5016): No heartbeat from core client for 30 sec - exiting 16:20:17 (5016): No heartbeat from core client for 30 sec - exiting 16:20:18 (5016): No heartbeat from core client for 30 sec - exiting 16:20:19 (5016): No heartbeat from core client for 30 sec - exiting 16:20:20 (5016): No heartbeat from core client for 30 sec - exiting 16:20:21 (5016): No heartbeat from core client for 30 sec - exiting 16:20:22 (5016): No heartbeat from core client for 30 sec - exiting 16:20:23 (5016): No heartbeat from core client for 30 sec - exiting 16:20:24 (5016): No heartbeat from core client for 30 sec - exiting 16:20:25 (5016): No heartbeat from core client for 30 sec - exiting 16:20:26 (5016): No heartbeat from core client for 30 sec - exiting 16:20:27 (5016): No heartbeat from core client for 30 sec - exiting 16:20:28 (5016): No heartbeat from core client for 30 sec - exiting 16:20:29 (5016): No heartbeat from core client for 30 sec - exiting 16:20:30 (5016): No heartbeat from core client for 30 sec - exiting 16:20:31 (5016): No heartbeat from core client for 30 sec - exiting 16:20:32 (5016): No heartbeat from core client for 30 sec - exiting 16:20:33 (5016): No heartbeat from core client for 30 sec - exiting 16:20:34 (5016): No heartbeat from core client for 30 sec - exiting 16:20:35 (5016): No heartbeat from core client for 30 sec - exiting 16:20:36 (5016): No heartbeat from core client for 30 sec - exiting 16:20:37 (5016): No heartbeat from core client for 30 sec - exiting 16:20:38 (5016): No heartbeat from core client for 30 sec - exiting 16:20:39 (5016): No heartbeat from core client for 30 sec - exiting 16:20:40 (5016): No heartbeat from core client for 30 sec - exiting 16:20:41 (5016): No heartbeat from core client for 30 sec - exiting 16:20:42 (5016): No heartbeat from core client for 30 sec - exiting 16:20:43 (5016): No heartbeat from core client for 30 sec - exiting 16:20:44 (5016): No heartbeat from core client for 30 sec - exiting 16:20:45 (5016): No heartbeat from core client for 30 sec - exiting 16:20:46 (5016): No heartbeat from core client for 30 sec - exiting 16:20:47 (5016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:52:19 (2688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:57:16 (5716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2552, iMonCtr=1 Model crash detected, will try to restart... 21:30:54 (336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:30:55 (336): No heartbeat from core client for 30 sec - exiting 22:21:46 (3152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 04:55:02 (3688): No heartbeat from core client for 30 sec - exiting 04:55:03 (3688): No heartbeat from core client for 30 sec - exiting 04:55:04 (3688): No heartbeat from core client for 30 sec - exiting 04:55:05 (3688): No heartbeat from core client for 30 sec - exiting 04:55:06 (3688): No heartbeat from core client for 30 sec - exiting 04:55:07 (3688): No heartbeat from core client for 30 sec - exiting 04:55:08 (3688): No heartbeat from core client for 30 sec - exiting 04:55:09 (3688): No heartbeat from core client for 30 sec - exiting 04:55:10 (3688): No heartbeat from core client for 30 sec - exiting 04:55:11 (3688): No heartbeat from core client for 30 sec - exiting 04:55:12 (3688): No heartbeat from core client for 30 sec - exiting 04:55:13 (3688): No heartbeat from core client for 30 sec - exiting 04:55:14 (3688): No heartbeat from core client for 30 sec - exiting 04:55:15 (3688): No heartbeat from core client for 30 sec - exiting 04:55:16 (3688): No heartbeat from core client for 30 sec - exiting 04:55:17 (3688): No heartbeat from core client for 30 sec - exiting 04:55:18 (3688): No heartbeat from core client for 30 sec - exiting 04:55:19 (3688): No heartbeat from core client for 30 sec - exiting 04:55:20 (3688): No heartbeat from core client for 30 sec - exiting 04:55:21 (3688): No heartbeat from core client for 30 sec - exiting 04:55:22 (3688): No heartbeat from core client for 30 sec - exiting 04:55:23 (3688): No heartbeat from core client for 30 sec - exiting 04:55:24 (3688): No heartbeat from core client for 30 sec - exiting 04:55:25 (3688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:55:26 (3688): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 21:27:46 (5156): No heartbeat from core client for 30 sec - exiting 21:27:47 (5156): No heartbeat from core client for 30 sec - exiting 21:27:48 (5156): No heartbeat from core client for 30 sec - exiting 21:27:49 (5156): No heartbeat from core client for 30 sec - exiting 21:27:50 (5156): No heartbeat from core client for 30 sec - exiting 21:27:51 (5156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:21:59 (5448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:14:56 (3612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:14:57 (3612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 02:27:29 (1328): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:00:28 (4980): No heartbeat from core client for 30 sec - exiting 07:00:29 (4980): No heartbeat from core client for 30 sec - exiting 07:00:30 (4980): No heartbeat from core client for 30 sec - exiting 07:00:31 (4980): No heartbeat from core client for 30 sec - exiting 07:00:32 (4980): No heartbeat from core client for 30 sec - exiting 07:00:33 (4980): No heartbeat from core client for 30 sec - exiting 07:00:34 (4980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... zip error: Could not create output file (was replacing the original zip file) cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o10i_1940_40_007693482/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o10i_1940_40_007693482/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o10i_1940_40_007693482/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o10i_1940_40_007693482/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o10i_1940_40_007693482/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o10i_1940_40_007693482/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o10i_1940_40_007693482/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o10i_1940_40_007693482/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o10i_1940_40_007693482/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o10i_1940_40_007693482/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o10i_1940_40_007693482/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_o10i_1940_40_007693482/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
07 Feb 2012 13:05:51 | 1028748 | 13955843 | hadcm3n_o10i_1940_40_007693482_2 | 259,200 | 464,045 | 1.7903 |
06 Feb 2012 20:48:22 | 1028748 | 13955843 | hadcm3n_o10i_1940_40_007693482_2 | 233,280 | 417,263 | 1.7887 |
06 Feb 2012 04:45:15 | 1028748 | 13955843 | hadcm3n_o10i_1940_40_007693482_2 | 207,360 | 370,751 | 1.7880 |
05 Feb 2012 12:58:56 | 1028748 | 13955843 | hadcm3n_o10i_1940_40_007693482_2 | 181,440 | 324,098 | 1.7863 |
04 Feb 2012 20:26:19 | 1028748 | 13955843 | hadcm3n_o10i_1940_40_007693482_2 | 155,520 | 277,849 | 1.7866 |
04 Feb 2012 06:03:20 | 1028748 | 13955843 | hadcm3n_o10i_1940_40_007693482_2 | 129,600 | 231,447 | 1.7859 |
03 Feb 2012 15:00:43 | 1028748 | 13955843 | hadcm3n_o10i_1940_40_007693482_2 | 103,680 | 184,947 | 1.7838 |
02 Feb 2012 13:43:15 | 1028748 | 13955843 | hadcm3n_o10i_1940_40_007693482_2 | 77,760 | 138,662 | 1.7832 |
01 Feb 2012 23:02:13 | 1028748 | 13955843 | hadcm3n_o10i_1940_40_007693482_2 | 51,840 | 92,580 | 1.7859 |
01 Feb 2012 08:19:16 | 1028748 | 13955843 | hadcm3n_o10i_1940_40_007693482_2 | 25,920 | 46,413 | 1.7906 |
©2024 cpdn.org