Name | hadcm3n_zjl6_1920_40_008316491_4 |
Workunit | 8467626 |
Created | 30 Mar 2013, 4:13:46 UTC |
Sent | 30 Mar 2013, 4:14:58 UTC |
Report deadline | 29 Jun 2013, 11:42:09 UTC |
Received | 9 Apr 2013, 5:08:44 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1237173 |
Run time | 1 days 8 hours 13 min 40 sec |
CPU time | 1 days 0 hours 50 min 12 sec |
Validate state | Invalid |
Credit | 933.12 |
Device peak FLOPS | 3.68 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 18:10:42 (4104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 02:48:45 (20664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:53:54 (24328): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:54:08 (24328): No heartbeat from core client for 30 sec - exiting 02:54:09 (24328): No heartbeat from core client for 30 sec - exiting 02:54:10 (24328): No heartbeat from core client for 30 sec - exiting 02:54:11 (24328): No heartbeat from core client for 30 sec - exiting 02:54:12 (24328): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 02:57:00 (9188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:57:01 (9188): No heartbeat from core client for 30 sec - exiting 02:57:02 (9188): No heartbeat from core client for 30 sec - exiting 03:08:05 (26172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
03:08:08 (26172): No heartbeat from core client for 30 sec - exiting 03:08:09 (26172): No heartbeat from core client for 30 sec - exiting 03:08:10 (26172): No heartbeat from core client for 30 sec - exiting 03:08:11 (26172): No heartbeat from core client for 30 sec - exiting 03:08:12 (26172): No heartbeat from core client for 30 sec - exiting 03:08:13 (26172): No heartbeat from core client for 30 sec - exiting 03:08:14 (26172): No heartbeat from core client for 30 sec - exiting 03:08:15 (26172): No heartbeat from core client for 30 sec - exiting 03:08:16 (26172): No heartbeat from core client for 30 sec - exiting 03:08:17 (26172): No heartbeat from core client for 30 sec - exiting 03:15:22 (23944): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:15:27 (23944): No heartbeat from core client for 30 sec - exiting 03:15:28 (23944): No heartbeat from core client for 30 sec - exiting 03:15:29 (23944): No heartbeat from core client for 30 sec - exiting 03:15:30 (23944): No heartbeat from core client for 30 sec - exiting 03:15:31 (23944): No heartbeat from core client for 30 sec - exiting 03:15:32 (23944): No heartbeat from core client for 30 sec - exiting 03:15:33 (23944): No heartbeat from core client for 30 sec - exiting 03:15:34 (23944): No heartbeat from core client for 30 sec - exiting 03:15:35 (23944): No heartbeat from core client for 30 sec - exiting 03:26:42 (16396): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 03:39:46 (27576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
03:39:47 (27576): No heartbeat from core client for 30 sec - exiting 03:39:48 (27576): No heartbeat from core client for 30 sec - exiting 03:39:49 (27576): No heartbeat from core client for 30 sec - exiting 03:39:50 (27576): No heartbeat from core client for 30 sec - exiting 03:49:51 (22704): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:49:53 (22704): No heartbeat from core client for 30 sec - exiting 03:49:54 (22704): No heartbeat from core client for 30 sec - exiting 03:49:55 (22704): No heartbeat from core client for 30 sec - exiting 03:49:56 (22704): No heartbeat from core client for 30 sec - exiting 03:49:57 (22704): No heartbeat from core client for 30 sec - exiting 03:49:58 (22704): No heartbeat from core client for 30 sec - exiting 03:49:59 (22704): No heartbeat from core client for 30 sec - exiting 03:50:00 (22704): No heartbeat from core client for 30 sec - exiting 03:50:01 (22704): No heartbeat from core client for 30 sec - exiting 03:50:02 (22704): No heartbeat from core client for 30 sec - exiting 03:53:04 (26696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:06:09 (24496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:21:54 (26884): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 05:28:25 (21956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:38:03 (28620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 06:26:44 (26320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
06:26:45 (26320): No heartbeat from core client for 30 sec - exiting 06:26:46 (26320): No heartbeat from core client for 30 sec - exiting 06:26:47 (26320): No heartbeat from core client for 30 sec - exiting 06:26:48 (26320): No heartbeat from core client for 30 sec - exiting 06:26:49 (26320): No heartbeat from core client for 30 sec - exiting 06:26:50 (26320): No heartbeat from core client for 30 sec - exiting 06:26:51 (26320): No heartbeat from core client for 30 sec - exiting 06:26:52 (26320): No heartbeat from core client for 30 sec - exiting 06:26:53 (26320): No heartbeat from core client for 30 sec - exiting 06:26:54 (26320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:10:42 (39788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 19:43:00 (41872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 20:09:26 (44848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:50:07 (44388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
20:50:08 (44388): No heartbeat from core client for 30 sec - exiting 20:50:09 (44388): No heartbeat from core client for 30 sec - exiting 20:50:10 (44388): No heartbeat from core client for 30 sec - exiting 20:50:11 (44388): No heartbeat from core client for 30 sec - exiting 20:50:12 (44388): No heartbeat from core client for 30 sec - exiting 20:50:13 (44388): No heartbeat from core client for 30 sec - exiting 20:50:14 (44388): No heartbeat from core client for 30 sec - exiting 20:50:15 (44388): No heartbeat from core client for 30 sec - exiting 20:50:16 (44388): No heartbeat from core client for 30 sec - exiting 20:50:17 (44388): No heartbeat from core client for 30 sec - exiting 20:53:30 (45136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:53:46 (45136): No heartbeat from core client for 30 sec - exiting 20:53:47 (45136): No heartbeat from core client for 30 sec - exiting 20:53:48 (45136): No heartbeat from core client for 30 sec - exiting 20:53:49 (45136): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 20:55:51 (38708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:55:52 (38708): No heartbeat from core client for 30 sec - exiting 20:55:53 (38708): No heartbeat from core client for 30 sec - exiting 20:55:54 (38708): No heartbeat from core client for 30 sec - exiting 20:55:55 (38708): No heartbeat from core client for 30 sec - exiting 20:55:56 (38708): No heartbeat from core client for 30 sec - exiting 20:55:57 (38708): No heartbeat from core client for 30 sec - exiting 20:55:58 (38708): No heartbeat from core client for 30 sec - exiting 20:55:59 (38708): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 20:58:23 (35040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
20:58:24 (35040): No heartbeat from core client for 30 sec - exiting 20:58:25 (35040): No heartbeat from core client for 30 sec - exiting 20:58:26 (35040): No heartbeat from core client for 30 sec - exiting 20:58:27 (35040): No heartbeat from core client for 30 sec - exiting 20:58:28 (35040): No heartbeat from core client for 30 sec - exiting 20:58:29 (35040): No heartbeat from core client for 30 sec - exiting 20:58:30 (35040): No heartbeat from core client for 30 sec - exiting 20:58:31 (35040): No heartbeat from core client for 30 sec - exiting 20:58:32 (35040): No heartbeat from core client for 30 sec - exiting 20:58:33 (35040): No heartbeat from core client for 30 sec - exiting 21:00:39 (43500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:00:51 (43500): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 21:02:13 (46912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:02:14 (46912): No heartbeat from core client for 30 sec - exiting 21:02:15 (46912): No heartbeat from core client for 30 sec - exiting 21:02:16 (46912): No heartbeat from core client for 30 sec - exiting 21:02:17 (46912): No heartbeat from core client for 30 sec - exiting 21:02:18 (46912): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 21:33:10 (46148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:33:11 (46148): No heartbeat from core client for 30 sec - exiting 22:44:02 (47032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:10:37 (43412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
01:10:53 (43412): No heartbeat from core client for 30 sec - exiting 03:28:56 (43380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:28:59 (43380): No heartbeat from core client for 30 sec - exiting 03:29:00 (43380): No heartbeat from core client for 30 sec - exiting 03:29:01 (43380): No heartbeat from core client for 30 sec - exiting 03:29:02 (43380): No heartbeat from core client for 30 sec - exiting 03:32:48 (49676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:44:46 (48960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50200, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50200, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50200, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50200, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50200, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50200, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
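
The stderr text above is dominated by repeated "No heartbeat from core client" messages. A minimal sketch of how they could be tallied, assuming each message has the shape `HH:MM:SS (N): No heartbeat from core client for 30 sec - exiting` and that the parenthesized number is a process ID (the function name `heartbeat_timeouts_by_pid` is hypothetical, and the sample is a short excerpt of the log, not the full text):

```python
import re
from collections import Counter

# Assumed message shape, taken from the stderr text above:
#   HH:MM:SS (N): No heartbeat from core client for 30 sec - exiting
# N is assumed to be the process ID of the reporting process.
HEARTBEAT_RE = re.compile(
    r"(\d{2}:\d{2}:\d{2}) \((\d+)\): "
    r"No heartbeat from core client for 30 sec - exiting"
)


def heartbeat_timeouts_by_pid(stderr_text):
    """Count 'No heartbeat' messages per parenthesized ID."""
    return Counter(pid for _time, pid in HEARTBEAT_RE.findall(stderr_text))


# Short excerpt copied from the log above (not the full stderr).
sample = (
    "02:57:00 (9188): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "02:57:01 (9188): No heartbeat from core client for 30 sec - exiting "
    "02:57:02 (9188): No heartbeat from core client for 30 sec - exiting "
    "03:26:42 (16396): No heartbeat from core client for 30 sec - exiting "
)

counts = heartbeat_timeouts_by_pid(sample)
for pid, n in counts.most_common():
    print(pid, n)
```

Grouping by the parenthesized number makes it easier to see how many distinct restarts occurred, as opposed to how many times a single process repeated the message before exiting.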

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
09 Apr 2013 05:13:44 | 1237173 | 15694896 | hadcm3n_zjl6_1920_40_008316491_4 | 77,760 | 79,696 | 1.0249 |
06 Apr 2013 08:38:36 | 1237173 | 15694896 | hadcm3n_zjl6_1920_40_008316491_4 | 51,840 | 56,205 | 1.0842 |
05 Apr 2013 09:24:43 | 1237173 | 15694896 | hadcm3n_zjl6_1920_40_008316491_4 | 25,920 | 28,301 | 1.0919 |
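
The "Average (sec/TS)" column is consistent with CPU time divided by timestep count, rounded to four decimal places. A small sketch that checks this against the three rows above; the division is an inference from the column values rather than a documented formula, and the helper name `sec_per_timestep` is hypothetical:

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average)
trickles = [
    (77_760, 79_696, 1.0249),
    (51_840, 56_205, 1.0842),
    (25_920, 28_301, 1.0919),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Average CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: the reported column is simply cpu_time / timestep.
    """
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in trickles:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    # Compare with a small tolerance to avoid float representation issues.
    matches = abs(computed - reported) < 1e-6
    print(timestep, computed, reported, matches)
```

All three rows agree with the reported values under this assumption.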