Name | hadam3p_pnw_6zsh_2004_1_007594969_0 |
Workunit | 7773099 |
Created | 5 Dec 2011, 11:42:05 UTC |
Sent | 5 Dec 2011, 11:45:00 UTC |
Report deadline | 16 Nov 2012, 17:05:00 UTC |
Received | 12 Dec 2011, 9:07:30 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1088462 |
Run time | 3 days 2 hours 39 min 45 sec |
CPU time | 2 days 18 hours 9 min 14 sec |
Validate state | Invalid |
Credit | 1,503.98 |
Device peak FLOPS | 2.65 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.33</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> 17:30:28 (4600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=35Suspended CPDN Monitor - Suspend request from BOINC... 11:48:16 (1968): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: C06:01:20 (3452): No heartbeat from core client for 30 sec - exiting 06:01:21 (3452): No heartbeat from core client for 30 sec - exiting 06:01:22 (3452): No heartbeat from core client for 30 sec - exiting 06:01:23 (3452): No heartbeat from core client for 30 sec - exiting 06:01:24 (3452): No heartbeat from core client for 30 sec - exiting 06:01:26 (3452): No heartbeat from core client for 30 sec - exiting 06:01:27 (3452): No heartbeat from core client for 30 sec - exiting 06:01:28 (3452): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:50:22 (4408): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:51:30 (1084): No heartbeat from core client for 30 sec - exiting 09:51:31 (1084): No heartbeat from core client for 30 sec - exiting 09:51:32 (1084): No heartbeat from core client for 30 sec - exiting 09:51:33 (1084): No heartbeat from core client for 30 sec - exiting 09:51:35 (1084): No heartbeat from core client for 30 sec - exiting 09:51:36 (1084): No heartbeat from core client for 30 sec - exiting 09:51:37 (1084): No heartbeat from core client for 30 sec - exiting 09:51:38 (1084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4196, iMonCtr=2 Model crash detected, will try to restart..G lobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3860, iMonCtr=2 Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:37:20 (4008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:38:13 (4988): No heartbeat from core client for 30 sec - exiting 18:38:14 (4988): No heartbeat from core client for 30 sec - exiting 18:38:15 (4988): No heartbeat from core client for 30 sec - exiting 18:38:16 (4988): No heartbeat from core client for 30 sec - exiting 18:38:17 (4988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... GSuspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 13:38:31 (3656): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:01:18 (3568): No heartbeat from core client for 30 sec - exiting 06:01:19 (3568): No heartbeat from core client for 30 sec - exiting 06:01:20 (3568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:06:47 (3676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID15:07:35 (4780): No heartbeat from core client for 30 sec - exiting 15:07:36 (4780): No heartbeat from core client for 30 sec - exiting 15:07:37 (4780): No heartbeat from core client for 30 sec - exiting 15:07:39 (4780): No heartbeat from core client for 30 sec - exiting 15:07:40 (4780): No heartbeat from core client for 30 sec - exiting 15:07:41 (4780): No heartbeat from core client for 30 sec - exiting 15:07:42 (4780): No heartbeat from core client for 30 sec - exiting 15:07:43 (4780): No heartbeat from core client for 30 sec - exiting 15:07:44 (4780): No heartbeat from core client for 30 sec - exiting 15:07:45 (4780): No heartbeat from core client for 30 sec - exiting 15:07:46 (4780): No heartbeat from core client for 30 sec - exiting 15:07:47 (4780): No heartbeat from core client for 30 sec - exiting 15:07:48 (4780): No heartbeat from core client for 30 sec - exiting 15:07:50 (4780): No heartbeat from core client for 30 sec - exiting 15:07:51 (4780): No heartbeat from core client for 30 sec - exiting 15:07:52 (4780): No heartbeat from core client for 30 sec - exiting 15:07:53 (4780): No heartbeat from core client for 30 sec - exiting 15:07:54 (4780): No heartbeat from core client for 30 sec - exiting 15:07:55 (4780): No heartbeat from core client for 30 sec - exiting 15:07:56 (4780): No heartbeat from core client for 30 sec - exiting 15:07:57 (4780): No heartbeat from core client for 30 sec - exiting 15:07:58 (4780): No heartbeat from core client for 30 sec - exiting 15:07:59 (4780): No heartbeat from core client for 30 sec - exiting 15:08:01 (4780): No heartbeat from core client for 30 sec - exiting 15:08:02 (4780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:48:29 (3700): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 21:41:36 (4516): No heartbeat from core client for 30 sec - exiting 21:41:37 (4516): No heartbeat from core client for 30 sec - exiting 21:41:38 (4516): No heartbeat from core client for 30 sec - exiting 21:41:39 (4516): No heartbeat from core client for 30 sec - exiting 21:41:40 (4516): No heartbeat from core client for 30 sec - exiting 21:41:41 (4516): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Dec 2011 12:47:44 | 1088462 | 13717724 | hadam3p_pnw_6zsh_2004_1_007594969_0 | 69,216 | 216,656 | 3.1301 |
10 Dec 2011 09:46:17 | 1088462 | 13717724 | hadam3p_pnw_6zsh_2004_1_007594969_0 | 57,696 | 181,993 | 3.1543 |
09 Dec 2011 04:31:38 | 1088462 | 13717724 | hadam3p_pnw_6zsh_2004_1_007594969_0 | 46,176 | 145,202 | 3.1445 |
08 Dec 2011 07:47:55 | 1088462 | 13717724 | hadam3p_pnw_6zsh_2004_1_007594969_0 | 34,656 | 108,205 | 3.1223 |
07 Dec 2011 08:51:16 | 1088462 | 13717724 | hadam3p_pnw_6zsh_2004_1_007594969_0 | 23,158 | 72,892 | 3.1476 |
07 Dec 2011 07:50:49 | 1088462 | 13717724 | hadam3p_pnw_6zsh_2004_1_007594969_0 | 23,136 | 72,095 | 3.1161 |
06 Dec 2011 10:30:28 | 1088462 | 13717724 | hadam3p_pnw_6zsh_2004_1_007594969_0 | 11,616 | 35,895 | 3.0901 |
©2024 cpdn.org