Name | hadam3p_pnw_2xfb_1969_1_007176815_0 |
Workunit | 7375097 |
Created | 22 Feb 2011, 11:36:04 UTC |
Sent | 9 Mar 2011, 14:19:47 UTC |
Report deadline | 19 Feb 2012, 19:39:47 UTC |
Received | 23 Mar 2011, 5:24:02 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1124341 |
Run time | 2 days 22 hours 42 min 42 sec |
CPU time | 2 days 20 hours 8 min 31 sec |
Validate state | Invalid |
Credit | 2,004.61 |
Device peak FLOPS | 2.63 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4704, selfPID=4704, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5508, selfPID=5508, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5472, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4168, selfPID=4168, iMonCtr=2 05:37:06 (4716): No heartbeat from core client for 30 sec - exiting 05:37:07 (4716): No heartbeat from core client for 30 sec - exiting 05:37:08 (4716): No heartbeat from core client for 30 sec - exiting 05:37:09 (4716): No heartbeat from core client for 30 sec - exiting 05:37:10 (4716): No heartbeat from core client for 30 sec - exiting 05:37:11 (4716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5040, iMonCtr=2 Model crash detected, will try to restart... 16:12:28 (4436): No heartbeat from core client for 30 sec - exiting 16:12:29 (4436): No heartbeat from core client for 30 sec - exiting 16:12:30 (4436): No heartbeat from core client for 30 sec - exiting 16:12:32 (4436): No heartbeat from core client for 30 sec - exiting 16:12:33 (4436): No heartbeat from core client for 30 sec - exiting 16:12:34 (4436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4776, selfPID=4776, iMonCtr=2 01:03:23 (4568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4284, selfPID=4660, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4548, selfPID=4548, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6120, selfPID=6120, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2640, selfPID=2640, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4976, selfPID=4976, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2216, selfPID=2216, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5404, selfPID=5404, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=780, iMonCtr=2 04:41:55 (4652): No heartbeat from core client for 30 sec - exiting 04:41:56 (4652): No heartbeat from core client for 30 sec - exiting 04:41:57 (4652): No heartbeat from core client for 30 sec - exiting 04:41:58 (4652): No heartbeat from core client for 30 sec - exiting 04:41:59 (4652): No heartbeat from core client for 30 sec - exiting 04:42:00 (4652): No heartbeat from core client for 30 sec - exiting 04:42:01 (4652): No heartbeat from core client for 30 sec - exiting 04:42:02 (4652): No heartbeat from core client for 30 sec - exiting 04:42:04 (4652): No heartbeat from core client for 30 sec - exiting 04:42:05 (4652): No heartbeat from core client for 30 sec - exiting 04:42:06 (4652): No heartbeat from core client for 30 sec - exiting 04:42:07 (4652): No heartbeat from core client for 30 sec - exiting 04:42:08 (4652): No heartbeat from core client for 30 sec - exiting 04:42:09 (4652): No heartbeat from core client for 30 sec - exiting 04:42:10 (4652): No heartbeat from core client for 30 sec - exiting 04:42:11 (4652): No heartbeat from core client for 30 sec - exiting 04:42:12 (4652): No heartbeat from core client for 30 sec - exiting 04:42:13 (4652): No heartbeat from core client for 30 sec - exiting 04:42:15 (4652): No heartbeat from core client for 30 sec - exiting 04:42:16 (4652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:42:17 (4652): No heartbeat from core client for 30 sec - exiting 14:18:01 (4628): No heartbeat from core client for 30 sec - exiting 14:18:02 (4628): No heartbeat from core client for 30 sec - exiting 14:18:03 (4628): No heartbeat from core client for 30 sec - exiting 14:18:04 (4628): No heartbeat from core client for 30 sec - exiting 14:18:05 (4628): No heartbeat from core client for 30 sec - exiting 14:18:06 (4628): No heartbeat from core client for 30 sec - exiting 14:18:08 (4628): No heartbeat from core client for 30 sec - exiting 14:18:09 (4628): No heartbeat from core client for 30 sec - exiting 14:18:10 (4628): No heartbeat from core client for 30 sec - exiting 14:18:11 (4628): No heartbeat from core client for 30 sec - exiting 14:18:12 (4628): No heartbeat from core client for 30 sec - exiting 14:18:13 (4628): No heartbeat from core client for 30 sec - exiting 14:18:14 (4628): No heartbeat from core client for 30 sec - exiting 14:18:15 (4628): No heartbeat from core client for 30 sec - exiting 14:18:16 (4628): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3432, selfPID=3432, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4976, selfPID=4976, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4312, selfPID=4312, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1432, selfPID=1432, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5608, selfPID=5608, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1296, selfPID=1296, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:02:50 (4440): No heartbeat from core client for 30 sec - exiting 16:02:51 (4440): No heartbeat from core client for 30 sec - exiting 16:02:52 (4440): No heartbeat from core client for 30 sec - exiting 16:02:53 (4440): No heartbeat from core client for 30 sec - exiting 16:02:54 (4440): No heartbeat from core client for 30 sec - exiting 16:02:55 (4440): No heartbeat from core client for 30 sec - exiting 16:02:56 (4440): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
23 Mar 2011 01:53:27 | 1124341 | 12616118 | hadam3p_pnw_2xfb_1969_1_007176815_0 | 92,256 | 233,342 | 2.5293 |
22 Mar 2011 18:18:49 | 1124341 | 12616118 | hadam3p_pnw_2xfb_1969_1_007176815_0 | 80,736 | 205,143 | 2.5409 |
20 Mar 2011 18:55:40 | 1124341 | 12616118 | hadam3p_pnw_2xfb_1969_1_007176815_0 | 69,216 | 176,485 | 2.5498 |
20 Mar 2011 03:05:59 | 1124341 | 12616118 | hadam3p_pnw_2xfb_1969_1_007176815_0 | 57,696 | 147,022 | 2.5482 |
18 Mar 2011 16:36:36 | 1124341 | 12616118 | hadam3p_pnw_2xfb_1969_1_007176815_0 | 46,176 | 118,054 | 2.5566 |
15 Mar 2011 14:03:58 | 1124341 | 12616118 | hadam3p_pnw_2xfb_1969_1_007176815_0 | 34,656 | 88,614 | 2.5570 |
15 Mar 2011 00:21:27 | 1124341 | 12616118 | hadam3p_pnw_2xfb_1969_1_007176815_0 | 23,136 | 59,062 | 2.5528 |
13 Mar 2011 19:09:51 | 1124341 | 12616118 | hadam3p_pnw_2xfb_1969_1_007176815_0 | 11,616 | 29,781 | 2.5638 |
©2024 cpdn.org