Name | hadam3p_eu_2rp7_1996_1_008160811_0 |
Workunit | 8315935 |
Created | 21 Aug 2012, 0:27:05 UTC |
Sent | 25 Aug 2012, 16:39:09 UTC |
Report deadline | 7 Aug 2013, 21:59:09 UTC |
Received | 5 Oct 2012, 6:06:52 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1154819 |
Run time | 1 days 13 hours 29 min 13 sec |
CPU time | 20 hours 46 min 19 sec |
Validate state | Invalid |
Credit | 399.11 |
Device peak FLOPS | 1.96 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.31</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3644, selfPID=3644, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5140, selfPID=5912, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2308, selfPID=4024, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1604, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5284, selfPID=4156, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5860, selfPID=4108, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4456, selfPID=2916, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4528, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3416, selfPID=1500, iMonCtr=1 Model crash detected, will try to restart... 
22:37:47 (3532): No heartbeat from core client for 30 sec - exiting 22:37:49 (3532): No heartbeat from core client for 30 sec - exiting 22:37:50 (3532): No heartbeat from core client for 30 sec - exiting 22:37:51 (3532): No heartbeat from core client for 30 sec - exiting 22:37:52 (3532): No heartbeat from core client for 30 sec - exiting 22:37:53 (3532): No heartbeat from core client for 30 sec - exiting 22:37:54 (3532): No heartbeat from core client for 30 sec - exiting 22:37:55 (3532): No heartbeat from core client for 30 sec - exiting 22:37:56 (3532): No heartbeat from core client for 30 sec - exiting 22:37:58 (3532): No heartbeat from core client for 30 sec - exiting 22:37:59 (3532): No heartbeat from core client for 30 sec - exiting 22:38:00 (3532): No heartbeat from core client for 30 sec - exiting 22:38:01 (3532): No heartbeat from core client for 30 sec - exiting 22:38:02 (3532): No heartbeat from core client for 30 sec - exiting 22:38:03 (3532): No heartbeat from core client for 30 sec - exiting 22:38:04 (3532): No heartbeat from core client for 30 sec - exiting 22:38:05 (3532): No heartbeat from core client for 30 sec - exiting 22:38:06 (3532): No heartbeat from core client for 30 sec - exiting 22:38:08 (3532): No heartbeat from core client for 30 sec - exiting 22:38:09 (3532): No heartbeat from core client for 30 sec - exiting 22:38:10 (3532): No heartbeat from core client for 30 sec - exiting 22:38:11 (3532): No heartbeat from core client for 30 sec - exiting 22:38:12 (3532): No heartbeat from core client for 30 sec - exiting 22:38:13 (3532): No heartbeat from core client for 30 sec - exiting 22:38:14 (3532): No heartbeat from core client for 30 sec - exiting 22:38:15 (3532): No heartbeat from core client for 30 sec - exiting 22:38:16 (3532): No heartbeat from core client for 30 sec - exiting 22:38:18 (3532): No heartbeat from core client for 30 sec - exiting 22:38:19 (3532): No heartbeat from core client for 30 sec - exiting 22:38:20 (3532): No 
heartbeat from core client for 30 sec - exiting 22:38:21 (3532): No heartbeat from core client for 30 sec - exiting 22:38:22 (3532): No heartbeat from core client for 30 sec - exiting 22:38:23 (3532): No heartbeat from core client for 30 sec - exiting 22:38:24 (3532): No heartbeat from core client for 30 sec - exiting 22:38:26 (3532): No heartbeat from core client for 30 sec - exiting 22:38:27 (3532): No heartbeat from core client for 30 sec - exiting 22:38:28 (3532): No heartbeat from core client for 30 sec - exiting 22:38:29 (3532): No heartbeat from core client for 30 sec - exiting 22:38:30 (3532): No heartbeat from core client for 30 sec - exiting 22:38:31 (3532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:11:22 (3992): No heartbeat from core client for 30 sec - exiting 10:11:23 (3992): No heartbeat from core client for 30 sec - exiting 10:11:24 (3992): No heartbeat from core client for 30 sec - exiting 10:11:25 (3992): No heartbeat from core client for 30 sec - exiting 10:11:27 (3992): No heartbeat from core client for 30 sec - exiting 10:11:28 (3992): No heartbeat from core client for 30 sec - exiting 10:11:29 (3992): No heartbeat from core client for 30 sec - exiting 10:11:30 (3992): No heartbeat from core client for 30 sec - exiting 10:11:31 (3992): No heartbeat from core client for 30 sec - exiting 10:11:32 (3992): No heartbeat from core client for 30 sec - exiting 10:11:33 (3992): No heartbeat from core client for 30 sec - exiting 10:11:34 (3992): No heartbeat from core client for 30 sec - exiting 10:11:35 (3992): No heartbeat from core client for 30 sec - exiting 10:11:36 (3992): No heartbeat from core client for 30 sec - exiting 10:11:37 (3992): No heartbeat from core client for 30 sec - exiting 10:11:39 (3992): No heartbeat from core client for 30 sec - exiting 10:11:40 (3992): No heartbeat from core client for 30 sec - exiting 10:11:41 (3992): No heartbeat from core client for 30 sec - exiting 
10:11:42 (3992): No heartbeat from core client for 30 sec - exiting 10:11:43 (3992): No heartbeat from core client for 30 sec - exiting 10:11:44 (3992): No heartbeat from core client for 30 sec - exiting 10:11:45 (3992): No heartbeat from core client for 30 sec - exiting 10:11:46 (3992): No heartbeat from core client for 30 sec - exiting 10:11:47 (3992): No heartbeat from core client for 30 sec - exiting 10:11:48 (3992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:35:02 (4660): No heartbeat from core client for 30 sec - exiting 20:35:03 (4660): No heartbeat from core client for 30 sec - exiting 20:35:04 (4660): No heartbeat from core client for 30 sec - exiting 20:35:05 (4660): No heartbeat from core client for 30 sec - exiting 20:35:06 (4660): No heartbeat from core client for 30 sec - exiting 20:35:07 (4660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6740, selfPID=6740, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6120, selfPID=3308, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5744, selfPID=4100, iMonCtr=1 Model crash detected, will try to restart... 
CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7032, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2408, selfPID=2408, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1852, selfPID=1852, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2920, selfPID=2920, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6332, selfPID=2724, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 22:39:14 (1576): No heartbeat from core client for 30 sec - exiting 22:39:16 (1576): No heartbeat from core client for 30 sec - exiting 22:39:17 (1576): No heartbeat from core client for 30 sec - exiting 22:39:18 (1576): No heartbeat from core client for 30 sec - exiting 22:39:19 (1576): No heartbeat from core client for 30 sec - exiting 22:39:20 (1576): No heartbeat from core client for 30 sec - exiting 22:39:21 (1576): No heartbeat from core client for 30 sec - exiting 22:39:22 (1576): No heartbeat from core client for 30 sec - exiting 22:39:23 (1576): No heartbeat from core client for 30 sec - exiting 22:39:24 (1576): No heartbeat from core client for 30 sec - exiting 22:39:25 (1576): No heartbeat from core client for 30 sec - exiting 22:39:27 (1576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7880, selfPID=7880, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2436, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6472, selfPID=6472, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... G </stderr_txt> ]]> |
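
The Exit status row above shows the same code twice: as a signed decimal (-226) and as a hexadecimal pattern (0xFFFFFF1E). A minimal sketch of how the two forms relate, assuming the page prints the 32-bit two's-complement pattern of the signed value (the helper names `to_unsigned32` and `to_signed32` are hypothetical and not part of BOINC):

```python
# Sketch only: assumes a 32-bit two's-complement representation,
# which is consistent with the -226 / 0xFFFFFF1E pair shown above.

def to_unsigned32(value: int) -> int:
    """Return the 32-bit two's-complement bit pattern of a signed integer."""
    return value & 0xFFFFFFFF


def to_signed32(pattern: int) -> int:
    """Interpret a 32-bit pattern as a signed integer."""
    pattern &= 0xFFFFFFFF
    # If the sign bit is set, subtract 2**32 to recover the negative value.
    return pattern - 0x100000000 if pattern & 0x80000000 else pattern


if __name__ == "__main__":
    exit_status = -226  # value taken from the Exit status row
    print(f"0x{to_unsigned32(exit_status):08X}")
    print(to_signed32(0xFFFFFF1E))
```

Run as a script, this prints `0xFFFFFF1E` followed by `-226`, matching both forms in the table.
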
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
28 Sep 2012 20:27:21 | 1154819 | 15161404 | hadam3p_eu_2rp7_1996_1_008160811_0 | 23,136 | 63,994 | 2.7660 |
08 Sep 2012 19:15:11 | 1154819 | 15161404 | hadam3p_eu_2rp7_1996_1_008160811_0 | 11,616 | 32,422 | 2.7912 |
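
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep reached. That reading is an assumption, but it reproduces both rows. A short sketch that recomputes the column from the values above (the function name `seconds_per_timestep` and the dictionary keys are illustrative only):

```python
# Sketch only: assumes Average (sec/TS) = CPU Time (sec) / Timestep.
# All numbers are copied from the two trickle rows above.

trickles = [
    {"time_sent": "28 Sep 2012 20:27:21", "timestep": 23136,
     "cpu_time_sec": 63994, "reported_avg": 2.7660},
    {"time_sent": "08 Sep 2012 19:15:11", "timestep": 11616,
     "cpu_time_sec": 32422, "reported_avg": 2.7912},
]


def seconds_per_timestep(cpu_time_sec: float, timestep: int) -> float:
    """Average CPU seconds spent per model timestep."""
    return cpu_time_sec / timestep


if __name__ == "__main__":
    for t in trickles:
        avg = seconds_per_timestep(t["cpu_time_sec"], t["timestep"])
        # Compare the recomputed value with the one shown on the page.
        print(f'{t["time_sent"]}: computed {avg:.4f}, reported {t["reported_avg"]:.4f}')
```

Under this assumption, both recomputed values agree with the reported column to four decimal places.
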