Name | hadam3p_eu_a66p_1986_1_007790641_1 |
Workunit | 7945750 |
Created | 24 Mar 2012, 10:25:13 UTC |
Sent | 24 Mar 2012, 10:25:47 UTC |
Report deadline | 6 Mar 2013, 15:45:47 UTC |
Received | 20 Jul 2012, 13:51:32 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1094400 |
Run time | 2 days 15 hours 59 min 8 sec |
CPU time | 2 days 12 hours 13 min 31 sec |
Validate state | Invalid |
Credit | 1,194.02 |
Device peak FLOPS | 2.79 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> G16:50:21 (4408): Can't acquire lockfile (32) - waiting 35s CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6740, iMController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5812, selfPID=4044, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1892, selfPID=836, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4876, selfPID=4916, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5968, selfPID=5144, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5412, selfPID=5412, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5636, selfPID=5636, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4668, selfPID=4668, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5384, selfPID=5384, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional WorkerCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:02:57 (4540): Can't acquire lockfile (32) - waiting 35s Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4872, selfPID=3712, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1248, selfPID=5384, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5588, selfPID=5588, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3224, selfPID=3224, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4708, selfPID=4560, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5860, selfPID=4132, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3920, selfPID=3920, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6140, selfPID=6140, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4660, selfPID=4660, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5204, selfPID=5476, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5932, selfPID=4756, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6788, selfPID=6452, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2628, selfPID=5196, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: </stderr_txt> ]]> |
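
The Exit status row gives the same code in two forms, signed decimal (-226) and 32-bit hexadecimal (0xFFFFFF1E). The following is a minimal sketch of how the two relate under two's-complement encoding; the helper names `to_unsigned32_hex` and `from_unsigned32` are hypothetical and not part of BOINC.

```python
def to_unsigned32_hex(code: int) -> str:
    """Format a signed exit code as the 32-bit two's-complement hex string."""
    # Masking with 0xFFFFFFFF maps negative values into the range 0..2**32-1.
    return f"0x{code & 0xFFFFFFFF:08X}"


def from_unsigned32(value: int) -> int:
    """Recover the signed value from its 32-bit unsigned representation."""
    # If the top bit is set, the value stands for a negative number.
    return value - (1 << 32) if value & 0x80000000 else value


if __name__ == "__main__":
    print(to_unsigned32_hex(-226))      # hex form of the reported exit status
    print(from_unsigned32(0xFFFFFF1E))  # signed form of the reported hex value
```
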
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
19 Jul 2012 12:31:59 | 1094400 | 14312594 | hadam3p_eu_a66p_1986_1_007790641_1 | 69,216 | 205,787 | 2.9731 |
29 Jun 2012 18:48:31 | 1094400 | 14312594 | hadam3p_eu_a66p_1986_1_007790641_1 | 57,700 | 172,241 | 2.9851 |
25 Jun 2012 14:41:45 | 1094400 | 14312594 | hadam3p_eu_a66p_1986_1_007790641_1 | 57,696 | 171,786 | 2.9774 |
18 Jun 2012 11:39:11 | 1094400 | 14312594 | hadam3p_eu_a66p_1986_1_007790641_1 | 46,176 | 137,284 | 2.9731 |
09 Jun 2012 10:18:16 | 1094400 | 14312594 | hadam3p_eu_a66p_1986_1_007790641_1 | 34,656 | 102,631 | 2.9614 |
08 May 2012 17:06:32 | 1094400 | 14312594 | hadam3p_eu_a66p_1986_1_007790641_1 | 23,146 | 68,552 | 2.9617 |
07 May 2012 07:46:42 | 1094400 | 14312594 | hadam3p_eu_a66p_1986_1_007790641_1 | 23,140 | 68,094 | 2.9427 |
06 May 2012 12:17:23 | 1094400 | 14312594 | hadam3p_eu_a66p_1986_1_007790641_1 | 23,136 | 67,646 | 2.9238 |
17 Apr 2012 16:48:33 | 1094400 | 14312594 | hadam3p_eu_a66p_1986_1_007790641_1 | 11,616 | 33,138 | 2.8528 |
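
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that assumption against three rows copied from the table above; the function name `seconds_per_timestep` and the `rows` structure are illustrative only.

```python
def seconds_per_timestep(cpu_time_sec: str, timestep: str) -> float:
    """Divide cumulative CPU seconds by timesteps, accepting '205,787'-style strings."""
    cpu = int(cpu_time_sec.replace(",", ""))
    steps = int(timestep.replace(",", ""))
    return cpu / steps


# (timestep, CPU time in seconds, reported average) taken from the trickle table
rows = [
    ("69,216", "205,787", 2.9731),
    ("34,656", "102,631", 2.9614),
    ("11,616", "33,138", 2.8528),
]

if __name__ == "__main__":
    for timestep, cpu_time, reported in rows:
        computed = seconds_per_timestep(cpu_time, timestep)
        print(f"{timestep:>7} timesteps: computed {computed:.4f}, reported {reported}")
```

For these rows the computed values agree with the reported averages to within rounding at four decimal places.
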
©2024 climateprediction.net