Name | hadam3prm3pm2t_eu_ifr1_2002_1_009598490_2 |
Workunit | 9672824 |
Created | 29 Oct 2015, 9:34:47 UTC |
Sent | 29 Oct 2015, 11:46:53 UTC |
Report deadline | 10 Oct 2016, 17:06:53 UTC |
Received | 16 Nov 2015, 9:54:21 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1377973 |
Run time | 12 days 18 hours 13 min 54 sec |
CPU time | 9 days 2 hours 28 min 32 sec |
Validate state | Invalid |
Credit | 3,995.19 |
Device peak FLOPS | 2.06 GFLOPS |
Application version | UK Met Office HadAM3P and HadRM3P model with MOSES II and TRIFFID Europe v7.01 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 17:22:25 (25974): No heartbeat from client for 30 sec - exiting 17:22:25 (25974): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 3 received, exiting... 10:16:06 (1017): called boinc_finish Signal 3 received, exiting... 10:16:06 (1016): called boinc_finish 10:16:39 (1166): No heartbeat from client for 30 sec - exiting 10:16:39 (1166): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13134, selfPID=13067, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13570, selfPID=13538, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=20253, selfPID=20118, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 07:26:19 (21025): No heartbeat from client for 30 sec - exiting 07:26:19 (21025): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18184, selfPID=18134, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=23483, selfPID=23405, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 17:14:18 (24636): No heartbeat from client for 30 sec - exiting 17:14:18 (24636): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:00:45 (25041): No heartbeat from client for 30 sec - exiting 02:00:47 (25041): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:12:28 (27693): No heartbeat from client for 30 sec - exiting 18:12:44 (27693): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:12:45 (27693): No heartbeat from client for 30 sec - exiting 18:13:26 (27693): timer handler: client dead, exiting 18:13:27 (27693): No heartbeat from client for 30 sec - exiting 18:13:32 (27693): timer handler: client dead, exiting 18:13:33 (27693): No heartbeat from client for 30 sec - exiting 18:13:38 (27693): timer handler: client dead, exiting 18:13:39 (27693): No heartbeat from client for 30 sec - exiting 18:13:41 (27693): timer handler: client dead, exiting 18:13:42 (27693): No heartbeat from client for 30 sec - exiting 18:13:43 (27693): timer handler: client dead, exiting 18:13:44 (27693): No heartbeat from client for 30 sec - exiting 18:13:46 (27693): timer handler: client dead, exiting 18:13:47 (27693): No heartbeat from client for 30 sec - exiting 18:13:49 (27693): timer handler: client dead, exiting 18:14:20 (27693): No heartbeat from client for 30 sec - exiting 18:14:20 (27693): timer handler: client dead, exiting 18:14:21 (27693): No heartbeat from client for 30 sec - exiting 18:14:21 (27693): timer handler: client dead, exiting 18:14:22 (27693): No heartbeat from client for 30 sec - exiting 18:14:22 (27693): timer handler: client dead, exiting 18:14:23 (27693): No heartbeat from client for 30 sec - 
exiting 18:14:24 (27693): timer handler: client dead, exiting 18:14:25 (27693): No heartbeat from client for 30 sec - exiting 18:14:26 (27693): timer handler: client dead, exiting 18:14:27 (27693): No heartbeat from client for 30 sec - exiting 18:14:28 (27693): timer handler: client dead, exiting 18:14:29 (27693): No heartbeat from client for 30 sec - exiting 18:14:29 (27693): timer handler: client dead, exiting 18:14:30 (27693): No heartbeat from client for 30 sec - exiting 18:14:31 (27693): timer handler: client dead, exiting 20:16:50 (28035): No heartbeat from client for 30 sec - exiting 20:16:51 (28035): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:01:48 (28152): No heartbeat from client for 30 sec - exiting 04:01:52 (28152): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:01:53 (28152): No heartbeat from client for 30 sec - exiting 04:01:53 (28152): timer handler: client dead, exiting 06:11:25 (28316): No heartbeat from client for 30 sec - exiting 06:11:27 (28316): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:11:28 (28316):execv: No such file or directory </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
14 Nov 2015 18:46:37 | 1377973 | 19041413 | hadam3prm3pm2t_eu_ifr1_2002_1_009598490_2 | 57,899 | 702,151 | 12.1272 |
11 Nov 2015 23:06:54 | 1377973 | 19041413 | hadam3prm3pm2t_eu_ifr1_2002_1_009598490_2 | 46,379 | 558,766 | 12.0478 |
09 Nov 2015 04:58:43 | 1377973 | 19041413 | hadam3prm3pm2t_eu_ifr1_2002_1_009598490_2 | 34,859 | 409,166 | 11.7377 |
06 Nov 2015 08:19:10 | 1377973 | 19041413 | hadam3prm3pm2t_eu_ifr1_2002_1_009598490_2 | 23,339 | 263,831 | 11.3043 |
05 Nov 2015 14:20:39 | 1377973 | 19041413 | hadam3prm3pm2t_eu_ifr1_2002_1_009598490_2 | 11,819 | 122,266 | 10.3449 |
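
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against the five rows above and also derives the rate for each interval between consecutive trickles, which the page itself does not report. The function names `average_sec_per_timestep` and `interval_rates` are hypothetical, and the formula is an assumption inferred from the numbers rather than something the page states.

```python
# Trickle rows copied from the table above:
# (timestep, cumulative CPU time in seconds, reported average sec/TS).
trickles = [
    (57899, 702151, 12.1272),
    (46379, 558766, 12.0478),
    (34859, 409166, 11.7377),
    (23339, 263831, 11.3043),
    (11819, 122266, 10.3449),
]


def average_sec_per_timestep(timestep, cpu_time_sec):
    """Cumulative average: assumed to be CPU time / timestep."""
    return cpu_time_sec / timestep


def interval_rates(rows):
    """Seconds per timestep for each interval between consecutive trickles.

    Returns a list of (end_timestep, rate) pairs in ascending timestep order.
    """
    ordered = sorted(rows)  # sort by timestep, oldest trickle first
    rates = []
    for (ts0, cpu0, _), (ts1, cpu1, _) in zip(ordered, ordered[1:]):
        rates.append((ts1, (cpu1 - cpu0) / (ts1 - ts0)))
    return rates


if __name__ == "__main__":
    for ts, cpu, reported in trickles:
        computed = average_sec_per_timestep(ts, cpu)
        print(f"timestep {ts}: computed {computed:.4f}, reported {reported}")
    for ts, rate in interval_rates(trickles):
        print(f"interval ending at timestep {ts}: {rate:.4f} sec/TS")
```

The per-interval figures are useful because a cumulative average hides recent changes: a task that slows down late in its run shifts the cumulative column only slightly.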
©2024 cpdn.org