Name | hadam3prm3pm2t_eu_jpw6_2002_1_009823825_0 |
Workunit | 9879751 |
Created | 7 May 2015, 14:29:43 UTC |
Sent | 30 Oct 2015, 19:41:04 UTC |
Report deadline | 12 Oct 2016, 1:01:04 UTC |
Received | 16 Nov 2015, 9:54:21 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1377973 |
Run time | 11 days 13 hours 33 min 24 sec |
CPU time | 8 days 4 hours 15 min 27 sec |
Validate state | Invalid |
Credit | 3,200.28 |
Device peak FLOPS | 2.06 GFLOPS |
Application version | UK Met Office HadAM3P and HadRM3P model with MOSES II and TRIFFID Europe v7.01 i686-pc-linux-gnu |
Stderr |
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1077, selfPID=1077, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1077, selfPID=1078, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13174, selfPID=13077, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13594, selfPID=13542, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=20277, selfPID=20122, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
07:26:19 (21029): No heartbeat from client for 30 sec - exiting
07:26:19 (21029): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18208, selfPID=18138, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=23491, selfPID=23412, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
17:14:18 (24640): No heartbeat from client for 30 sec - exiting
17:14:18 (24640): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:00:45 (25045): No heartbeat from client for 30 sec - exiting
02:00:46 (25045): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:00:47 (25045): No heartbeat from client for 30 sec - exiting
02:00:47 (25045): timer handler: client dead, exiting
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:12:28 (27697): No heartbeat from client for 30 sec - exiting
18:12:37 (27697): timer handler: client dead, exiting
18:12:38 (27697): No heartbeat from client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:12:44 (27697): timer handler: client dead, exiting
18:12:45 (27697): No heartbeat from client for 30 sec - exiting
18:12:45 (27697): timer handler: client dead, exiting
18:12:46 (27697): No heartbeat from client for 30 sec - exiting
18:13:26 (27697): timer handler: client dead, exiting
18:13:27 (27697):20:16:49 (28043): No heartbeat from client for 30 sec - exiting
20:16:51 (28043): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:01:48 (28160): No heartbeat from client for 30 sec - exiting
04:01:52 (28160): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:01:53 (28160): No heartbeat from client for 30 sec - exiting
04:01:53 (28160): timer handler: client dead, exiting
06:11:25 (28324): No heartbeat from client for 30 sec - exiting
06:11:26 (28324): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
execv: No such file or directory
</stderr_txt>
]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
13 Nov 2015 20:35:31 | 1377973 | 18410065 | hadam3prm3pm2t_eu_jpw6_2002_1_009823825_0 | 46,379 | 573,871 | 12.3735 |
11 Nov 2015 01:13:29 | 1377973 | 18410065 | hadam3prm3pm2t_eu_jpw6_2002_1_009823825_0 | 34,859 | 429,178 | 12.3118 |
08 Nov 2015 09:20:20 | 1377973 | 18410065 | hadam3prm3pm2t_eu_jpw6_2002_1_009823825_0 | 23,339 | 282,969 | 12.1243 |
05 Nov 2015 14:23:55 | 1377973 | 18410065 | hadam3prm3pm2t_eu_jpw6_2002_1_009823825_0 | 11,819 | 137,579 | 11.6405 |
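
The Average (sec/TS) column in the trickle rows above appears to be the cumulative CPU time divided by the timestep reached. The page itself does not state this definition, so the following is only a minimal Python sketch under that assumption; the names `TRICKLES` and `sec_per_timestep` are hypothetical and the figures are copied from the four rows.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
# Assumption (not stated on the page): Average = CPU Time / Timestep.
TRICKLES = [
    (46379, 573871, 12.3735),
    (34859, 429178, 12.3118),
    (23339, 282969, 12.1243),
    (11819, 137579, 11.6405),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Return cumulative CPU seconds spent per model timestep."""
    return cpu_time_sec / timestep


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in TRICKLES:
        computed = sec_per_timestep(cpu_time_sec, timestep)
        # Print the recomputed value next to the one shown in the table.
        print(f"timestep {timestep}: computed {computed:.4f}, reported {reported}")
```

If the assumption holds, each recomputed value should agree with the reported one to four decimal places.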
©2024 cpdn.org