Name | hadam3prm3pm2t_eu_oftt_2002_1_009604590_0 |
Workunit | 9678924 |
Created | 11 Mar 2015, 15:50:30 UTC |
Sent | 31 Oct 2015, 15:55:48 UTC |
Report deadline | 12 Oct 2016, 21:15:48 UTC |
Received | 16 Nov 2015, 11:57:25 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1377973 |
Run time | 10 days 11 hours 13 min 21 sec |
CPU time | 7 days 8 hours 57 min 40 sec |
Validate state | Invalid |
Credit | 3,193.66 |
Device peak FLOPS | 2.06 GFLOPS |
Application version | UK Met Office HadAM3P and HadRM3P model with MOSES II and TRIFFID Europe v7.01 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1032, selfPID=1032, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1032, selfPID=1033, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13126, selfPID=13059, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13628, selfPID=13550, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=20235, selfPID=20143, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18234, selfPID=18146, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=23463, selfPID=23425, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 17:14:18 (24648): No heartbeat from client for 30 sec - exiting 17:14:18 (24648): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:00:45 (25053): No heartbeat from client for 30 sec - exiting 02:00:47 (25053): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:12:28 (27705): No heartbeat from client for 30 sec - exiting 18:12:43 (27705): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:12:44 (27705): No heartbeat from client for 30 sec - exiting 18:12:44 (27705): timer handler: client dead, exiting 18:12:45 (27705): No heartbeat from client for 30 sec - exiting 18:13:26 (27705): timer handler: client dead, exiting 18:13:27 (27705): No heartbeat from client for 30 sec - exiting 18:13:32 (27705): timer handler: client dead, exiting 18:13:33 (27705): No heartbeat from client for 30 sec - exiting 18:13:38 (27705): timer handler: client dead, exiting 18:13:39 (27705): No heartbeat from client for 30 sec - exiting 18:13:41 (27705): timer handler: client dead, exiting 18:13:42 (27705): No heartbeat from client for 30 sec - exiting 18:13:43 (27705): timer handler: client dead, exiting 18:13:44 (27705): No heartbeat from client for 30 sec - exiting 18:13:46 (27705): timer handler: client dead, exiting 18:13:47 (27705): No heartbeat from client for 30 sec - exiting 18:13:49 (27705): timer handler: client dead, exiting 18:14:20 (27705): No heartbeat from client for 30 sec - exiting 18:14:20 (27705): timer handler: client dead, exiting 18:14:21 (27705): No heartbeat from client for 30 sec - exiting 18:14:21 (27705): timer handler: client dead, exiting 20:16:49 (28059): No heartbeat from client for 30 sec - exiting 20:16:50 (28059): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:16:51 (28059): No heartbeat from client for 30 sec - exiting 20:16:51 (28059): timer handler: client dead, exiting 04:01:48 (28176): No heartbeat from client for 30 sec - exiting 04:01:52 (28176): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:01:53 (28176): No heartbeat from client for 30 sec - exiting 04:01:53 (28176): timer handler: client dead, exiting 04:01:54 (28176): No heartbeat from client for 30 sec - exiting 04:01:55 (28176): timer handler: client dead, exiting 06:11:25 (28340): No heartbeat from client for 30 sec - exiting 06:11:27 (28340): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... execv: No such file or directory </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
15 Nov 2015 05:31:26 | 1377973 | 18075256 | hadam3prm3pm2t_eu_oftt_2002_1_009604590_0 | 46,283 | 575,050 | 12.4246 |
12 Nov 2015 15:01:52 | 1377973 | 18075256 | hadam3prm3pm2t_eu_oftt_2002_1_009604590_0 | 34,859 | 440,815 | 12.6457 |
09 Nov 2015 18:31:36 | 1377973 | 18075256 | hadam3prm3pm2t_eu_oftt_2002_1_009604590_0 | 23,339 | 291,802 | 12.5028 |
06 Nov 2015 08:19:10 | 1377973 | 18075256 | hadam3prm3pm2t_eu_oftt_2002_1_009604590_0 | 11,819 | 148,235 | 12.5421 |
©2024 cpdn.org