Name | hadam3prm3pm2t_eu_jlhv_2002_1_010008891_2 |
Workunit | 10008296 |
Created | 14 Sep 2015, 21:52:39 UTC |
Sent | 30 Oct 2015, 3:18:43 UTC |
Report deadline | 11 Oct 2016, 8:38:43 UTC |
Received | 16 Nov 2015, 9:54:21 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1377973 |
Run time | 12 days 0 hours 20 min 34 sec |
CPU time | 8 days 13 hours 31 min 33 sec |
Validate state | Invalid |
Credit | 3,988.57 |
Device peak FLOPS | 2.06 GFLOPS |
Application version | UK Met Office HadAM3P and HadRM3P model with MOSES II and TRIFFID Europe v7.01 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 3 received, exiting... Signal 3 received, exiting... 10:16:08 (1053): called boinc_finish 10:16:08 (1052): called boinc_finish 10:16:39 (1170): No heartbeat from client for 30 sec - exiting 10:16:39 (1170): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13130, selfPID=13073, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13578, selfPID=13540, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=20281, selfPID=20120, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 07:26:19 (21027): No heartbeat from client for 30 sec - exiting 07:26:19 (21027): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18200, selfPID=18136, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=23495, selfPID=23407, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 17:14:18 (24638): No heartbeat from client for 30 sec - exiting 17:14:18 (24638): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:00:45 (25043): No heartbeat from client for 30 sec - exiting 02:00:47 (25043): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:12:30 (27695): No heartbeat from client for 30 sec - exiting 18:12:44 (27695): timer handler: client dead, exiting 18:12:45 (27695): No heartbeat from client for 30 sec - exiting 18:12:45 (27695): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:12:46 (27695): No heartbeat from client for 30 sec - exiting 18:13:26 (27695): timer handler: client dead, exiting 18:13:27 (27695): No heartbeat from client for 30 sec - exiting 18:13:32 (27695): timer handler: client dead, exiting 20:16:50 (28039): No heartbeat from client for 30 sec - exiting 20:16:51 (28039): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:01:48 (28156): No heartbeat from client for 30 sec - exiting 04:01:52 (28156): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:01:53 (28156): No heartbeat from client for 30 sec - exiting 04:01:53 (28156): timer handler: client dead, exiting 06:11:25 (28320): No heartbeat from client for 30 sec - exiting 06:11:26 (28320): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... execv: No such file or directory </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
15 Nov 2015 13:05:34 | 1377973 | 18906974 | hadam3prm3pm2t_eu_jlhv_2002_1_010008891_2 | 57,803 | 701,371 | 12.1338 |
12 Nov 2015 21:02:53 | 1377973 | 18906974 | hadam3prm3pm2t_eu_jlhv_2002_1_010008891_2 | 46,379 | 563,749 | 12.1553 |
10 Nov 2015 02:12:47 | 1377973 | 18906974 | hadam3prm3pm2t_eu_jlhv_2002_1_010008891_2 | 34,859 | 416,140 | 11.9378 |
07 Nov 2015 13:26:50 | 1377973 | 18906974 | hadam3prm3pm2t_eu_jlhv_2002_1_010008891_2 | 23,339 | 272,039 | 11.6560 |
05 Nov 2015 14:22:21 | 1377973 | 18906974 | hadam3prm3pm2t_eu_jlhv_2002_1_010008891_2 | 11,819 | 127,907 | 10.8222 |
©2024 climateprediction.net