Name | hadam3prm3pm2t_eu_pq4d_2002_1_009830120_1 |
Workunit | 9886046 |
Created | 30 May 2015, 0:38:18 UTC |
Sent | 29 Oct 2015, 8:15:47 UTC |
Report deadline | 10 Oct 2016, 13:35:47 UTC |
Received | 16 Nov 2015, 9:54:21 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1377973 |
Run time | 12 days 21 hours 53 min 35 sec |
CPU time | 9 days 3 hours 58 min 58 sec |
Validate state | Invalid |
Credit | 3,995.19 |
Device peak FLOPS | 2.06 GFLOPS |
Application version | UK Met Office HadAM3P and HadRM3P model with MOSES II and TRIFFID Europe v7.01 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> 17:22:26 (25211): No heartbeat from client for 30 sec - exiting 17:22:26 (25211): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1000, selfPID=1000, iMonCtr=1 Signal 3 received, exiting... 10:16:05 (1001): called boinc_finish 10:16:38 (1164): No heartbeat from client for 30 sec - exiting 10:16:38 (1164): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13158, selfPID=13065, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13586, selfPID=13537, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20117, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=21106, selfPID=21106, iMonCtr=1 Signal 3 received, exiting... 07:25:38 (21107): called boinc_finish Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15047, selfPID=15048, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15047, selfPID=15047, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18218, selfPID=18133, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=23467, selfPID=23404, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 17:14:18 (24635): No heartbeat from client for 30 sec - exiting 17:14:18 (24635): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:00:45 (25040): No heartbeat from client for 30 sec - exiting 02:00:46 (25040): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:00:47 (25040): No heartbeat from client for 30 sec - exiting 02:00:47 (25040): timer handler: client dead, exiting CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:12:28 (27692): No heartbeat from client for 30 sec - exiting 18:12:44 (27692): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 
18:12:45 (27692): No heartbeat from client for 30 sec - exiting 18:12:45 (27692): timer handler: client dead, exiting 18:12:46 (27692): No heartbeat from client for 30 sec - exiting 18:13:26 (27692): timer handler: client dead, exiting 18:13:27 (27692):18:14:34 (28033): No heartbeat from client for 30 sec - exiting 18:14:34 (28033): timer handler: client dead, exiting 18:14:35 (28033): No heartbeat from client for 30 sec - exiting 18:14:35 (28033): timer handler: client dead, exiting 18:14:36 (28033): No heartbeat from client for 30 sec - exiting 18:14:36 (28033): timer handler: client dead, exiting 18:14:37 (28033): No heartbeat from client for 30 sec - exiting 18:14:37 (28033): timer handler: client dead, exiting 18:14:38 (28033): No heartbeat from client for 30 sec - exiting 18:14:40 (28033): timer handler: client dead, exiting 18:14:41 (28033): No heartbeat from client for 30 sec - exiting 18:14:41 (28033): timer handler: client dead, exiting 18:14:42 (28033): No heartbeat from client for 30 sec - exiting 18:14:44 (28033): timer handler: client dead, exiting 18:14:45 (28033): No heartbeat from client for 30 sec - exiting 18:14:45 (28033): timer handler: client dead, exiting 18:14:46 (28033): No heartbeat from client for 30 sec - exiting 18:14:50 (28033): timer handler: client dead, exiting 18:14:51 (28033): No heartbeat from client for 30 sec - exiting 18:14:53 (28033): timer handler: client dead, exiting 18:14:54 (28033): No heartbeat from client for 30 sec - exiting 18:14:56 (28033): timer handler: client dead, exiting 18:14:57 (28033): No heartbeat from client for 30 sec - exiting 18:15:02 (28033): timer handler: client dead, exiting 18:15:03 (28033): No heartbeat from client for 30 sec - exiting 18:15:06 (28033): timer handler: client dead, exiting 18:15:07 (28033): No heartbeat from client for 30 sec - exiting 18:15:08 (28033): timer handler: client dead, exiting 18:15:09 (28033): No heartbeat from client for 30 sec - exiting 18:15:09 (28033): timer 
handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:16:49 (28071): No heartbeat from client for 30 sec - exiting 20:16:51 (28071): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:01:48 (28182): No heartbeat from client for 30 sec - exiting 04:01:52 (28182): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:01:53 (28182): No heartbeat from client for 30 sec - exiting 04:01:53 (28182): timer handler: client dead, exiting 04:01:54 (28182):00:03:22 (28346): No heartbeat from client for 30 sec - exiting 00:03:23 (28346): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:11:25 (28736): No heartbeat from client for 30 sec - exiting 06:11:27 (28736): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... execv: No such file or directory </stderr_txt> ]]> |
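
The record above gives the exit status as 22, 0x16 and -234, and gives the run time and CPU time as day/hour/minute strings. The following is a minimal sketch of how those figures relate. It assumes that -234 is simply 22 minus 256, which matches the numbers here but is not stated in the record. The helper name `to_seconds` is hypothetical.

```python
import re

def to_seconds(text):
    """Convert a string like '12 days 21 hours 53 min 35 sec' to seconds."""
    units = {"days": 86400, "hours": 3600, "min": 60, "sec": 1}
    pairs = re.findall(r"(\d+) (days|hours|min|sec)", text)
    return sum(int(number) * units[unit] for number, unit in pairs)

exit_status = 22
exit_hex = hex(exit_status)        # hexadecimal form of the exit status
exit_signed = exit_status - 256    # assumption: the third number is status - 256

run_s = to_seconds("12 days 21 hours 53 min 35 sec")  # "Run time" field
cpu_s = to_seconds("9 days 3 hours 58 min 58 sec")    # "CPU time" field

print(exit_hex, exit_signed)
print(run_s, cpu_s, round(cpu_s / run_s, 2))
```

On these values the CPU time is roughly 71% of the run time.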
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
14 Nov 2015 13:45:50 | 1377973 | 18513761 | hadam3prm3pm2t_eu_pq4d_2002_1_009830120_1 | 57,899 | 697,209 | 12.0418 |
11 Nov 2015 17:41:09 | 1377973 | 18513761 | hadam3prm3pm2t_eu_pq4d_2002_1_009830120_1 | 46,379 | 553,749 | 11.9396 |
08 Nov 2015 20:16:56 | 1377973 | 18513761 | hadam3prm3pm2t_eu_pq4d_2002_1_009830120_1 | 34,859 | 404,544 | 11.6052 |
06 Nov 2015 08:19:10 | 1377973 | 18513761 | hadam3prm3pm2t_eu_pq4d_2002_1_009830120_1 | 23,339 | 257,568 | 11.0359 |
05 Nov 2015 14:20:06 | 1377973 | 18513761 | hadam3prm3pm2t_eu_pq4d_2002_1_009830120_1 | 11,819 | 120,218 | 10.1716 |
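
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep reached. The sketch below checks that reading against the five rows above. The function name and the tolerance are illustrative choices, not part of the CPDN page.

```python
# Rows copied from the trickle table, newest first:
# (timestep, CPU time in seconds, reported average in sec/TS).
trickles = [
    (57899, 697209, 12.0418),
    (46379, 553749, 11.9396),
    (34859, 404544, 11.6052),
    (23339, 257568, 11.0359),
    (11819, 120218, 10.1716),
]

def average_sec_per_timestep(timestep, cpu_seconds):
    """Cumulative CPU seconds spent per model timestep."""
    return cpu_seconds / timestep

for timestep, cpu_seconds, reported in trickles:
    computed = average_sec_per_timestep(timestep, cpu_seconds)
    print(f"{timestep:>6}  computed={computed:.4f}  reported={reported}")

# Number of timesteps between consecutive trickles.
gaps = [newer[0] - older[0] for newer, older in zip(trickles, trickles[1:])]
print(gaps)
```

In this data each trickle arrives 11,520 timesteps after the previous one, and the average rises from about 10.2 to about 12.0 seconds per timestep over the run.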