Name | hadam3p_pnw_bzo4_1987_1_007932531_1 |
Workunit | 8087643 |
Created | 1 May 2012, 20:25:50 UTC |
Sent | 1 May 2012, 20:42:26 UTC |
Report deadline | 14 Apr 2013, 2:02:26 UTC |
Received | 6 Jun 2012, 19:15:13 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1213518 |
Run time | 2 days 3 hours 52 min 43 sec |
CPU time | 1 days 15 hours 19 min 15 sec |
Validate state | Invalid |
Credit | 502.72 |
Device peak FLOPS | 2.13 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.60</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5668, selfPID=5668, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 00:37:21 (5904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:42:35 (7244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:22:57 (6292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:22:58 (6292): No heartbeat from core client for 30 sec - exiting 03:22:59 (6292): No heartbeat from core client for 30 sec - exiting 03:23:00 (6292): No heartbeat from core client for 30 sec - exiting 03:23:01 (6292): No heartbeat from core client for 30 sec - exiting 03:23:02 (6292): No heartbeat from core client for 30 sec - exiting 03:23:03 (6292): No heartbeat from core client for 30 sec - exiting 03:23:04 (6292): No heartbeat from core client for 30 sec - exiting 03:23:05 (6292): No heartbeat from core client for 30 sec - exiting 03:23:06 (6292): No heartbeat from core client for 30 sec - exiting 03:23:07 (6292): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3116, selfPID=4804, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 0 20:32:36 (5356): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3840, selfPID=6056, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=516, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=996, selfPID=7220, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 18:15:44 (4520): No heartbeat from core client for 30 sec - exiting 18:15:45 (4520): No heartbeat from core client for 30 sec - exiting 18:15:46 (4520): No heartbeat from core client for 30 sec - exiting 18:15:47 (4520): No heartbeat from core client for 30 sec - exiting 18:15:49 (4520): No heartbeat from core client for 30 sec - exiting 18:15:50 (4520): No heartbeat from core client for 30 sec - exiting 18:15:51 (4520): No heartbeat from core client for 30 sec - exiting 18:15:52 (4520): No heartbeat from core client for 30 sec - exiting 18:15:53 (4520): No heartbeat from core client for 30 sec - exiting 18:15:54 (4520): No heartbeat from core client for 30 sec - exiting 18:15:55 (4520): No heartbeat from core client for 30 sec - exiting 18:15:56 (4520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:19:15 (5560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3912, selfPID=3912, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6624, iMonCtr=2 23:34:33 (620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:34:35 (620): No heartbeat from core client for 30 sec - exiting 23:34:36 (620): No heartbeat from core client for 30 sec - exiting 13:10:37 (5600): No heartbeat from core client for 30 sec - exiting 13:10:38 (5600): No heartbeat from core client for 30 sec - exiting 13:10:39 (5600): No heartbeat from core client for 30 sec - exiting 13:10:40 (5600): No heartbeat from core client for 30 sec - exiting 13:10:41 (5600): No heartbeat from core client for 30 sec - exiting 13:10:42 (5600): No heartbeat from core client for 30 sec - exiting 13:10:43 (5600): No heartbeat from core client for 30 sec - exiting 13:10:44 (5600): No heartbeat from core client for 30 sec - exiting 13:10:45 (5600): No heartbeat from core client for 30 sec - exiting 13:10:46 (5600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 14:25:34 (6360): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:21:24 (2740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6480, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5100, selfPID=3080, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1204, selfPID=3104, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1648, selfPID=7340, iMonCtr=1 Model crash detected, will try to restart... 01:18:32 (6724): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:32:34 (6008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
26 May 2012 00:39:47 | 1213518 | 14614356 | hadam3p_pnw_bzo4_1987_1_007932531_1 | 23,136 | 109,414 | 4.7292 |
20 May 2012 17:35:55 | 1213518 | 14614356 | hadam3p_pnw_bzo4_1987_1_007932531_1 | 11,616 | 56,407 | 4.8560 |
©2024 climateprediction.net