Name | hadam3p_anz_x0aw_2007_1_009871236_0 |
Workunit | 9909733 |
Created | 1 Jun 2015, 10:26:10 UTC |
Sent | 1 Jun 2015, 13:21:21 UTC |
Report deadline | 13 May 2016, 18:41:21 UTC |
Received | 17 Jun 2015, 12:50:03 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1361081 |
Run time | 9 days 18 hours 47 min 26 sec |
CPU time | 8 days 22 hours 35 min 24 sec |
Validate state | Invalid |
Credit | 4,484.28 |
Device peak FLOPS | 2.44 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.4.42</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
02:05:57 (12124): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:21:06 (12716): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:21:07 (12716): No heartbeat from core client for 30 sec - exiting
02:21:08 (12716): No heartbeat from core client for 30 sec - exiting
02:21:09 (12716): No heartbeat from core client for 30 sec - exiting
02:21:10 (12716): No heartbeat from core client for 30 sec - exiting
02:21:11 (12716): No heartbeat from core client for 30 sec - exiting
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11536, selfPID=11536, iMonCtr=2
02:21:12 (12716): No heartbeat from core client for 30 sec - exiting
02:21:13 (12716): No heartbeat from core client for 30 sec - exiting
02:21:14 (12716): No heartbeat from core client for 30 sec - exiting
02:21:15 (12716): No heartbeat from core client for 30 sec - exiting
02:21:16 (12716): No heartbeat from core client for 30 sec - exiting
02:33:26 (12488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:33:27 (12488): No heartbeat from core client for 30 sec - exiting
02:33:28 (12488): No heartbeat from core client for 30 sec - exiting
02:33:29 (12488): No heartbeat from core client for 30 sec - exiting
02:33:30 (12488): No heartbeat from core client for 30 sec - exiting
02:33:31 (12488): No heartbeat from core client for 30 sec - exiting
02:33:32 (12488): No heartbeat from core client for 30 sec - exiting
02:36:25 (6528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:36:26 (6528): No heartbeat from core client for 30 sec - exiting
02:36:27 (6528): No heartbeat from core client for 30 sec - exiting
02:39:29 (9168): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10020, selfPID=10020, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13012, selfPID=13012, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5900, selfPID=5900, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11200, selfPID=11200, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
03:44:33 (2984): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1332, selfPID=1332, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4864, selfPID=4864, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4272, selfPID=3208, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6232, selfPID=6232, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5956, iMonCtr=2
</stderr_txt>
]]> |
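The exit status in the table above appears in two forms, a signed integer (-226) and a hexadecimal value (0xFFFFFF1E). As a minimal sketch of how the two relate, assuming the status is stored as a 32-bit two's-complement integer (the helper names `to_hex32` and `from_hex32` are hypothetical and not part of BOINC):

```python
def to_hex32(status: int) -> str:
    """Format a signed exit status as an unsigned 32-bit hex string."""
    # Masking with 0xFFFFFFFF yields the 32-bit two's-complement bit pattern.
    return f"0x{status & 0xFFFFFFFF:08X}"


def from_hex32(text: str) -> int:
    """Interpret a 32-bit hex string as a signed integer."""
    value = int(text, 16)
    # If the sign bit (bit 31) is set, the value is negative.
    return value - (1 << 32) if value & (1 << 31) else value


print(to_hex32(-226))            # 0xFFFFFF1E
print(from_hex32("0xFFFFFF1E"))  # -226
```

Both directions agree with the values reported for this result.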
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
17 Jun 2015 12:50:33 | 1361081 | 18521110 | hadam3p_anz_x0aw_2007_1_009871236_0 | 103,979 | 774,682 | 7.4504 |
15 Jun 2015 04:06:39 | 1361081 | 18521110 | hadam3p_anz_x0aw_2007_1_009871236_0 | 92,459 | 690,854 | 7.4720 |
13 Jun 2015 06:05:37 | 1361081 | 18521110 | hadam3p_anz_x0aw_2007_1_009871236_0 | 80,939 | 601,413 | 7.4304 |
11 Jun 2015 21:59:16 | 1361081 | 18521110 | hadam3p_anz_x0aw_2007_1_009871236_0 | 69,419 | 516,794 | 7.4446 |
10 Jun 2015 15:59:05 | 1361081 | 18521110 | hadam3p_anz_x0aw_2007_1_009871236_0 | 57,899 | 432,833 | 7.4757 |
07 Jun 2015 22:14:06 | 1361081 | 18521110 | hadam3p_anz_x0aw_2007_1_009871236_0 | 46,379 | 348,136 | 7.5063 |
05 Jun 2015 02:47:32 | 1361081 | 18521110 | hadam3p_anz_x0aw_2007_1_009871236_0 | 34,859 | 265,098 | 7.6049 |
03 Jun 2015 19:23:50 | 1361081 | 18521110 | hadam3p_anz_x0aw_2007_1_009871236_0 | 23,339 | 181,005 | 7.7555 |
03 Jun 2015 10:22:16 | 1361081 | 18521110 | hadam3p_anz_x0aw_2007_1_009871236_0 | 11,819 | 90,522 | 7.6590 |
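The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below recomputes it from the rows above under that assumption; the variable and function names are illustrative only.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table,
# newest first.
TRICKLES = [
    (103_979, 774_682, 7.4504),
    (92_459, 690_854, 7.4720),
    (80_939, 601_413, 7.4304),
    (69_419, 516_794, 7.4446),
    (57_899, 432_833, 7.4757),
    (46_379, 348_136, 7.5063),
    (34_859, 265_098, 7.6049),
    (23_339, 181_005, 7.7555),
    (11_819, 90_522, 7.6590),
]


def average_sec_per_timestep(timestep: int, cpu_time_sec: int) -> float:
    """Cumulative CPU seconds spent per model timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_time, reported in TRICKLES:
    computed = average_sec_per_timestep(timestep, cpu_time)
    print(f"{timestep:>7} TS  computed {computed:.4f}  reported {reported:.4f}")
```

Each computed value matches the reported figure to four decimal places, which supports reading the column as a running average rather than a per-interval rate.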