Name | hadam3p_anz_n4yd_2012_1_008580109_0 |
Workunit | 8726621 |
Created | 25 Mar 2014, 18:51:01 UTC |
Sent | 26 Mar 2014, 15:24:52 UTC |
Report deadline | 8 Mar 2015, 20:44:52 UTC |
Received | 28 Apr 2014, 17:31:44 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 1 (0x00000001) Unknown error code |
Computer ID | 1321213 |
Run time | 8 days 15 hours 0 min 36 sec |
CPU time | 7 days 20 hours 24 min 33 sec |
Validate state | Invalid |
Credit | 3,987.46 |
Device peak FLOPS | 2.42 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <message> Incorrect function. (0x1) - exit code 1 (0x1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3584, selfPID=3584, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:55:39 (4216): No heartbeat from core client for 30 sec - exiting 08:55:40 (4216): No heartbeat from core client for 30 sec - exiting 08:55:41 (4216): No heartbeat from core client for 30 sec - exiting 08:55:42 (4216): No heartbeat from core client for 30 sec - exiting 08:55:43 (4216): No heartbeat from core client for 30 sec - exiting 08:55:44 (4216): No heartbeat from core client for 30 sec - exiting 08:55:45 (4216): No heartbeat from core client for 30 sec - exiting 08:55:46 (4216): No heartbeat from core client for 30 sec - exiting 08:55:47 (4216): No heartbeat from core client for 30 sec - exiting 08:55:48 (4216): No heartbeat from core client for 30 sec - exiting 08:55:49 (4216): No heartbeat from core client for 30 sec - exiting 08:55:50 (4216): No heartbeat from core client for 30 sec - exiting 08:55:51 (4216): No heartbeat from core client for 30 sec - exiting 08:55:52 (4216): No heartbeat from core client for 30 sec - exiting 08:55:53 (4216): No heartbeat from core client for 30 sec - exiting 08:55:54 (4216): No heartbeat from core client for 30 sec - exiting 08:55:55 (4216): No heartbeat from core client for 30 sec - exiting 08:55:56 (4216): No heartbeat from core client for 30 sec - exiting 08:55:57 (4216): No heartbeat from core client for 30 sec - exiting 08:55:58 (4216): No heartbeat from core client for 30 sec - exiting 08:55:59 (4216): No heartbeat from core client for 30 sec - exiting 08:56:00 (4216): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6832, selfPID=6832, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3768, selfPID=3768, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14428, selfPID=14428, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6928, selfPID=6928, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7284, selfPID=7284, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2672, selfPID=2672, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10968, selfPID=10968, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13108, selfPID=13108, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7140, selfPID=7140, iMonCtr=2 GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6596, iMonCtr=2 Model crash detected, will try to restart... lobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4700, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6180, selfPID=6180, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6148, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 10:51:42 (5456): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5684, selfPID=5684, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5392, selfPID=CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8104, selfPID=8104, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4240, selfPID=4240, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:30:54 (5312): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5284, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6360, selfPID=7076, iMonCtr=1 Model crash detected, will try to restart... 09:28:22 (4440): No heartbeat from core client for 30 sec - exiting 09:28:23 (4440): No heartbeat from core client for 30 sec - exiting 09:28:24 (4440): No heartbeat from core client for 30 sec - exiting 09:28:25 (4440): No heartbeat from core client for 30 sec - exiting 09:28:26 (4440): No heartbeat from core client for 30 sec - exiting 09:28:27 (4440): No heartbeat from core client for 30 sec - exiting 09:28:28 (4440): No heartbeat from core client for 30 sec - exiting 09:28:29 (4440): No heartbeat from core client for 30 sec - exiting 09:28:30 (4440): No heartbeat from core client for 30 sec - exiting 09:28:31 (4440): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:29:12 (3560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6032, selfPID=6032, iMonCtr=2 16:52:55 (6616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:52:57 (6616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3616, selfPID=3616, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6640, selfPID=6640, iMonCtr=2 CPDN Monitor - Quit request from BOINC... GCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 15:51:37 (7108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... R </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
24 Apr 2014 21:40:40 | 1321213 | 16395129 | hadam3p_anz_n4yd_2012_1_008580109_0 | 92,459 | 621,196 | 6.7186 |
19 Apr 2014 17:59:00 | 1321213 | 16395129 | hadam3p_anz_n4yd_2012_1_008580109_0 | 80,939 | 538,930 | 6.6585 |
13 Apr 2014 23:28:45 | 1321213 | 16395129 | hadam3p_anz_n4yd_2012_1_008580109_0 | 69,419 | 462,197 | 6.6581 |
09 Apr 2014 18:12:36 | 1321213 | 16395129 | hadam3p_anz_n4yd_2012_1_008580109_0 | 57,899 | 385,605 | 6.6600 |
07 Apr 2014 20:45:36 | 1321213 | 16395129 | hadam3p_anz_n4yd_2012_1_008580109_0 | 46,379 | 311,294 | 6.7120 |
05 Apr 2014 15:43:41 | 1321213 | 16395129 | hadam3p_anz_n4yd_2012_1_008580109_0 | 34,859 | 236,848 | 6.7945 |
01 Apr 2014 03:11:46 | 1321213 | 16395129 | hadam3p_anz_n4yd_2012_1_008580109_0 | 23,339 | 161,695 | 6.9281 |
28 Mar 2014 18:58:14 | 1321213 | 16395129 | hadam3p_anz_n4yd_2012_1_008580109_0 | 11,819 | 78,500 | 6.6418 |
©2024 cpdn.org