Name | hadcm3n_p51z_1900_40_007224135_2 |
Workunit | 7422375 |
Created | 6 May 2011, 15:44:11 UTC |
Sent | 6 May 2011, 15:49:18 UTC |
Report deadline | 5 Aug 2011, 23:16:29 UTC |
Received | 10 May 2011, 23:35:37 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 647489 |
Run time | 2 days 23 hours 42 min 13 sec |
CPU time | 1 day 9 hours 9 min 23 sec |
Validate state | Invalid |
Credit | 1,244.16 |
Device peak FLOPS | 2.76 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 23:18:27 (5372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:18:37 (5372): No heartbeat from core client for 30 sec - exiting 23:18:39 (5372): No heartbeat from core client for 30 sec - exiting 23:18:40 (5372): No heartbeat from core client for 30 sec - exiting 23:18:41 (5372): No heartbeat from core client for 30 sec - exiting 23:18:42 (5372): No heartbeat from core client for 30 sec - exiting 23:18:43 (5372): No heartbeat from core client for 30 sec - exiting 23:18:44 (5372): No heartbeat from core client for 30 sec - exiting 23:18:45 (5372): No heartbeat from core client for 30 sec - exiting 23:18:48 (5372): No heartbeat from core client for 30 sec - exiting 23:18:49 (5372): No heartbeat from core client for 30 sec - exiting 23:18:50 (5372): No heartbeat from core client for 30 sec - exiting 23:18:51 (5372): No heartbeat from core client for 30 sec - exiting 23:18:52 (5372): No heartbeat from core client for 30 sec - exiting 23:18:53 (5372): No heartbeat from core client for 30 sec - exiting 23:18:54 (5372): No heartbeat from core client for 30 sec - exiting 23:18:56 (5372): No heartbeat from core client for 30 sec - exiting 23:18:57 (5372): No heartbeat from core client for 30 sec - exiting 23:18:58 (5372): No heartbeat from core client for 30 sec - exiting 23:18:59 (5372): No heartbeat from core client for 30 sec - exiting 23:21:00 (4220): No heartbeat from core client for 30 sec - exiting 23:21:01 (4220): No heartbeat from core client for 30 sec - exiting 23:21:02 (4220): No heartbeat from core client for 30 sec - exiting 23:21:03 (4220): No heartbeat from core client for 30 sec - exiting 23:21:04 (4220): No heartbeat from core 
client for 30 sec - exiting 23:21:05 (4220): No heartbeat from core client for 30 sec - exiting 23:21:06 (4220): No heartbeat from core client for 30 sec - exiting 23:21:07 (4220): No heartbeat from core client for 30 sec - exiting 23:21:08 (4220): No heartbeat from core client for 30 sec - exiting 23:21:09 (4220): No heartbeat from core client for 30 sec - exiting 23:21:10 (4220): No heartbeat from core client for 30 sec - exiting 23:21:11 (4220): No heartbeat from core client for 30 sec - exiting 23:21:12 (4220): No heartbeat from core client for 30 sec - exiting 23:21:13 (4220): No heartbeat from core client for 30 sec - exiting 23:21:14 (4220): No heartbeat from core client for 30 sec - exiting 23:21:15 (4220): No heartbeat from core client for 30 sec - exiting 23:21:16 (4220): No heartbeat from core client for 30 sec - exiting 23:21:17 (4220): No heartbeat from core client for 30 sec - exiting 23:21:18 (4220): No heartbeat from core client for 30 sec - exiting 23:21:19 (4220): No heartbeat from core client for 30 sec - exiting 23:21:20 (4220): No heartbeat from core client for 30 sec - exiting 23:21:21 (4220): No heartbeat from core client for 30 sec - exiting 23:21:22 (4220): No heartbeat from core client for 30 sec - exiting 23:21:23 (4220): No heartbeat from core client for 30 sec - exiting 23:21:24 (4220): No heartbeat from core client for 30 sec - exiting 23:21:25 (4220): No heartbeat from core client for 30 sec - exiting 23:21:26 (4220): No heartbeat from core client for 30 sec - exiting 23:21:27 (4220): No heartbeat from core client for 30 sec - exiting 23:21:28 (4220): No heartbeat from core client for 30 sec - exiting 23:21:29 (4220): No heartbeat from core client for 30 sec - exiting 23:21:30 (4220): No heartbeat from core client for 30 sec - exiting 23:21:31 (4220): No heartbeat from core client for 30 sec - exiting 23:21:32 (4220): No heartbeat from core client for 30 sec - exiting 23:21:33 (4220): No heartbeat from core client for 30 sec - exiting 
23:21:34 (4220): No heartbeat from core client for 30 sec - exiting 23:21:35 (4220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5776, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5900, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5476, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5476, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5476, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5476, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5476, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 4 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5676, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 
Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5676, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5676, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 06:45:56 (6968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:46:06 (6968): No heartbeat from core client for 30 sec - exiting 06:46:07 (6968): No heartbeat from core client for 30 sec - exiting 06:46:08 (6968): No heartbeat from core client for 30 sec - exiting 06:46:09 (6968): No heartbeat from core client for 30 sec - exiting 06:46:10 (6968): No heartbeat from core client for 30 sec - exiting 06:46:11 (6968): No heartbeat from core client for 30 sec - exiting 06:46:12 (6968): No heartbeat from core client for 30 sec - exiting 06:46:13 (6968): No heartbeat from core client for 30 sec - exiting 06:46:14 (6968): No heartbeat from core client for 30 sec - exiting 06:46:15 (6968): No heartbeat from core client for 30 sec - exiting 06:46:16 (6968): No heartbeat from core client for 30 sec - exiting 06:46:18 (6968): No heartbeat from core client for 30 sec - exiting 06:46:19 (6968): No heartbeat from core client for 30 sec - exiting 06:46:20 (6968): No heartbeat from core client for 30 sec - exiting 06:46:21 (6968): No heartbeat from core client for 30 sec - exiting 06:46:22 (6968): No heartbeat from core client for 30 sec - exiting 06:46:23 (6968): No heartbeat from core client for 30 sec - exiting 06:46:24 (6968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... 
Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1912, iMonCtr=1 Model crash detected, will try to restart... </stderr_txt> ]]> |
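
The stderr text above is long and repetitive, so a tally of the recurring messages can make it easier to read. The following is a minimal sketch, not part of BOINC or CPDN: the `PATTERNS` table, the `summarize` helper, and the `sample` excerpt are all hypothetical names introduced for illustration, and the regular expressions only assume the message wording visible in the log above.

```python
import re
from collections import Counter

# Short excerpt in the same shape as the stderr text above (not the full log).
sample = (
    "23:18:27 (5372): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "23:18:37 (5372): No heartbeat from core client for 30 sec - exiting "
    "Signal 11 received, exiting... Called boinc_finish "
    "Model crash detected, will try to restart... "
    "Signal 4 received, exiting... Called boinc_finish "
    "Model crash detected, will try to restart... "
)

# Hypothetical pattern table; wording is taken from the log, names are ours.
PATTERNS = {
    "heartbeat_timeout": re.compile(
        r"\d{2}:\d{2}:\d{2} \((\d+)\): No heartbeat from core client"
    ),
    "signal": re.compile(r"Signal (\d+) received"),
    "restart_attempt": re.compile(r"Model crash detected, will try to restart"),
}


def summarize(stderr_text):
    """Count recurring message types in a BOINC/CPDN stderr dump."""
    # Collapse line wraps so a message split across lines still matches.
    text = " ".join(stderr_text.split())
    counts = Counter()
    for name, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            # Keep signals apart by number (e.g. signal_11 vs signal_4).
            key = f"signal_{match.group(1)}" if name == "signal" else name
            counts[key] += 1
    return counts


summary = summarize(sample)
for key, count in sorted(summary.items()):
    print(f"{key}: {count}")
```

Passing the full contents of the `<stderr_txt>` element to `summarize` instead of `sample` would give the per-message totals for this task.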
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
09 May 2011 23:14:35 | 647489 | 12873332 | hadcm3n_p51z_1900_40_007224135_2 | 103,680 | 93,497 | 0.9018 |
08 May 2011 20:36:38 | 647489 | 12873332 | hadcm3n_p51z_1900_40_007224135_2 | 77,760 | 62,055 | 0.7980 |
08 May 2011 05:12:40 | 647489 | 12873332 | hadcm3n_p51z_1900_40_007224135_2 | 51,840 | 54,688 | 1.0549 |
07 May 2011 10:43:24 | 647489 | 12873332 | hadcm3n_p51z_1900_40_007224135_2 | 25,920 | 42,649 | 1.6454 |
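
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the cumulative timestep count. That relationship is an assumption inferred from the numbers in the table, not something the page states; the sketch below recomputes it for each row so the assumption can be checked. The helper name `sec_per_timestep` is hypothetical.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
trickles = [
    (103_680, 93_497, 0.9018),
    (77_760, 62_055, 0.7980),
    (51_840, 54_688, 1.0549),
    (25_920, 42_649, 1.6454),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per cumulative timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in trickles:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    print(
        f"{timestep:>7,} TS  {cpu_time_sec:>6,} s  "
        f"computed={computed:.4f}  reported={reported:.4f}"
    )
```

Each computed value agrees with the reported one to four decimal places, which is consistent with the assumed definition. The rows are also spaced exactly 25,920 timesteps apart.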
©2024 cpdn.org