Name | hadam3p_pnw_2z0i_1959_1_007178874_1 |
Workunit | 7377156 |
Created | 11 Mar 2011, 12:18:07 UTC |
Sent | 11 Mar 2011, 15:37:34 UTC |
Report deadline | 21 Feb 2012, 20:57:34 UTC |
Received | 25 Apr 2011, 18:25:53 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 989453 |
Run time | 6 days 12 hours 23 min 43 sec |
CPU time | 5 days 22 hours 1 min 7 sec |
Validate state | Workunit error - check skipped |
Credit | 3,005.88 |
Device peak FLOPS | 1.80 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.08 windows_intelx86 |
Stderr | <core_client_version>6.6.36</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4124, selfPID=2024, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4364, selfPID=2464, iMonCtr=1 Model crash detected, will try to restart... 10:35:41 (3488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1184, selfPID=4396, iMonCtr=1 Model crash detected, will try to restart... 13:12:02 (4176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1616, selfPID=1616, iMonCtr=2 11:14:28 (1192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:36:16 (3384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3468, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1980, selfPID=3228, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1240, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1164, selfPID=4252, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12, selfPID=3668, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4208, selfPID=1824, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 4 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3808, iMonCtr=2 Model crash detected, will try to restart... 09:57:02 (3596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4992, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3540, selfPID=684, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3492, selfPID=1888, iMonCtr=1 Model crash detected, will try to restart... 13:18:52 (4616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4216, selfPID=2296, iMonCtr=1 Model crash detected, will try to restart... 10:11:23 (4008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4120, selfPID=1384, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2624, selfPID=1200, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3576, selfPID=1236, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3744, selfPID=3116, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 9 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4300, iMonCtr=2 Model crash detected, will try to restart... 10:11:08 (3560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4724, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4672, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3608, selfPID=1348, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4484, selfPID=3480, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4368, selfPID=4008, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 11 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4216, selfPID=748, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3516, selfPID=3896, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3536, iMonCtr=2 Leaving CPDN_Main::Monitor... 13:38:13 (3756): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
25 Apr 2011 17:32:14 | 989453 | 12660446 | hadam3p_pnw_2z0i_1959_1_007178874_1 | 138,336 | 510,340 | 3.6891 |
20 Apr 2011 21:32:50 | 989453 | 12660446 | hadam3p_pnw_2z0i_1959_1_007178874_1 | 126,816 | 467,696 | 3.6880 |
20 Apr 2011 18:11:54 | 989453 | 12660446 | hadam3p_pnw_2z0i_1959_1_007178874_1 | 115,296 | 425,096 | 3.6870 |
11 Apr 2011 20:53:44 | 989453 | 12660446 | hadam3p_pnw_2z0i_1959_1_007178874_1 | 103,776 | 382,965 | 3.6903 |
08 Apr 2011 17:29:53 | 989453 | 12660446 | hadam3p_pnw_2z0i_1959_1_007178874_1 | 92,256 | 341,107 | 3.6974 |
06 Apr 2011 17:55:10 | 989453 | 12660446 | hadam3p_pnw_2z0i_1959_1_007178874_1 | 80,736 | 299,158 | 3.7054 |
04 Apr 2011 20:42:35 | 989453 | 12660446 | hadam3p_pnw_2z0i_1959_1_007178874_1 | 69,216 | 257,304 | 3.7174 |
30 Mar 2011 20:42:33 | 989453 | 12660446 | hadam3p_pnw_2z0i_1959_1_007178874_1 | 57,696 | 214,376 | 3.7156 |
28 Mar 2011 14:38:31 | 989453 | 12660446 | hadam3p_pnw_2z0i_1959_1_007178874_1 | 46,176 | 172,154 | 3.7282 |
22 Mar 2011 18:41:32 | 989453 | 12660446 | hadam3p_pnw_2z0i_1959_1_007178874_1 | 34,656 | 128,451 | 3.7065 |
20 Mar 2011 01:29:15 | 989453 | 12660446 | hadam3p_pnw_2z0i_1959_1_007178874_1 | 23,136 | 85,606 | 3.7001 |
14 Mar 2011 19:17:43 | 989453 | 12660446 | hadam3p_pnw_2z0i_1959_1_007178874_1 | 11,616 | 43,300 | 3.7276 |
©2024 cpdn.org