Name | hadam3p_saf_1w19_2004_1_007017349_1 |
Workunit | 7220665 |
Created | 3 Jan 2012, 17:03:43 UTC |
Sent | 3 Jan 2012, 17:03:51 UTC |
Report deadline | 15 Dec 2012, 22:23:51 UTC |
Received | 26 Feb 2012, 11:59:08 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 953471 |
Run time | 4 days 13 hours 4 min 10 sec |
CPU time | 3 days 11 hours 52 min 14 sec |
Validate state | Workunit error - check skipped |
Credit | 2,244.09 |
Device peak FLOPS | 2.52 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7040, selfPID=5972, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5216, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7136, selfPID=7136, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4204, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6312, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4688, selfPID=3864, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6540, selfPID=5036, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4436, selfPID=4676, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6056, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6140, iMonCtr=2 Model crash detected, will try to restart... 22:08:24 (1364): No heartbeat from core client for 30 sec - exiting 22:08:25 (1364): No heartbeat from core client for 30 sec - exiting 22:08:26 (1364): No heartbeat from core client for 30 sec - exiting 22:08:27 (1364): No heartbeat from core client for 30 sec - exiting 22:08:28 (1364): No heartbeat from core client for 30 sec - exiting 22:08:30 (1364): No heartbeat from core client for 30 sec - exiting 22:08:31 (1364): No heartbeat from core client for 30 sec - exiting 22:08:32 (1364): No heartbeat from core client for 30 sec - exiting 22:08:33 (1364): No heartbeat from core client for 30 sec - exiting 22:08:35 (1364): No heartbeat from core client for 30 sec - exiting 22:08:36 (1364): No heartbeat from core client for 30 sec - exiting 22:08:37 (1364): No heartbeat from core client for 30 sec - exiting 22:08:38 (1364): No heartbeat from core client for 30 sec - exiting 22:08:39 (1364): No heartbeat from core client for 30 sec - exiting 22:08:40 (1364): No heartbeat from core client for 30 sec - exiting 22:08:41 (1364): No heartbeat from core client for 30 sec - exiting 22:08:42 (1364): No heartbeat from core client for 30 sec - exiting 22:08:43 (1364): No heartbeat from core client for 30 sec - exiting 22:08:44 (1364): No heartbeat from core client for 30 sec - exiting 22:08:45 (1364): No heartbeat from core client for 30 sec - exiting 22:08:47 (1364): No heartbeat from core client for 30 sec - exiting 22:08:48 (1364): No heartbeat from core client for 30 sec - exiting 22:08:49 (1364): No heartbeat from core client for 30 sec - exiting 22:08:50 (1364): No heartbeat from core client for 30 sec - exiting 22:08:51 (1364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1748, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5904, selfPID=6012, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1676, selfPID=5452, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 21:19:53 (3980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1616, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
26 Feb 2012 12:01:52 | 953471 | 13854668 | hadam3p_saf_1w19_2004_1_007017349_1 | 138,336 | 301,456 | 2.1792 |
26 Feb 2012 12:01:52 | 953471 | 13854668 | hadam3p_saf_1w19_2004_1_007017349_1 | 126,816 | 277,648 | 2.1894 |
26 Feb 2012 12:01:52 | 953471 | 13854668 | hadam3p_saf_1w19_2004_1_007017349_1 | 115,296 | 254,340 | 2.2060 |
26 Feb 2012 12:01:52 | 953471 | 13854668 | hadam3p_saf_1w19_2004_1_007017349_1 | 103,776 | 229,113 | 2.2078 |
08 Feb 2012 19:13:39 | 953471 | 13854668 | hadam3p_saf_1w19_2004_1_007017349_1 | 92,256 | 205,333 | 2.2257 |
08 Feb 2012 19:13:39 | 953471 | 13854668 | hadam3p_saf_1w19_2004_1_007017349_1 | 80,736 | 181,107 | 2.2432 |
30 Jan 2012 21:08:34 | 953471 | 13854668 | hadam3p_saf_1w19_2004_1_007017349_1 | 69,216 | 156,314 | 2.2584 |
30 Jan 2012 21:08:34 | 953471 | 13854668 | hadam3p_saf_1w19_2004_1_007017349_1 | 57,696 | 131,002 | 2.2706 |
24 Jan 2012 20:46:15 | 953471 | 13854668 | hadam3p_saf_1w19_2004_1_007017349_1 | 46,176 | 106,313 | 2.3023 |
13 Jan 2012 18:10:12 | 953471 | 13854668 | hadam3p_saf_1w19_2004_1_007017349_1 | 34,671 | 80,709 | 2.3279 |
13 Jan 2012 18:10:12 | 953471 | 13854668 | hadam3p_saf_1w19_2004_1_007017349_1 | 34,656 | 80,324 | 2.3178 |
08 Jan 2012 12:45:48 | 953471 | 13854668 | hadam3p_saf_1w19_2004_1_007017349_1 | 23,136 | 54,285 | 2.3463 |
07 Jan 2012 21:22:17 | 953471 | 13854668 | hadam3p_saf_1w19_2004_1_007017349_1 | 11,616 | 27,659 | 2.3811 |
©2024 cpdn.org