Name | hadam3p_saf_1tci_1977_1_007004266_1 |
Workunit | 7207582 |
Created | 27 Jan 2011, 0:01:51 UTC |
Sent | 17 Feb 2011, 18:01:27 UTC |
Report deadline | 30 Jan 2012, 23:21:27 UTC |
Received | 19 Jun 2011, 10:13:57 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1076660 |
Run time | 3 days 21 hours 7 min |
CPU time | 4 days 5 hours 52 min 21 sec |
Validate state | Workunit error - check skipped |
Credit | 2,244.09 |
Device peak FLOPS | 2.20 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 18:08:49 (4724): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4748, selfPID=4748, iMonCtr=2 18:10:35 (3700): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:10:37 (3700): No heartbeat from core client for 30 sec - exiting 18:14:11 (4372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1896, selfPID=1896, iMonCtr=2 18:14:16 (2160): Can't acquire lockfile (32) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4212, iMonCtr=2 Model crash detected, will try to restart... 17:19:46 (4160): No heartbeat from core client for 30 sec - exiting 17:19:47 (4160): No heartbeat from core client for 30 sec - exiting 17:19:48 (4160): No heartbeat from core client for 30 sec - exiting 17:19:49 (4160): No heartbeat from core client for 30 sec - exiting 17:19:50 (4160): No heartbeat from core client for 30 sec - exiting 17:19:51 (4160): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
12:29:36 (3352): No heartbeat from core client for 30 sec - exiting 12:29:37 (3352): No heartbeat from core client for 30 sec - exiting 12:29:38 (3352): No heartbeat from core client for 30 sec - exiting 12:29:39 (3352): No heartbeat from core client for 30 sec - exiting 12:29:40 (3352): No heartbeat from core client for 30 sec - exiting 12:29:41 (3352): No heartbeat from core client for 30 sec - exiting 12:29:42 (3352): No heartbeat from core client for 30 sec - exiting 12:29:43 (3352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3324, iMonCtr=2 Model crash detected, will try to restart... CSuspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3264, selfPID=3768, iMonCtr=1 Model crash detected, will try to restart... CCController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4400, selfPID=3592, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4288, selfPID=2468, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3760, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4932, selfPID=2196, iMonCtr=1 Model crash detected, will try to restart... GSuspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4184, selfPID=4016, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4416, selfPID=3040, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4416, selfPID=3728, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4136, selfPID=2644, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4004, selfPID=2124, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4140, selfPID=4084, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 
19:46:28 (3360): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3136, selfPID=3852, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:53:25 (4572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3300, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4956, selfPID=2320, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2492, selfPID=3380, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Leaving CPDN_Main::Monitor... 21:34:43 (2124): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
19 Jun 2011 22:05:36 | 1076660 | 12538397 | hadam3p_saf_1tci_1977_1_007004266_1 | 138,336 | 366,192 | 2.6471 |
07 Jun 2011 15:53:17 | 1076660 | 12538397 | hadam3p_saf_1tci_1977_1_007004266_1 | 126,816 | 335,693 | 2.6471 |
02 Jun 2011 17:34:31 | 1076660 | 12538397 | hadam3p_saf_1tci_1977_1_007004266_1 | 115,296 | 303,213 | 2.6299 |
25 May 2011 17:01:26 | 1076660 | 12538397 | hadam3p_saf_1tci_1977_1_007004266_1 | 103,776 | 274,168 | 2.6419 |
05 May 2011 17:18:57 | 1076660 | 12538397 | hadam3p_saf_1tci_1977_1_007004266_1 | 92,256 | 243,116 | 2.6352 |
23 Apr 2011 22:04:22 | 1076660 | 12538397 | hadam3p_saf_1tci_1977_1_007004266_1 | 80,736 | 213,553 | 2.6451 |
22 Apr 2011 17:29:14 | 1076660 | 12538397 | hadam3p_saf_1tci_1977_1_007004266_1 | 69,216 | 182,645 | 2.6388 |
08 Apr 2011 09:44:41 | 1076660 | 12538397 | hadam3p_saf_1tci_1977_1_007004266_1 | 57,696 | 153,443 | 2.6595 |
05 Apr 2011 13:37:09 | 1076660 | 12538397 | hadam3p_saf_1tci_1977_1_007004266_1 | 46,186 | 123,212 | 2.6677 |
05 Apr 2011 06:49:42 | 1076660 | 12538397 | hadam3p_saf_1tci_1977_1_007004266_1 | 46,176 | 122,648 | 2.6561 |
31 Mar 2011 12:06:54 | 1076660 | 12538397 | hadam3p_saf_1tci_1977_1_007004266_1 | 34,656 | 92,307 | 2.6635 |
11 Mar 2011 16:54:02 | 1076660 | 12538397 | hadam3p_saf_1tci_1977_1_007004266_1 | 23,136 | 62,784 | 2.7137 |
08 Mar 2011 13:25:35 | 1076660 | 12538397 | hadam3p_saf_1tci_1977_1_007004266_1 | 11,616 | 31,373 | 2.7008 |
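
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The page does not state this; it is an inference from the listed values. A minimal Python sketch of that check follows, using three rows copied from the table. The function name `average_sec_per_timestep` and the rounding to four decimal places are assumptions made for illustration.

```python
# Sketch: re-derive "Average (sec/TS)" from the trickle table.
# Assumption (not stated on the page): average = CPU time / timestep.

def average_sec_per_timestep(cpu_seconds: int, timesteps: int) -> float:
    """Return CPU seconds per model timestep, rounded to 4 decimals."""
    if timesteps <= 0:
        raise ValueError("timesteps must be positive")
    return round(cpu_seconds / timesteps, 4)

# (timestep, CPU time in seconds, average listed in the table)
rows = [
    (138_336, 366_192, 2.6471),  # 19 Jun 2011 trickle
    (46_186, 123_212, 2.6677),   # 05 Apr 2011 13:37:09 trickle
    (11_616, 31_373, 2.7008),    # 08 Mar 2011 trickle
]

for timesteps, cpu_seconds, listed in rows:
    derived = average_sec_per_timestep(cpu_seconds, timesteps)
    # Compare with a small tolerance rather than exact float equality.
    matches = abs(derived - listed) < 5e-5
    print(f"{timesteps:>7} TS: derived {derived:.4f}, listed {listed:.4f}, match={matches}")
```

If the assumption holds, every row should report a match; a mismatch would suggest the column is computed some other way, for example per trickle interval rather than cumulatively.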
©2024 cpdn.org