Name | hadam3p_anz_c0h4_2013_1_009716485_0 |
Workunit | 9790634 |
Created | 8 Apr 2015, 15:31:44 UTC |
Sent | 18 Apr 2015, 14:03:29 UTC |
Report deadline | 30 Mar 2016, 19:23:29 UTC |
Received | 31 Oct 2015, 10:02:32 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1171334 |
Run time | 37 days 1 hour 48 min 50 sec |
CPU time | 32 days 22 hours 24 min 30 sec |
Validate state | Invalid |
Credit | 5,477.92 |
Device peak FLOPS | 0.70 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5028, selfPID=4640, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3128, selfPID=4004, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4592, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4908, selfPID=5764, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5340, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4180, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5292, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5072, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5252, iMonCtr=2 Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4328, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5664, selfPID=5204, iMonCtr=1 Model crash detected, will try to restart... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4744, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6036, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4564, selfPID=5788, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1928, selfPID=4884, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4476, selfPID=3892, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5424, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1680, selfPID=6068, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=256, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6444, selfPID=7476, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1728, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1652, selfPID=4376, iMonCtr=1 Model crash detected, will try to restart... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2096, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4360, selfPID=880, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2764, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1708, selfPID=2768, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5440, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5504, selfPID=2420, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4536, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4608, selfPID=3900, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 21:58:08 (4032): No heartbeat from core client for 30 sec - exiting 21:58:09 (4032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5700, selfPID=2096, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2152, selfPID=3808, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... 
22:38:27 (1596): No heartbeat from core client for 30 sec - exiting 22:38:28 (1596): No heartbeat from core client for 30 sec - exiting 22:38:29 (1596): No heartbeat from core client for 30 sec - exiting 22:38:30 (1596): No heartbeat from core client for 30 sec - exiting 22:38:31 (1596): No heartbeat from core client for 30 sec - exiting 22:38:32 (1596): No heartbeat from core client for 30 sec - exiting 22:38:33 (1596): No heartbeat from core client for 30 sec - exiting 22:38:34 (1596): No heartbeat from core client for 30 sec - exiting 22:38:35 (1596): No heartbeat from core client for 30 sec - exiting 22:38:37 (1596): No heartbeat from core client for 30 sec - exiting 22:38:38 (1596): No heartbeat from core client for 30 sec - exiting 22:38:39 (1596): No heartbeat from core client for 30 sec - exiting 22:38:40 (1596): No heartbeat from core client for 30 sec - exiting 22:38:41 (1596): No heartbeat from core client for 30 sec - exiting 22:38:42 (1596): No heartbeat from core client for 30 sec - exiting 22:38:43 (1596): No heartbeat from core client for 30 sec - exiting 22:38:44 (1596): No heartbeat from core client for 30 sec - exiting 22:38:45 (1596): No heartbeat from core client for 30 sec - exiting 22:38:46 (1596): No heartbeat from core client for 30 sec - exiting 22:38:47 (1596): No heartbeat from core client for 30 sec - exiting 22:38:49 (1596): No heartbeat from core client for 30 sec - exiting 22:38:50 (1596): No heartbeat from core client for 30 sec - exiting 22:38:51 (1596): No heartbeat from core client for 30 sec - exiting 22:38:52 (1596): No heartbeat from core client for 30 sec - exiting 22:38:53 (1596): No heartbeat from core client for 30 sec - exiting 22:38:54 (1596): No heartbeat from core client for 30 sec - exiting 22:38:55 (1596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1388, selfPID=4784, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5008, selfPID=4036, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 21:10:13 (2576): No heartbeat from core client for 30 sec - exiting 21:10:14 (2576): No heartbeat from core client for 30 sec - exiting 21:10:15 (2576): No heartbeat from core client for 30 sec - exiting 21:10:16 (2576): No heartbeat from core client for 30 sec - exiting 21:10:17 (2576): No heartbeat from core client for 30 sec - exiting 21:10:18 (2576): No heartbeat from core client for 30 sec - exiting 21:10:19 (2576): No heartbeat from core client for 30 sec - exiting 21:10:20 (2576): No heartbeat from core client for 30 sec - exiting 21:10:21 (2576): No heartbeat from core client for 30 sec - exiting 21:10:23 (2576): No heartbeat from core client for 30 sec - exiting 21:10:24 (2576): No heartbeat from core client for 30 sec - exiting 21:10:25 (2576): No heartbeat from core client for 30 sec - exiting 21:10:26 (2576): No heartbeat from core client for 30 sec - exiting 21:10:27 (2576): No heartbeat from core client for 30 sec - exiting 21:10:28 (2576): No heartbeat from core client for 30 sec - exiting 21:10:29 (2576): No heartbeat from core client for 30 sec - exiting 21:10:30 (2576): No heartbeat from core client for 30 sec - exiting 21:10:31 (2576): No heartbeat from core client for 30 sec - exiting 21:10:32 (2576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4724, selfPID=4724, iMonCtr=2 22:48:32 (3256): No heartbeat from core client for 30 sec - exiting 22:48:33 (3256): No heartbeat from core client for 30 sec - exiting 22:48:34 (3256): No heartbeat from core client for 30 sec - exiting 22:48:35 (3256): No heartbeat from core client for 30 sec - exiting 22:48:36 (3256): No heartbeat from core client for 30 sec - exiting 22:48:37 (3256): No heartbeat from core client for 30 sec - exiting 22:48:39 (3256): No heartbeat from core client for 30 sec - exiting 22:48:40 (3256): No heartbeat from core client for 30 sec - exiting 22:48:41 (3256): No heartbeat from core client for 30 sec - exiting 22:48:42 (3256): No heartbeat from core client for 30 sec - exiting 22:48:43 (3256): No heartbeat from core client for 30 sec - exiting 22:48:44 (3256): No heartbeat from core client for 30 sec - exiting 22:48:45 (3256): No heartbeat from core client for 30 sec - exiting 22:48:46 (3256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4984, selfPID=4984, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3924, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4012, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5660, selfPID=3956, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4592, selfPID=4024, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2760, iMonCtr=2 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2584, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4964, selfPID=1856, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6088, selfPID=908, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5580, selfPID=2100, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3712, selfPID=5188, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5280, selfPID=2424, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3872, selfPID=3036, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4692, selfPID=2880, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5032, selfPID=5032, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4700, selfPID=3192, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5800, selfPID=3928, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 13:47:01 (3128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2052, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4188, selfPID=3340, iMonCtr=1 Model crash detected, will try to restart... 21:25:40 (2500): No heartbeat from core client for 30 sec - exiting 21:25:41 (2500): No heartbeat from core client for 30 sec - exiting 21:25:42 (2500): No heartbeat from core client for 30 sec - exiting 21:25:44 (2500): No heartbeat from core client for 30 sec - exiting 21:25:45 (2500): No heartbeat from core client for 30 sec - exiting 21:25:46 (2500): No heartbeat from core client for 30 sec - exiting 21:25:47 (2500): No heartbeat from core client for 30 sec - exiting 21:25:48 (2500): No heartbeat from core client for 30 sec - exiting 21:25:49 (2500): No heartbeat from core client for 30 sec - exiting 21:25:50 (2500): No heartbeat from core client for 30 sec - exiting 21:25:51 (2500): No heartbeat from core client for 30 sec - exiting 21:25:52 (2500): No heartbeat from core client for 30 sec - exiting 21:25:53 (2500): No heartbeat from core client for 30 sec - exiting 21:25:54 (2500): No heartbeat from core client for 30 sec - exiting 21:25:56 (2500): No heartbeat from core client for 30 sec - exiting 21:25:57 (2500): No heartbeat from core client for 30 sec - exiting 21:25:58 (2500): No heartbeat from core client for 30 sec - exiting 21:25:59 (2500): No heartbeat from core client for 30 sec - exiting 21:26:00 (2500): No heartbeat from core client for 30 sec - exiting 21:26:01 (2500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5708, selfPID=4052, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5720, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4392, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4264, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3728, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3908, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5192, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1336, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5528, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5192, selfPID=5148, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5076, selfPID=4584, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5036, selfPID=1988, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3420, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3056, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4424, selfPID=5288, iMonCtr=1 Model crash detected, will try to restart... 20:42:07 (3800): No heartbeat from core client for 30 sec - exiting 20:42:08 (3800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1728, selfPID=5700, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5324, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5216, selfPID=4812, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4684, selfPID=640, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6096, selfPID=1044, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4548, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4360, selfPID=3700, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4752, selfPID=2516, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4720, selfPID=1896, iMonCtr=1 Model crash detected, will try to restart... 11:27:54 (4160): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4908, selfPID=4908, iMonCtr=2 11:27:55 (4160): No heartbeat from core client for 30 sec - exiting CGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5308, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4844, selfPID=4964, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4868, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4900, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6132, selfPID=5352, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3824, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3948, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4916, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4948, selfPID=4196, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 00:13:27 (2684): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5168, selfPID=2436, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6052, selfPID=2720, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4100, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6064, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6100, selfPID=1872, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2532, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5168, selfPID=2152, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1304, selfPID=3708, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3488, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4004, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5176, selfPID=3020, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4804, selfPID=3892, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5080, selfPID=3016, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5844, selfPID=4332, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2740, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=792, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4988, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4720, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4784, selfPID=2620, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4620, selfPID=3148, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4624, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4676, selfPID=1144, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3492, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2340, iMonCtr=2 Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4164, selfPID=3572, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4756, selfPID=2900, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5156, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=620, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5668, iMonCtr=2 GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4544, selfPID=516, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4052, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3888, selfPID=3148, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4320, selfPID=4276, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
08:54:54 (2320): No heartbeat from core client for 30 sec - exiting 08:54:55 (2320): No heartbeat from core client for 30 sec - exiting 08:54:56 (2320): No heartbeat from core client for 30 sec - exiting 08:54:57 (2320): No heartbeat from core client for 30 sec - exiting 08:54:58 (2320): No heartbeat from core client for 30 sec - exiting 08:54:59 (2320): No heartbeat from core client for 30 sec - exiting 08:55:00 (2320): No heartbeat from core client for 30 sec - exiting 08:55:01 (2320): No heartbeat from core client for 30 sec - exiting 08:55:02 (2320): No heartbeat from core client for 30 sec - exiting 08:55:03 (2320): No heartbeat from core client for 30 sec - exiting 08:55:04 (2320): No heartbeat from core client for 30 sec - exiting 08:55:06 (2320): No heartbeat from core client for 30 sec - exiting 08:55:07 (2320): No heartbeat from core client for 30 sec - exiting 08:55:08 (2320): No heartbeat from core client for 30 sec - exiting 08:55:09 (2320): No heartbeat from core client for 30 sec - exiting 08:55:10 (2320): No heartbeat from core client for 30 sec - exiting 08:55:11 (2320): No heartbeat from core client for 30 sec - exiting 08:55:12 (2320): No heartbeat from core client for 30 sec - exiting 08:55:13 (2320): No heartbeat from core client for 30 sec - exiting 08:55:14 (2320): No heartbeat from core client for 30 sec - exiting 08:55:15 (2320): No heartbeat from core client for 30 sec - exiting 08:55:16 (2320): No heartbeat from core client for 30 sec - exiting 08:55:18 (2320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3284, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=2 Model crash detected, will try to restart... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5624, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5164, selfPID=2584, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5624, selfPID=4344, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> upload failure: <file_xfer_error> <file_name>hadam3p_anz_c0h4_2013_1_009716485_0_12.zip</file_name> <error_code>-161 (not found)</error_code> </file_xfer_error> </message> ]]> |
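The stderr text above repeats a handful of message types many times. As a rough aid for reading it, the following sketch tallies those messages in a block of text. The label names and the `tally_events` helper are hypothetical and not part of BOINC or CPDN; the matched substrings are copied from the log, and the sample is a short excerpt assembled from fragments of it.

```python
import re
from collections import Counter

# Substrings copied from the stderr text; the dictionary keys are our own labels.
PATTERNS = {
    "model_crash": r"Model crash detected, will try to restart",
    "no_heartbeat": r"No heartbeat from core client for 30 sec",
    "quit_request": r"CPDN Monitor - Quit request from BOINC",
    "suspend_request": r"CPDN Monitor - Suspend request from BOINC",
}


def tally_events(stderr_text: str) -> Counter:
    """Count how often each known message appears in the given text."""
    counts = Counter()
    for label, pattern in PATTERNS.items():
        counts[label] = len(re.findall(pattern, stderr_text))
    return counts


# Short excerpt assembled from fragments of the log, for illustration only.
sample = (
    "Suspended CPDN Monitor - Suspend request from BOINC... "
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=5028, selfPID=4640, iMonCtr=1 "
    "Model crash detected, will try to restart... "
    "Leaving CPDN_Main::Monitor... "
    "21:58:08 (4032): No heartbeat from core client for 30 sec - exiting "
    "21:58:09 (4032): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - Quit request from BOINC..."
)

for label, count in tally_events(sample).items():
    print(f"{label}: {count}")
```

Running the same function over the full stderr text would give the totals for this task; those totals are not stated here because they have not been counted.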

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
13 Oct 2015 20:54:31 | 1171334 | 18271995 | hadam3p_anz_c0h4_2013_1_009716485_0 | 127,019 | 2,752,268 | 21.6682 |
08 Oct 2015 01:47:15 | 1171334 | 18271995 | hadam3p_anz_c0h4_2013_1_009716485_0 | 115,499 | 2,506,637 | 21.7027 |
02 Oct 2015 08:23:23 | 1171334 | 18271995 | hadam3p_anz_c0h4_2013_1_009716485_0 | 103,979 | 2,262,808 | 21.7622 |
01 Aug 2015 11:56:05 | 1171334 | 18271995 | hadam3p_anz_c0h4_2013_1_009716485_0 | 92,459 | 2,016,433 | 21.8089 |
13 Jun 2015 03:47:54 | 1171334 | 18271995 | hadam3p_anz_c0h4_2013_1_009716485_0 | 80,939 | 1,770,899 | 21.8794 |
05 Jun 2015 23:38:04 | 1171334 | 18271995 | hadam3p_anz_c0h4_2013_1_009716485_0 | 69,419 | 1,520,860 | 21.9084 |
26 May 2015 18:55:44 | 1171334 | 18271995 | hadam3p_anz_c0h4_2013_1_009716485_0 | 57,899 | 1,270,683 | 21.9465 |
18 May 2015 18:03:53 | 1171334 | 18271995 | hadam3p_anz_c0h4_2013_1_009716485_0 | 46,379 | 1,017,506 | 21.9389 |
10 May 2015 21:21:10 | 1171334 | 18271995 | hadam3p_anz_c0h4_2013_1_009716485_0 | 34,859 | 769,520 | 22.0752 |
08 May 2015 19:10:01 | 1171334 | 18271995 | hadam3p_anz_c0h4_2013_1_009716485_0 | 23,339 | 521,394 | 22.3400 |
25 Apr 2015 23:56:20 | 1171334 | 18271995 | hadam3p_anz_c0h4_2013_1_009716485_0 | 11,819 | 274,801 | 23.2508 |
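
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. That reading is an assumption based on the numbers, not something the page states. The sketch below checks it against three rows copied from the table; `seconds_per_timestep` is a hypothetical helper name.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average)
TRICKLES = [
    (127_019, 2_752_268, 21.6682),
    (115_499, 2_506_637, 21.7027),
    (11_819, 274_801, 23.2508),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds divided by cumulative timesteps."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in TRICKLES:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>7} {computed:.4f} (reported {reported:.4f})")
```

For these rows the computed value agrees with the reported one to four decimal places, which is consistent with the assumed definition.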