Name | hadam3p_anz_a0l4_2012_1_008612280_0 |
Workunit | 8758792 |
Created | 2 Apr 2014, 14:21:46 UTC |
Sent | 2 May 2014, 0:31:08 UTC |
Report deadline | 14 Apr 2015, 5:51:08 UTC |
Received | 16 Jun 2014, 15:00:41 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 1 (0x00000001) Unknown error code |
Computer ID | 1321213 |
Run time | 6 days 16 hours 21 min 32 sec |
CPU time | 5 days 20 hours 34 min 50 sec |
Validate state | Invalid |
Credit | 2,993.82 |
Device peak FLOPS | 2.31 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <message> Incorrect function. (0x1) - exit code 1 (0x1) </message> <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8752, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9024, iMonCtr=2 Model crash detected, will try to restart... 09:34:17 (4840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8108, selfPID=8108, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 11:58:06 (4868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5392, selfPID=5392, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 11:40:36 (5636): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:08:40 (6484): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 08:48:01 (5980): No heartbeat from core client for 30 sec - exiting 08:48:02 (5980): No heartbeat from core client for 30 sec - exiting 08:48:03 (5980): No heartbeat from core client for 30 sec - exiting 08:48:04 (5980): No heartbeat from core client for 30 sec - exiting 08:48:05 (5980): No heartbeat from core client for 30 sec - exiting 08:48:06 (5980): No heartbeat from core client for 30 sec - exiting 08:48:07 (5980): No heartbeat from core client for 30 sec - exiting 08:48:08 (5980): No heartbeat from core client for 30 sec - exiting 08:48:09 (5980): No heartbeat from core client for 30 sec - exiting 08:48:10 (5980): No heartbeat from core client for 30 sec - exiting 08:48:11 (5980): No heartbeat from core client for 30 sec - exiting 08:48:12 (5980): No heartbeat from core client for 30 sec - exiting 08:48:13 (5980): No heartbeat from core client for 30 sec - exiting 08:48:14 (5980): No heartbeat from core client for 30 sec - exiting 08:48:15 (5980): No heartbeat from core client for 30 sec - exiting 08:48:16 (5980): No heartbeat from core client for 30 sec - exiting 08:48:17 (5980): No heartbeat from core client for 30 sec - exiting 08:48:18 (5980): No heartbeat from core client for 30 sec - exiting 08:48:18 (1652): start_timer_thread(): CreateThread() failed, errno 0 08:48:19 (5980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... G09:22:14 (4740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5304, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:16:55 (11108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 17:39:26 (6212): No heartbeat from core client for 30 sec - exiting 17:39:27 (6212): No heartbeat from core client for 30 sec - exiting 17:39:28 (6212): No heartbeat from core client for 30 sec - exiting 17:39:29 (6212): No heartbeat from core client for 30 sec - exiting 17:39:30 (6212): No heartbeat from core client for 30 sec - exiting 17:39:31 (6212): No heartbeat from core client for 30 sec - exiting 17:39:32 (6212): No heartbeat from core client for 30 sec - exiting 17:39:33 (6212): No heartbeat from core client for 30 sec - exiting 17:39:34 (6212): No heartbeat from core client for 30 sec - exiting 17:39:35 (6212): No heartbeat from core client for 30 sec - exiting 17:39:36 (6212): No heartbeat from core client for 30 sec - exiting 17:39:37 (6212): No heartbeat from core client for 30 sec - exiting 17:39:38 (6212): No heartbeat from core client for 30 sec - exiting 17:39:39 (6212): No heartbeat from core client for 30 sec - exiting 17:39:40 (6212): No heartbeat from core client for 30 sec - exiting 17:39:41 (6212): No heartbeat from core client for 30 sec - exiting 17:39:42 (6212): No heartbeat from core client for 30 sec - exiting 17:39:43 (6212): No heartbeat from core client for 30 sec - exiting 17:39:44 (6212): No heartbeat from core client for 30 sec - exiting 17:39:45 (6212): No heartbeat from core client for 30 sec - exiting 17:39:46 (6212): No heartbeat from core client for 30 sec - exiting 17:39:47 (6212): No heartbeat from core client for 30 sec - exiting 17:39:48 (6212): No heartbeat from core client for 30 sec - exiting 17:39:49 (6212): No heartbeat from core client for 30 sec - exiting 17:39:50 (6212): No heartbeat from core client for 30 sec - exiting 17:39:51 (6212): No heartbeat from core client for 30 sec - exiting 17:39:52 (6212): No heartbeat from core client for 30 sec - exiting 17:39:53 (6212): No heartbeat from core client for 30 sec - exiting 17:39:54 (6212): No heartbeat from core client for 30 sec - exiting 17:39:55 (6212): No heartbeat from core client for 30 sec - exiting 17:39:56 (6212): No heartbeat from core client for 30 sec - exiting 17:39:57 (6212): No heartbeat from core client for 30 sec - exiting 17:39:58 (6212): No heartbeat from core client for 30 sec - exiting 17:39:59 (6212): No heartbeat from core client for 30 sec - exiting 17:40:00 (6212): No heartbeat from core client for 30 sec - exiting 17:40:01 (6212): No heartbeat from core client for 30 sec - exiting 17:40:02 (6212): No heartbeat from core client for 30 sec - exiting 17:40:03 (6212): No heartbeat from core client for 30 sec - exiting 17:40:04 (6212): No heartbeat from core client for 30 sec - exiting 17:40:05 (6212): No heartbeat from core client for 30 sec - exiting 17:40:06 (6212): No heartbeat from core client for 30 sec - exiting 17:40:07 (6212): No heartbeat from core client for 30 sec - exiting 17:40:08 (6212): No heartbeat from core client for 30 sec - exiting 17:40:09 (6212): No heartbeat from core client for 30 sec - exiting 17:40:10 (6212): No heartbeat from core client for 30 sec - exiting 17:40:11 (6212): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:40:12 (6212): No heartbeat from core client for 30 sec - exiting 17:40:13 (6212): No heartbeat from core client for 30 sec - exiting 17:40:14 (6212): No heartbeat from core client for 30 sec - exiting 17:58:58 (6408): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:58:59 (6408): No heartbeat from core client for 30 sec - exiting 17:59:32 (5900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7232, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:07:12 (5136): No heartbeat from core client for 30 sec - exiting 17:07:13 (5136): No heartbeat from core client for 30 sec - exiting 17:07:14 (5136): No heartbeat from core client for 30 sec - exiting 17:07:15 (5136): No heartbeat from core client for 30 sec - exiting 17:07:16 (5136): No heartbeat from core client for 30 sec - exiting 17:07:17 (5136): No heartbeat from core client for 30 sec - exiting 17:07:18 (5136): No heartbeat from core client for 30 sec - exiting 17:07:19 (5136): No heartbeat from core client for 30 sec - exiting 17:07:20 (5136): No heartbeat from core client for 30 sec - exiting 17:07:21 (5136): No heartbeat from core client for 30 sec - exiting 17:07:22 (5136): No heartbeat from core client for 30 sec - exiting 17:07:23 (5136): No heartbeat from core client for 30 sec - exiting 17:07:24 (5136): No heartbeat from core client for 30 sec - exiting 17:07:25 (5136): No heartbeat from core client for 30 sec - exiting 17:07:26 (5136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6128, iMonCtr=2 Model crash detected, will try to restart... 20:18:59 (5364): No heartbeat from core client for 30 sec - exiting 20:19:00 (5364): No heartbeat from core client for 30 sec - exiting 20:19:01 (5364): No heartbeat from core client for 30 sec - exiting 20:19:02 (5364): No heartbeat from core client for 30 sec - exiting 20:19:03 (5364): No heartbeat from core client for 30 sec - exiting 20:19:04 (5364): No heartbeat from core client for 30 sec - exiting 20:19:05 (5364): No heartbeat from core client for 30 sec - exiting 20:19:06 (5364): No heartbeat from core client for 30 sec - exiting 20:19:07 (5364): No heartbeat from core client for 30 sec - exiting 20:19:08 (5364): No heartbeat from core client for 30 sec - exiting 20:19:09 (5364): No heartbeat from core client for 30 sec - exiting 20:19:10 (5364): No heartbeat from core client for 30 sec - exiting 20:19:11 (5364): No heartbeat from core client for 30 sec - exiting 20:19:12 (5364): No heartbeat from core client for 30 sec - exiting 20:19:13 (5364): No heartbeat from core client for 30 sec - exiting 20:19:14 (5364): No heartbeat from core client for 30 sec - exiting 20:19:15 (5364): No heartbeat from core client for 30 sec - exiting 20:19:16 (5364): No heartbeat from core client for 30 sec - exiting 20:19:17 (5364): No heartbeat from core client for 30 sec - exiting 20:19:18 (5364): No heartbeat from core client for 30 sec - exiting 20:19:19 (5364): No heartbeat from core client for 30 sec - exiting 20:19:20 (5364): No heartbeat from core client for 30 sec - exiting 20:19:21 (5364): No heartbeat from core client for 30 sec - exiting 20:19:22 (5364): No heartbeat from core client for 30 sec - exiting 20:19:23 (5364): No heartbeat from core client for 30 sec - exiting 20:19:24 (5364): No heartbeat from core client for 30 sec - exiting 20:19:25 (5364): No heartbeat from core client for 30 sec - exiting 20:19:26 (5364): No heartbeat from core client for 30 sec - exiting 20:19:27 (5364): No heartbeat from core client for 30 sec - exiting 20:19:28 (5364): No heartbeat from core client for 30 sec - exiting 20:19:29 (5364): No heartbeat from core client for 30 sec - exiting 20:19:30 (5364): No heartbeat from core client for 30 sec - exiting 20:19:31 (5364): No heartbeat from core client for 30 sec - exiting 20:19:32 (5364): No heartbeat from core client for 30 sec - exiting 20:19:33 (5364): No heartbeat from core client for 30 sec - exiting 20:19:34 (5364): No heartbeat from core client for 30 sec - exiting 20:19:35 (5364): No heartbeat from core client for 30 sec - exiting 20:19:36 (5364): No heartbeat from core client for 30 sec - exiting 20:19:37 (5364): No heartbeat from core client for 30 sec - exiting 20:19:38 (5364): No heartbeat from core client for 30 sec - exiting 20:19:39 (5364): No heartbeat from core client for 30 sec - exiting 20:19:40 (5364): No heartbeat from core client for 30 sec - exiting 20:19:41 (5364): No heartbeat from core client for 30 sec - exiting 20:19:42 (5364): No heartbeat from core client for 30 sec - exiting 20:19:43 (5364): No heartbeat from core client for 30 sec - exiting 20:19:44 (5364): No heartbeat from core client for 30 sec - exiting 20:19:45 (5364): No heartbeat from core client for 30 sec - exiting 20:19:46 (5364): No heartbeat from core client for 30 sec - exiting 20:19:47 (5364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:55:02 (4640): start_timer_thread(): CreateThread() failed, errno 0 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6416, selfPID=5996, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18644, selfPID=18644, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checCPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6008, selfPID=6008, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:42:19 (4588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:31:00 (6796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:31:01 (6796): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 11:01:14 (4212): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5540, selfPID=2500, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 21:34:07 (5544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5744, selfPID=5744, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7156, selfPID=5348, iMonCtr=1 Model crash detected, will try to restart... 20:01:12 (5480): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6760, selfPID=6760, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:47:23 (12084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2300, selfPID=2300, iMonCtr=2 13:09:46 (5432): Can't set up shared mem: -1. Will run in standalone mode. 13:09:54 (5988): Can't set up shared mem: -1. Will run in standalone mode. 13:09:56 (2552): Can't set up shared mem: -1. Will run in standalone mode. No Process Handle Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5988, selfPID=2552, iMonCtr=1 No Process Handle Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5988, selfPID=5988, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5600, selfPID=5600, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5600, selfPID=6092, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
03 Jun 2014 21:59:03 | 1321213 | 16443251 | hadam3p_anz_a0l4_2012_1_008612280_0 | 69,419 | 464,867 | 6.6965 |
30 May 2014 17:47:31 | 1321213 | 16443251 | hadam3p_anz_a0l4_2012_1_008612280_0 | 57,899 | 387,611 | 6.6946 |
26 May 2014 00:21:28 | 1321213 | 16443251 | hadam3p_anz_a0l4_2012_1_008612280_0 | 46,379 | 309,761 | 6.6789 |
15 May 2014 17:46:01 | 1321213 | 16443251 | hadam3p_anz_a0l4_2012_1_008612280_0 | 34,859 | 234,623 | 6.7306 |
09 May 2014 02:14:27 | 1321213 | 16443251 | hadam3p_anz_a0l4_2012_1_008612280_0 | 23,339 | 156,491 | 6.7051 |
05 May 2014 13:35:26 | 1321213 | 16443251 | hadam3p_anz_a0l4_2012_1_008612280_0 | 11,819 | 76,280 | 6.4540 |
©2024 cpdn.org