Name | hadam3p_eu_wasq_1985_1_006809346_1 |
Workunit | 7012662 |
Created | 31 Oct 2011, 1:25:04 UTC |
Sent | 31 Oct 2011, 1:25:07 UTC |
Report deadline | 12 Oct 2012, 6:45:07 UTC |
Received | 30 Nov 2011, 23:57:02 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 870397 |
Run time | 4 days 4 hours 19 min 58 sec |
CPU time | 4 days 4 hours 19 min 58 sec |
Validate state | Invalid |
Credit | 2,187.67 |
Device peak FLOPS | 2.31 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.2.18</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... 09:33:03 (4300): No heartbeat from core client for 30 sec - exiting 09:33:04 (4300): No heartbeat from core client for 30 sec - exiting 09:33:05 (4300): No heartbeat from core client for 30 sec - exiting 09:33:06 (4300): No heartbeat from core client for 30 sec - exiting 09:33:07 (4300): No heartbeat from core client for 30 sec - exiting 09:33:08 (4300): No heartbeat from core client for 30 sec - exiting 09:33:09 (4300): No heartbeat from core client for 30 sec - exiting 09:33:10 (4300): No heartbeat from core client for 30 sec - exiting 09:33:11 (4300): No heartbeat from core client for 30 sec - exiting 09:33:12 (4300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2484, selfPID=4296, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=2 Model crash detected, will try to restart... 16:42:00 (5996): No heartbeat from core client for 30 sec - exiting 16:42:01 (5996): No heartbeat from core client for 30 sec - exiting 16:42:02 (5996): No heartbeat from core client for 30 sec - exiting 16:42:03 (5996): No heartbeat from core client for 30 sec - exiting 16:42:04 (5996): No heartbeat from core client for 30 sec - exiting 16:42:05 (5996): No heartbeat from core client for 30 sec - exiting 16:42:06 (5996): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
09:13:19 (5564): No heartbeat from core client for 30 sec - exiting 09:13:20 (5564): No heartbeat from core client for 30 sec - exiting 09:13:21 (5564): No heartbeat from core client for 30 sec - exiting 09:13:22 (5564): No heartbeat from core client for 30 sec - exiting 09:13:23 (5564): No heartbeat from core client for 30 sec - exiting 09:13:24 (5564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5548, selfPID=5612, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3220, selfPID=6132, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5416, selfPID=5976, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4412, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5608, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3160, selfPID=5988, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1812, selfPID=2356, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4520, iMonCtr=2 Model crash detected, will try to restart... 13:07:45 (3972): No heartbeat from core client for 30 sec - exiting 13:07:46 (3972): No heartbeat from core client for 30 sec - exiting 13:07:47 (3972): No heartbeat from core client for 30 sec - exiting 13:07:48 (3972): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5712, selfPID=1160, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4388, selfPID=4388, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Colobntrollker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=340, iMonCtr=2 2 odel crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2404, selfPID=2404, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2452, selfPID=5312, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 09:05:54 (2648): No heartbeat from core client for 30 sec - exiting 09:05:56 (2648): No heartbeat from core client for 30 sec - exiting 09:05:57 (2648): No heartbeat from core client for 30 sec - exiting 09:05:58 (2648): No heartbeat from core client for 30 sec - exiting 09:06:00 (2648): No heartbeat from core client for 30 sec - exiting 09:06:02 (2648): No heartbeat from core client for 30 sec - exiting 09:06:03 (2648): No heartbeat from core client for 30 sec - exiting 09:06:05 (2648): No heartbeat from core client for 30 sec - exiting 09:06:09 (2648): No heartbeat from core client for 30 sec - exiting 09:06:11 (2648): No heartbeat from core client for 30 sec - exiting 09:06:13 (2648): No heartbeat from core client for 30 sec - exiting 09:06:15 (2648): No heartbeat from core client for 30 sec - exiting 09:09:33 (2648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 13:55:16 (2788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
13:55:17 (2788): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4580, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6004, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3964, selfPID=2604, iMonCtr=1 Model crash detected, will try to restart... 16:37:19 (4024): No heartbeat from core client for 30 sec - exiting 16:37:20 (4024): No heartbeat from core client for 30 sec - exiting 16:37:21 (4024): No heartbeat from core client for 30 sec - exiting 16:37:22 (4024): No heartbeat from core client for 30 sec - exiting 16:37:23 (4024): No heartbeat from core client for 30 sec - exiting 16:37:24 (4024): No heartbeat from core client for 30 sec - exiting 16:37:25 (4024): No heartbeat from core client for 30 sec - exiting 16:37:26 (4024): No heartbeat from core client for 30 sec - exiting 16:37:27 (4024): No heartbeat from core client for 30 sec - exiting 16:37:28 (4024): No heartbeat from core client for 30 sec - exiting 16:37:29 (4024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5796, selfPID=2312, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5968, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_wasq_1985_1_006809346\tmp\xaakg.namelists Image PC Routine Line Source hadrm3p_eu_um_6.0 0139C52A Unknown Unknown Unknown hadrm3p_eu_um_6.0 01344460 Unknown Unknown Unknown hadrm3p_eu_um_6.0 0134362A Unknown Unknown Unknown hadrm3p_eu_um_6.0 01322469 Unknown Unknown Unknown hadrm3p_eu_um_6.0 012266EB Unknown Unknown Unknown hadrm3p_eu_um_6.0 012C2AE2 Unknown Unknown Unknown hadrm3p_eu_um_6.0 012C35AF Unknown Unknown Unknown hadrm3p_eu_um_6.0 01069860 Unknown Unknown Unknown hadrm3p_eu_um_6.0 01380893 Unknown Unknown Unknown kernel32.dll 761AD309 Unknown Unknown Unknown ntdll.dll 776716C3 Unknown Unknown Unknown ntdll.dll 77671696 Unknown Unknown Unknown forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_wasq_1985_1_006809346\tmp\xaakm.namelists Image PC Routine Line Source hadam3p_eu_um_6.0 0164A39A Unknown Unknown Unknown hadam3p_eu_um_6.0 015F2CD0 Unknown Unknown Unknown hadam3p_eu_um_6.0 015F1E9A Unknown Unknown Unknown hadam3p_eu_um_6.0 015D2819 Unknown Unknown Unknown hadam3p_eu_um_6.0 014D2287 Unknown Unknown Unknown hadam3p_eu_um_6.0 0156E7B2 Unknown Unknown Unknown hadam3p_eu_um_6.0 0156F2DA Unknown Unknown Unknown hadam3p_eu_um_6.0 012E9BD2 Unknown Unknown Unknown hadam3p_eu_um_6.0 0162E638 Unknown Unknown Unknown kernel32.dll 761AD309 Unknown Unknown Unknown ntdll.dll 776716C3 Unknown Unknown Unknown ntdll.dll 77671696 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4800, selfPID=4232, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_eu_wasq_1985_1_006809346_1_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
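The stderr text above repeats a small set of messages many times, so a per-message tally can make the sequence of events easier to read. The following is a minimal sketch, not a CPDN or BOINC tool: the function name `tally_stderr`, the labels, and the abridged `sample` string are assumptions introduced for illustration, and the patterns are copied from messages that appear in the log. Lines garbled by interleaved writes (for example `2 odel crash detected`) will not match and are not counted.

```python
import re
from collections import Counter

# Message patterns copied from the stderr text; the labels are hypothetical.
PATTERNS = {
    "no_heartbeat": re.compile(r"No heartbeat from core client for 30 sec - exiting"),
    "quit_request": re.compile(r"CPDN Monitor - Quit request from BOINC"),
    "model_crash": re.compile(r"Model crash detected, will try to restart"),
    "fortran_eof": re.compile(r"forrtl: severe \(24\): end-of-file during read"),
}


def tally_stderr(text: str) -> Counter:
    """Count how often each known message occurs in a stderr dump."""
    counts = Counter()
    for label, pattern in PATTERNS.items():
        counts[label] = len(pattern.findall(text))
    return counts


# Abridged excerpt for demonstration only; paste the full stderr text instead.
sample = (
    "CPDN Monitor - Quit request from BOINC... "
    "09:33:03 (4300): No heartbeat from core client for 30 sec - exiting "
    "09:33:04 (4300): No heartbeat from core client for 30 sec - exiting "
    "Model crash detected, will try to restart... "
)

for label, count in tally_stderr(sample).items():
    print(f"{label}: {count}")
```
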
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
28 Nov 2011 08:55:35 | 870397 | 13570492 | hadam3p_eu_wasq_1985_1_006809346_1 | 126,816 | 346,776 | 2.7345 |
26 Nov 2011 01:42:49 | 870397 | 13570492 | hadam3p_eu_wasq_1985_1_006809346_1 | 115,296 | 315,594 | 2.7373 |
23 Nov 2011 09:53:18 | 870397 | 13570492 | hadam3p_eu_wasq_1985_1_006809346_1 | 103,776 | 283,175 | 2.7287 |
22 Nov 2011 01:22:00 | 870397 | 13570492 | hadam3p_eu_wasq_1985_1_006809346_1 | 92,256 | 252,467 | 2.7366 |
19 Nov 2011 01:21:45 | 870397 | 13570492 | hadam3p_eu_wasq_1985_1_006809346_1 | 80,736 | 220,879 | 2.7358 |
16 Nov 2011 01:21:37 | 870397 | 13570492 | hadam3p_eu_wasq_1985_1_006809346_1 | 69,216 | 189,476 | 2.7375 |
16 Nov 2011 01:21:37 | 870397 | 13570492 | hadam3p_eu_wasq_1985_1_006809346_1 | 57,696 | 158,349 | 2.7445 |
09 Nov 2011 07:31:26 | 870397 | 13570492 | hadam3p_eu_wasq_1985_1_006809346_1 | 46,176 | 127,125 | 2.7531 |
07 Nov 2011 04:46:22 | 870397 | 13570492 | hadam3p_eu_wasq_1985_1_006809346_1 | 34,656 | 95,922 | 2.7678 |
03 Nov 2011 07:41:44 | 870397 | 13570492 | hadam3p_eu_wasq_1985_1_006809346_1 | 23,136 | 63,345 | 2.7379 |
02 Nov 2011 05:17:24 | 870397 | 13570492 | hadam3p_eu_wasq_1985_1_006809346_1 | 11,620 | 31,603 | 2.7197 |
02 Nov 2011 00:52:37 | 870397 | 13570492 | hadam3p_eu_wasq_1985_1_006809346_1 | 11,616 | 31,152 | 2.6818 |
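The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against a few rows copied from the table; the function name `seconds_per_timestep` and the 0.0001 tolerance are assumptions made for illustration, not part of the CPDN site.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table.
ROWS = [
    (126_816, 346_776, 2.7345),
    (103_776, 283_175, 2.7287),
    (11_620, 31_603, 2.7197),
    (11_616, 31_152, 2.6818),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds divided by the number of timesteps completed."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in ROWS:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    # Assumed tolerance: the table shows four decimal places.
    matches = abs(computed - reported) < 1e-4
    print(f"{timestep:>7} {computed:.4f} reported={reported:.4f} match={matches}")
```

If the check holds for every row, the roughly constant 2.7 sec/TS figure suggests the host ran at a steady rate until the run ended.
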
©2024 cpdn.org