Name | hadam3p_eu_83bx_2001_1_007654342_0 |
Workunit | 7809429 |
Created | 4 Jan 2012, 15:15:59 UTC |
Sent | 11 Jan 2012, 15:11:19 UTC |
Report deadline | 23 Dec 2012, 20:31:19 UTC |
Received | 26 Feb 2012, 17:14:27 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -187 (0xFFFFFF45) ERR_RESULT_UPLOAD |
Computer ID | 1188589 |
Run time | 5 days 5 hours 10 min 23 sec |
CPU time | 4 days 6 hours 32 min 38 sec |
Validate state | Invalid |
Credit | 1,790.21 |
Device peak FLOPS | 1.52 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> upload failure </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2536, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4280, selfPID=3956, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4400, selfPID=3620, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4748, selfPID=3984, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5276, selfPID=5276, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5116, selfPID=4084, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4548, selfPID=3680, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3432, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4424, selfPID=3716, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5376, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5384, selfPID=3844, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3032, selfPID=3836, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4576, selfPID=3672, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5572, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2500, selfPID=6020, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4672, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4680, selfPID=3804, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5880, selfPID=3964, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4604, selfPID=3852, iMonCtr=1 Model crash detected, will try to restart... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4668, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4512, selfPID=3768, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4060, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4324, selfPID=3684, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4896, selfPID=3648, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5392, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4064, selfPID=2352, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1364, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1888, selfPID=3632, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4360, selfPID=3948, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=972, selfPID=3984, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2736, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3628, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4524, selfPID=3596, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4340, selfPID=3880, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3328, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3028, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4356, selfPID=3932, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4192, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4200, selfPID=3164, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4420, selfPID=4420, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5476, selfPID=5524, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5604, selfPID=2596, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3300, selfPID=3484, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4272, selfPID=1000, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4440, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3296, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2148, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4296, selfPID=3968, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1228, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4356, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4608, selfPID=4024, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3824, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5700, selfPID=5700, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4232, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3112, selfPID=3920, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4268, selfPID=3704, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4484, selfPID=4052, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3900, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... GCPDN Monitor - Quit request from BOINC... BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 32 Model crashed: STWORK : Error in PP_FILE tmp/xaakm.pipe_dummy 2048 Leaving CPDN_Main::Monitor... zip error: Output file write failure (write error on zip file) Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
24 Feb 2012 19:15:41 | 1188589 | 13856541 | hadam3p_eu_83bx_2001_1_007654342_0 | 103,776 | 331,520 | 3.1946 |
22 Feb 2012 05:03:46 | 1188589 | 13856541 | hadam3p_eu_83bx_2001_1_007654342_0 | 92,271 | 294,710 | 3.1940 |
21 Feb 2012 19:25:07 | 1188589 | 13856541 | hadam3p_eu_83bx_2001_1_007654342_0 | 92,261 | 294,063 | 3.1873 |
20 Feb 2012 19:24:43 | 1188589 | 13856541 | hadam3p_eu_83bx_2001_1_007654342_0 | 92,256 | 293,572 | 3.1821 |
19 Feb 2012 07:10:29 | 1188589 | 13856541 | hadam3p_eu_83bx_2001_1_007654342_0 | 80,736 | 259,217 | 3.2107 |
15 Feb 2012 06:38:34 | 1188589 | 13856541 | hadam3p_eu_83bx_2001_1_007654342_0 | 69,216 | 221,879 | 3.2056 |
11 Feb 2012 14:10:44 | 1188589 | 13856541 | hadam3p_eu_83bx_2001_1_007654342_0 | 57,696 | 186,341 | 3.2297 |
05 Feb 2012 08:49:39 | 1188589 | 13856541 | hadam3p_eu_83bx_2001_1_007654342_0 | 46,176 | 150,991 | 3.2699 |
02 Feb 2012 14:39:14 | 1188589 | 13856541 | hadam3p_eu_83bx_2001_1_007654342_0 | 34,656 | 113,900 | 3.2866 |
01 Feb 2012 09:51:06 | 1188589 | 13856541 | hadam3p_eu_83bx_2001_1_007654342_0 | 23,136 | 79,597 | 3.4404 |
29 Jan 2012 11:20:36 | 1188589 | 13856541 | hadam3p_eu_83bx_2001_1_007654342_0 | 11,616 | 41,670 | 3.5873 |
©2024 cpdn.org