Name | hadcm3n_o1hx_1940_40_008381477_3 |
Workunit | 8532336 |
Created | 9 Jul 2013, 20:42:09 UTC |
Sent | 11 Jul 2013, 20:32:25 UTC |
Report deadline | 11 Oct 2013, 3:59:36 UTC |
Received | 15 Aug 2013, 20:03:34 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1209810 |
Run time | 9 days 18 hours 55 min 12 sec |
CPU time | 9 days 0 hours 24 min 12 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 2.71 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> Das Gerät erkennt den Befehl nicht. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold 18:09:57 (5404): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4584, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2000, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:01:16 (5688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5024, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3228, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:04:15 (5276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 21:10:23 (1332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4512, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4512, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4512, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4512, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4512, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4512, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
14 Aug 2013 17:09:16 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 518,400 | 742,617 | 1.4325 |
14 Aug 2013 15:59:39 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 492,480 | 705,195 | 1.4319 |
14 Aug 2013 15:59:39 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 466,560 | 667,992 | 1.4317 |
14 Aug 2013 15:59:39 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 440,640 | 630,800 | 1.4316 |
14 Aug 2013 15:59:39 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 414,720 | 592,963 | 1.4298 |
14 Aug 2013 15:59:39 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 388,800 | 555,248 | 1.4281 |
14 Aug 2013 15:59:39 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 362,880 | 516,650 | 1.4237 |
14 Aug 2013 15:59:38 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 336,960 | 480,210 | 1.4251 |
14 Aug 2013 15:59:38 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 311,040 | 443,870 | 1.4271 |
14 Aug 2013 15:59:38 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 285,120 | 407,814 | 1.4303 |
14 Aug 2013 15:59:38 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 259,200 | 370,572 | 1.4297 |
26 Jul 2013 07:11:53 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 233,280 | 334,955 | 1.4358 |
23 Jul 2013 20:33:43 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 207,360 | 296,831 | 1.4315 |
23 Jul 2013 20:16:13 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 181,440 | 259,381 | 1.4296 |
23 Jul 2013 19:56:49 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 155,520 | 222,569 | 1.4311 |
23 Jul 2013 18:44:22 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 129,600 | 185,322 | 1.4300 |
23 Jul 2013 18:44:22 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 103,680 | 149,359 | 1.4406 |
23 Jul 2013 18:44:22 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 77,760 | 112,678 | 1.4490 |
23 Jul 2013 18:44:21 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 51,840 | 75,555 | 1.4575 |
23 Jul 2013 18:44:21 | 1209810 | 15889752 | hadcm3n_o1hx_1940_40_008381477_3 | 25,920 | 38,048 | 1.4679 |
©2024 cpdn.org