Name | hadcm3n_scry_1940_40_009114476_0 |
Workunit | 9244812 |
Created | 22 Oct 2014, 15:32:48 UTC |
Sent | 23 Oct 2014, 5:20:13 UTC |
Report deadline | 22 Jan 2015, 12:47:24 UTC |
Received | 31 Oct 2014, 14:28:27 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1241734 |
Run time | 6 days 18 hours 47 min 24 sec |
CPU time | 6 days 9 hours 4 min 6 sec |
Validate state | Invalid |
Credit | 5,287.68 |
Device peak FLOPS | 3.41 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr |
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
11:59:40 (6592): No heartbeat from core client for 30 sec - exiting
11:59:41 (6592): No heartbeat from core client for 30 sec - exiting
11:59:42 (6592): No heartbeat from core client for 30 sec - exiting
11:59:43 (6592): No heartbeat from core client for 30 sec - exiting
11:59:44 (6592): No heartbeat from core client for 30 sec - exiting
11:59:45 (6592): No heartbeat from core client for 30 sec - exiting
11:59:46 (6592): No heartbeat from core client for 30 sec - exiting
11:59:47 (6592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:42:44 (6896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:27:52 (7252): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4024, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4024, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4024, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4024, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4024, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4024, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
31 Oct 2014 09:45:24 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 440,640 | 544,960 | 1.2367 |
31 Oct 2014 00:40:23 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 414,720 | 515,757 | 1.2436 |
30 Oct 2014 13:42:01 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 388,800 | 480,084 | 1.2348 |
29 Oct 2014 19:10:52 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 362,880 | 447,117 | 1.2321 |
29 Oct 2014 06:18:38 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 336,960 | 404,666 | 1.2009 |
28 Oct 2014 19:40:54 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 311,040 | 368,072 | 1.1834 |
28 Oct 2014 10:05:53 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 285,120 | 334,057 | 1.1716 |
27 Oct 2014 22:05:46 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 259,200 | 293,376 | 1.1319 |
26 Oct 2014 22:42:38 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 233,280 | 265,211 | 1.1369 |
26 Oct 2014 12:20:15 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 207,360 | 229,165 | 1.1052 |
25 Oct 2014 20:36:31 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 181,440 | 199,189 | 1.0978 |
25 Oct 2014 09:34:31 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 155,520 | 171,755 | 1.1044 |
25 Oct 2014 00:24:21 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 129,600 | 143,186 | 1.1048 |
24 Oct 2014 16:18:01 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 103,680 | 115,828 | 1.1172 |
24 Oct 2014 08:10:54 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 77,760 | 87,681 | 1.1276 |
24 Oct 2014 00:04:44 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 51,840 | 59,405 | 1.1459 |
23 Oct 2014 16:08:22 | 1241734 | 17256531 | hadcm3n_scry_1940_40_009114476_0 | 25,920 | 31,356 | 1.2097 |
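
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. The page does not state this formula, so it is an assumption here. A minimal sketch that checks it against the first and last trickle rows (the function name `average_sec_per_timestep` is hypothetical, not part of BOINC or CPDN):

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: this mirrors how the table's "Average (sec/TS)" column
    is derived; the source page does not document the formula.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec) pairs copied from the newest and oldest trickle rows
rows = [(440_640, 544_960), (25_920, 31_356)]
averages = [average_sec_per_timestep(cpu, ts) for ts, cpu in rows]
print(averages)
```

Under that assumption, the results agree with the table's 1.2367 and 1.2097 for those two rows.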