Name | hadcm3n_oftg_1900_40_008475623_0 |
Workunit | 8626462 |
Created | 27 Sep 2013, 10:38:29 UTC |
Sent | 27 Sep 2013, 12:58:05 UTC |
Report deadline | 27 Dec 2013, 20:25:16 UTC |
Received | 18 Oct 2013, 0:25:05 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1167410 |
Run time | 18 days 5 hours 3 min 24 sec |
CPU time | 17 days 11 hours 4 min 13 sec |
Validate state | Invalid |
Credit | 9,642.24 |
Device peak FLOPS | 2.69 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
10:13:47 (552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
17 Oct 2013 14:58:08 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 803,520 | 1,506,279 | 1.8746 |
16 Oct 2013 23:53:50 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 777,600 | 1,456,832 | 1.8735 |
16 Oct 2013 09:19:36 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 751,680 | 1,407,248 | 1.8721 |
15 Oct 2013 17:48:56 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 725,760 | 1,358,038 | 1.8712 |
15 Oct 2013 03:07:53 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 699,840 | 1,308,943 | 1.8703 |
14 Oct 2013 12:29:08 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 673,920 | 1,259,295 | 1.8686 |
13 Oct 2013 23:12:13 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 648,000 | 1,211,519 | 1.8696 |
13 Oct 2013 09:54:38 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 622,080 | 1,163,690 | 1.8706 |
12 Oct 2013 20:38:57 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 596,160 | 1,115,862 | 1.8717 |
12 Oct 2013 07:24:16 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 570,240 | 1,068,017 | 1.8729 |
11 Oct 2013 18:06:48 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 544,320 | 1,020,224 | 1.8743 |
10 Oct 2013 21:24:14 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 518,400 | 971,312 | 1.8737 |
10 Oct 2013 05:24:17 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 492,480 | 923,426 | 1.8751 |
09 Oct 2013 16:09:52 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 466,560 | 875,801 | 1.8771 |
09 Oct 2013 02:50:14 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 440,640 | 828,151 | 1.8794 |
08 Oct 2013 13:35:16 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 414,720 | 780,567 | 1.8822 |
08 Oct 2013 00:13:45 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 388,800 | 732,958 | 1.8852 |
07 Oct 2013 10:41:39 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 362,880 | 684,664 | 1.8868 |
06 Oct 2013 21:27:29 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 336,960 | 637,270 | 1.8912 |
06 Oct 2013 07:08:59 | 1167410 | 16046337 | hadcm3n_oftg_1900_40_008475623_0 | 311,040 | 588,555 | 1.8922 |
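
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The page itself does not state this, so treat it as an assumption. A minimal Python sketch checks it against three rows copied from the table; the function name `average_sec_per_timestep` is illustrative and not part of any BOINC or CPDN tool.

```python
# (timestep, cpu_time_sec, reported_average) copied from the three most
# recent trickle rows in the table above.
ROWS = [
    (803_520, 1_506_279, 1.8746),
    (777_600, 1_456_832, 1.8735),
    (751_680, 1_407_248, 1.8721),
]


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


def matches_reported(cpu_time_sec: int, timestep: int, reported: float) -> bool:
    """True if the computed ratio agrees with the table to 4 decimal places."""
    return abs(average_sec_per_timestep(cpu_time_sec, timestep) - reported) < 5e-5


if __name__ == "__main__":
    for timestep, cpu, reported in ROWS:
        computed = average_sec_per_timestep(cpu, timestep)
        print(f"{timestep:>9,} TS  computed={computed:.4f}  reported={reported:.4f}")
```

The same rows also show a constant gap of 25,920 timesteps between consecutive trickles (for example, 803,520 − 777,600).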