Name | hadcm3n_yian_1940_40_007682505_1 |
Workunit | 7837592 |
Created | 15 Jan 2012, 23:48:23 UTC |
Sent | 15 Jan 2012, 23:48:35 UTC |
Report deadline | 16 Apr 2012, 7:15:46 UTC |
Received | 2 Jul 2012, 11:26:23 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1017301 |
Run time | 9 days 9 hours 37 min 30 sec |
CPU time | 7 days 21 hours 44 min 7 sec |
Validate state | Invalid |
Credit | 5,287.68 |
Device peak FLOPS | 2.74 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.25</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
15:42:34 (6004): No heartbeat from core client for 30 sec - exiting
15:42:35 (6004): No heartbeat from core client for 30 sec - exiting
15:42:36 (6004): No heartbeat from core client for 30 sec - exiting
15:42:37 (6004): No heartbeat from core client for 30 sec - exiting
15:42:38 (6004): No heartbeat from core client for 30 sec - exiting
15:42:39 (6004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:45:05 (10692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3216, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3216, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
Ocean Restart file copy failed on yianko.dae1220
Ocean Restart file copy failed on yianko.dae4ch0
Ocean Restart file copy failed on yianko.dae4ci0
Ocean Restart file copy failed on yianko.daf4b20
Ocean Restart file copy failed on yianko.daf8740
Ocean Restart file copy failed on yianko.daf8750
Ocean Restart file copy failed on yianko.daf8760
Ocean Restart file copy failed on yianko.daf8770
Ocean Restart file copy failed on yianko.daf8780
Ocean Restart file copy failed on yianko.daf8790
Ocean Restart file copy failed on yianko.daf87a0
Ocean Restart file copy failed on yianko.daf87b0
Ocean Restart file copy failed on yianko.daf87c0
Ocean Restart file copy failed on yianko.daf87d0
Ocean Restart file copy failed on yianko.daf87e0
Ocean Restart file copy failed on yianko.daf87f0
Ocean Restart file copy failed on yianko.daf87g0
Ocean Restart file copy failed on yianko.daf87h0
Ocean Restart file copy failed on yianko.daf87i0
07:13:01 (18056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy 2048
Atmos Hold Restart file rename failed on atmos_restart.hold
Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy 2048
Atmos Hold Restart file rename failed on atmos_restart.hold
Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy 2048
Atmos Hold Restart file rename failed on atmos_restart.hold
Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy 2048
Atmos Hold Restart file rename failed on atmos_restart.hold
Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy 2048
Atmos Hold Restart file rename failed on atmos_restart.hold
Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy 2048
Atmos Hold Restart file rename failed on atmos_restart.hold
Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy 2048
Sorry, too many model crashes! :-(
Called boinc_finish
</stderr_txt>
]]> |
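
The stderr text repeats a handful of message types many times, so a per-type count can make the failure sequence easier to read. The following is a minimal Python sketch under stated assumptions: the category labels in `MARKERS` and the function name `tally` are hypothetical and not part of BOINC or CPDN, and the sample is a short excerpt copied from the log above rather than the full text.

```python
from collections import Counter

# Short excerpt copied from the stderr text above (not the full log).
excerpt = """\
15:42:34 (6004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Model crash detected, will try to restart...
Ocean Restart file copy failed on yianko.dae1220
Ocean Restart file copy failed on yianko.dae4ch0
Model crashed: U_MODEL: Illegal combination of submodels tmp/pipe_dummy 2048
Atmos Hold Restart file rename failed on atmos_restart.hold
Sorry, too many model crashes!
"""

# Hypothetical category labels mapped to substrings that appear in the log.
MARKERS = {
    "heartbeat_lost": "No heartbeat from core client",
    "signal_22": "Signal 22 received",
    "restart_attempt": "Model crash detected",
    "ocean_copy_failed": "Ocean Restart file copy failed",
    "atmos_rename_failed": "Atmos Hold Restart file rename failed",
    "model_crashed": "Model crashed:",
    "gave_up": "too many model crashes",
}


def tally(log_text: str) -> Counter:
    """Count how often each marker substring occurs in the log text."""
    return Counter({label: log_text.count(marker) for label, marker in MARKERS.items()})


counts = tally(excerpt)
for label, n in counts.most_common():
    print(f"{label:20s} {n}")
```

Because the count is substring-based, it does not depend on where the line breaks fall in the log text.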
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
02 Jul 2012 11:29:05 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 440,640 | 659,692 | 1.4971 |
30 Jun 2012 13:46:11 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 414,720 | 619,452 | 1.4937 |
30 Jun 2012 00:24:58 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 388,800 | 578,974 | 1.4891 |
29 Jun 2012 11:25:30 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 362,880 | 538,421 | 1.4837 |
28 Jun 2012 22:44:10 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 336,960 | 498,564 | 1.4796 |
28 Jun 2012 11:26:29 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 311,040 | 460,161 | 1.4794 |
27 Jun 2012 22:54:06 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 285,120 | 421,668 | 1.4789 |
27 Jun 2012 11:35:39 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 259,200 | 384,444 | 1.4832 |
27 Jun 2012 00:14:42 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 233,280 | 347,458 | 1.4894 |
26 Jun 2012 13:05:00 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 207,360 | 310,830 | 1.4990 |
26 Jun 2012 01:41:59 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 181,440 | 274,384 | 1.5123 |
25 Jun 2012 14:41:44 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 155,520 | 237,877 | 1.5296 |
25 Jun 2012 02:42:44 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 129,600 | 200,862 | 1.5499 |
24 Jun 2012 09:35:39 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 103,680 | 164,056 | 1.5823 |
23 Jun 2012 21:07:13 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 77,760 | 126,209 | 1.6231 |
23 Jun 2012 09:08:44 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 51,840 | 88,689 | 1.7108 |
22 Jun 2012 21:50:17 | 1017301 | 13924010 | hadcm3n_yian_1940_40_007682505_1 | 25,920 | 51,097 | 1.9713 |
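
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. A minimal Python sketch, using three rows copied from the table above, checks that assumption; the function name `avg_sec_per_timestep` is illustrative and not part of any CPDN tool.

```python
def avg_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places like the table."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec, reported average) copied from the trickle table
rows = [
    (25_920, 51_097, 1.9713),
    (233_280, 347_458, 1.4894),
    (440_640, 659_692, 1.4971),
]

for timestep, cpu, reported in rows:
    computed = avg_sec_per_timestep(cpu, timestep)
    print(f"{timestep:>7} TS: computed {computed:.4f}, reported {reported:.4f}")
```

For these three rows the computed values agree with the reported ones to four decimal places, which is consistent with the column being a running average rather than a per-trickle rate.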
©2024 cpdn.org