Name | hadcm3n_yjde_1900_40_007357964_2 |
Workunit | 7555394 |
Created | 8 Jul 2011, 20:15:52 UTC |
Sent | 8 Jul 2011, 20:28:05 UTC |
Report deadline | 8 Oct 2011, 3:55:16 UTC |
Received | 17 Oct 2011, 9:55:54 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 969683 |
Run time | 12 days 19 hours 13 min 37 sec |
CPU time | 10 days 11 hours 17 min 40 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 2.57 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.6.20</core_client_version> <![CDATA[ <message> El dispositivo no reconoce el comando. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CreateFile error 32 when trying set file time CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:47:29 (5648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:47:30 (5648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 11:19:52 (7572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 09:27:49 (5916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:27:50 (5916): No heartbeat from core client for 30 sec - exiting 11:17:17 (7188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 10:02:49 (4756): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:02:50 (4756): No heartbeat from core client for 30 sec - exiting 09:54:16 (6908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:35:03 (3936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:35:04 (3936): No heartbeat from core client for 30 sec - exiting No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6352, selfPID=6352, iMonCtr=1 CPDN Monitor - Quit request from BOINC... 10:05:02 (1828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:05:03 (1828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 08:37:47 (3228): No heartbeat from core client for 30 sec - exiting 08:37:48 (3228): No heartbeat from core client for 30 sec - exiting 08:37:49 (3228): No heartbeat from core client for 30 sec - exiting 08:37:50 (3228): No heartbeat from core client for 30 sec - exiting 08:37:51 (3228): No heartbeat from core client for 30 sec - exiting 08:37:52 (3228): No heartbeat from core client for 30 sec - exiting 08:37:53 (3228): No heartbeat from core client for 30 sec - exiting 08:37:54 (3228): No heartbeat from core client for 30 sec - exiting 08:37:55 (3228): No heartbeat from core client for 30 sec - exiting 08:37:56 (3228): No heartbeat from core client for 30 sec - exiting 08:37:57 (3228): No heartbeat from core client for 30 sec - exiting 08:37:58 (3228): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:19:30 (4204): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:06:22 (5368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold Ocean Restart file copy failed on yjdeko.dab55a0 Ocean Restart file copy failed on yjdeko.dab55b0 Ocean Restart file copy failed on yjdeko.dab55c0 12:09:33 (3776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:09:34 (3776): No heartbeat from core client for 30 sec - exiting 12:09:35 (3776): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold CPDN Monitor - Quit request from BOINC... 11:06:05 (5572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:38:29 (3760): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:37:38 (1752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:04:27 (676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 10:37:15 (1304): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:37:43 (5300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:37:51 (3736): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:05:39 (5872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 10:09:30 (1288): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:37:20 (1588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x7C921780 read attempt to address 0x400D47D8 Engaging BOINC Windows Runtime Debugger... 11:06:49 (7560): No heartbeat from core client for 30 sec - exiting 11:06:51 (7560): No heartbeat from core client for 30 sec - exiting 11:06:52 (7560): No heartbeat from core client for 30 sec - exiting 11:06:54 (7560): No heartbeat from core client for 30 sec - exiting 11:06:55 (7560): No heartbeat from core client for 30 sec - exiting 11:06:56 (7560): No heartbeat from core client for 30 sec - exiting 11:06:57 (7560): No heartbeat from core client for 30 sec - exiting 11:06:58 (7560): No heartbeat from core client for 30 sec - exiting 11:06:59 (7560): No heartbeat from core client for 30 sec - exiting 11:07:00 (7560): No heartbeat from core client for 30 sec - exiting 11:07:01 (7560): No heartbeat from core client for 30 sec - exiting 11:07:02 (7560): No heartbeat from core client for 30 sec - exiting 11:07:03 (7560): No heartbeat from core client for 30 sec - exiting 11:07:04 (7560): No heartbeat from core client for 30 sec - exiting 11:07:05 (7560): No heartbeat from core client for 30 sec - exiting 11:07:06 (7560): No heartbeat from core client for 30 sec - exiting 11:07:07 (7560): No heartbeat from core client for 30 sec - exiting 11:07:08 (7560): No heartbeat from core client for 30 sec - exiting 11:07:10 (7560): No heartbeat from core client for 30 sec - exiting 11:07:11 (7560): No heartbeat from core client for 30 sec - exiting 11:07:12 (7560): No heartbeat from core client for 30 sec - exiting 11:07:13 (7560): No heartbeat from core client for 30 sec - exiting 11:07:15 (7560): No heartbeat from core client for 30 sec - exiting 11:08:12 (7560): No heartbeat from core client for 30 sec - exiting 11:08:16 (7560): No heartbeat from core client for 30 sec - exiting 11:08:18 (7560): No heartbeat from core client for 30 sec - exiting 11:08:19 (7560): No heartbeat from core client for 30 sec - exiting 11:08:20 (7560): No heartbeat from core client for 30 sec - exiting 11:08:21 (7560): No heartbeat from core client for 30 sec - exiting 11:08:22 (7560): No heartbeat from core client for 30 sec - exiting 11:08:23 (7560): No heartbeat from core client for 30 sec - exiting 11:08:24 (7560): No heartbeat from core client for 30 sec - exiting 11:08:25 (7560): No heartbeat from core client for 30 sec - exiting 11:08:27 (7560): No heartbeat from core client for 30 sec - exiting 11:08:28 (7560): No heartbeat from core client for 30 sec - exiting 11:08:29 (7560): No heartbeat from core client for 30 sec - exiting 11:08:30 (7560): No heartbeat from core client for 30 sec - exiting 11:08:31 (7560): No heartbeat from core client for 30 sec - exiting 11:08:33 (7560): No heartbeat from core client for 30 sec - exiting 11:08:34 (7560): No heartbeat from core client for 30 sec - exiting 11:08:35 (7560): No heartbeat from core client for 30 sec - exiting 11:08:36 (7560): No heartbeat from core client for 30 sec - exiting 11:08:38 (7560): No heartbeat from core client for 30 sec - exiting 11:08:39 (7560): No heartbeat from core client for 30 sec - exiting 11:08:40 (7560): No heartbeat from core client for 30 sec - exiting 11:08:41 (7560): No heartbeat from core client for 30 sec - exiting 11:08:42 (7560): No heartbeat from core client for 30 sec - exiting 11:08:43 (7560): No heartbeat from core client for 30 sec - exiting 11:08:44 (7560): No heartbeat from core client for 30 sec - exiting 11:08:45 (7560): No heartbeat from core client for 30 sec - exiting 11:08:46 (7560): No heartbeat from core client for 30 sec - exiting 11:08:47 (7560): No heartbeat from core client for 30 sec - exiting 11:08:48 (7560): No heartbeat from core client for 30 sec - exiting 11:08:49 (7560): No heartbeat from core client for 30 sec - exiting 11:08:50 (7560): No heartbeat from core client for 30 sec - exiting 11:08:52 (7560): No heartbeat from core client for 30 sec - exiting 11:08:53 (7560): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1588, selfPID=1588, iMonCtr=1 09:13:51 (6452): No heartbeat from core client for 30 sec - exiting 09:13:52 (6452): No heartbeat from core client for 30 sec - exiting 09:13:53 (6452): No heartbeat from core client for 30 sec - exiting 09:13:54 (6452): No heartbeat from core client for 30 sec - exiting 09:13:55 (6452): No heartbeat from core client for 30 sec - exiting 09:13:56 (6452): No heartbeat from core client for 30 sec - exiting 09:13:57 (6452): No heartbeat from core client for 30 sec - exiting 09:13:58 (6452): No heartbeat from core client for 30 sec - exiting Signal 11 received, exiting... Called boinc_finish Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x7C9208D3 read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... MainError: 07:09:08 AM No files match the supplied pattern. MainError: 07:09:08 AM No files match the supplied pattern. Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7560, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7560, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7560, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7560, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
05 Oct 2011 13:04:42 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 518,400 | 861,786 | 1.6624 |
03 Oct 2011 11:40:28 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 492,480 | 815,351 | 1.6556 |
28 Sep 2011 14:40:09 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 466,560 | 770,450 | 1.6513 |
26 Sep 2011 09:25:13 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 440,640 | 725,402 | 1.6462 |
21 Sep 2011 14:26:05 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 414,720 | 679,580 | 1.6386 |
19 Sep 2011 08:37:42 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 388,800 | 634,811 | 1.6327 |
14 Sep 2011 12:59:46 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 362,880 | 590,995 | 1.6286 |
12 Sep 2011 10:32:41 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 336,960 | 548,721 | 1.6284 |
06 Sep 2011 11:25:04 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 311,040 | 504,227 | 1.6211 |
28 Jul 2011 09:24:04 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 285,120 | 459,876 | 1.6129 |
26 Jul 2011 11:04:52 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 259,200 | 415,439 | 1.6028 |
25 Jul 2011 17:26:50 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 233,280 | 370,606 | 1.5887 |
25 Jul 2011 17:24:52 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 207,360 | 330,151 | 1.5922 |
25 Jul 2011 17:24:52 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 181,440 | 290,195 | 1.5994 |
25 Jul 2011 17:24:52 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 155,520 | 250,309 | 1.6095 |
25 Jul 2011 17:24:52 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 129,600 | 208,926 | 1.6121 |
25 Jul 2011 17:24:52 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 103,680 | 167,069 | 1.6114 |
10 Jul 2011 18:11:29 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 77,760 | 125,096 | 1.6087 |
10 Jul 2011 03:24:47 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 51,840 | 82,915 | 1.5994 |
09 Jul 2011 13:36:59 | 969683 | 13133828 | hadcm3n_yjde_1900_40_007357964_2 | 25,920 | 41,347 | 1.5952 |
©2024 cpdn.org