Name | hadcm3n_ymfz_1900_40_007361945_0 |
Workunit | 7559375 |
Created | 6 Jul 2011, 15:22:28 UTC |
Sent | 7 Jul 2011, 8:53:28 UTC |
Report deadline | 6 Oct 2011, 16:20:39 UTC |
Received | 8 Sep 2011, 20:31:44 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 283705 |
Run time | 40 days 1 hours 24 min 55 sec |
CPU time | 40 days 1 hours 24 min 55 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 0.78 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>5.2.13</core_client_version> <message>The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 17:38:30 (3020): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 17:54:46 (1600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:01:02 (1120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:07:43 (3692): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:07:44 (3692): No heartbeat from core client for 30 sec - exiting 19:36:20 (2300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:36:23 (2300): No heartbeat from core client for 30 sec - exiting 21:54:00 (760): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:54:01 (760): No heartbeat from core client for 30 sec - exiting 22:00:09 (112): No heartbeat from core client for 30 sec - exiting 22:00:20 (112): No heartbeat from core client for 30 sec - exiting 22:00:32 (112): No heartbeat from core client for 30 sec - exiting 22:00:37 (112): No heartbeat from core client for 30 sec - exiting 22:00:43 (112): No heartbeat from core client for 30 sec - exiting 22:00:46 (112): No heartbeat from core client for 30 sec - exiting 22:00:52 (112): No heartbeat from core client for 30 sec - exiting 22:00:59 (112): No heartbeat from core client for 30 sec - exiting 22:01:07 (112): No heartbeat from core client for 30 sec - exiting 22:01:08 (112): No heartbeat from core client for 30 sec - exiting 22:01:20 (112): No heartbeat from core client for 30 sec - exiting 22:01:24 (112): No heartbeat from core client for 30 sec - exiting 22:01:32 (112): No heartbeat from core client for 30 sec - exiting 22:01:38 (112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:01:47 (112): No heartbeat from core client for 30 sec - exiting 22:01:57 (112): No heartbeat from core client for 30 sec - exiting 22:02:08 (112): No heartbeat from core client for 30 sec - exiting 22:02:18 (112): No heartbeat from core client for 30 sec - exiting 22:02:30 (112): No heartbeat from core client for 30 sec - exiting 22:08:18 (1120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:08:25 (1120): No heartbeat from core client for 30 sec - exiting 22:08:26 (1120): No heartbeat from core client for 30 sec - exiting 22:13:04 (1556): No heartbeat from core client for 30 sec - exiting 22:13:09 (1556): No heartbeat from core client for 30 sec - exiting 22:13:13 (1556): No heartbeat from core client for 30 sec - exiting 22:13:18 (1556): No heartbeat from core client for 30 sec - exiting 22:13:19 (1556): No heartbeat from core client for 30 sec - exiting 22:13:24 (1556): No heartbeat from core client for 30 sec - exiting 22:13:29 (1556): No heartbeat from core client for 30 sec - exiting 22:13:34 (1556): No heartbeat from core client for 30 sec - exiting 22:13:39 (1556): No heartbeat from core client for 30 sec - exiting 22:13:43 (1556): No heartbeat from core client for 30 sec - exiting 22:13:44 (1556): No heartbeat from core client for 30 sec - exiting 22:13:49 (1556): No heartbeat from core client for 30 sec - exiting 22:13:54 (1556): No heartbeat from core client for 30 sec - exiting 22:13:59 (1556): No heartbeat from core client for 30 sec - exiting 22:14:04 (1556): No heartbeat from core client for 30 sec - exiting 22:14:08 (1556): No heartbeat from core client for 30 sec - exiting 22:14:09 (1556): No heartbeat from core client for 30 sec - exiting 22:14:14 (1556): No heartbeat from core client for 30 sec - exiting 22:14:19 (1556): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:22:22 (3396): No heartbeat from core client for 30 sec - exiting 03:22:23 (3396): No heartbeat from core client for 30 sec - exiting 03:22:24 (3396): No heartbeat from core client for 30 sec - exiting 03:22:25 (3396): No heartbeat from core client for 30 sec - exiting 03:22:26 (3396): No heartbeat from core client for 30 sec - exiting 03:22:27 (3396): No heartbeat from core client for 30 sec - exiting 03:22:31 (3396): No heartbeat from core client for 30 sec - exiting 03:22:32 (3396): No heartbeat from core client for 30 sec - exiting 03:22:34 (3396): No heartbeat from core client for 30 sec - exiting 03:22:35 (3396): No heartbeat from core client for 30 sec - exiting 03:22:36 (3396): No heartbeat from core client for 30 sec - exiting 03:22:37 (3396): No heartbeat from core client for 30 sec - exiting 03:22:38 (3396): No heartbeat from core client for 30 sec - exiting 03:22:39 (3396): No heartbeat from core client for 30 sec - exiting 03:22:40 (3396): No heartbeat from core client for 30 sec - exiting 03:22:41 (3396): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 04:41:51 (4020): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:41:52 (4020): No heartbeat from core client for 30 sec - exiting 12:21:32 (184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:21:33 (184): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 16:56:26 (1832): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 06:41:39 (848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:02:15 (3600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:02:16 (3600): No heartbeat from core client for 30 sec - exiting 22:02:17 (3600): No heartbeat from core client for 30 sec - exiting 00:39:34 (3060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:00:18 (356): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:03:45 (2652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:03:46 (2652): No heartbeat from core client for 30 sec - exiting 18:41:12 (2588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:41:13 (2588): No heartbeat from core client for 30 sec - exiting 23:23:31 (420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:23:32 (420): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 11:46:58 (712): No heartbeat from core client for 30 sec - exiting 11:46:59 (712): No heartbeat from core client for 30 sec - exiting 11:47:00 (712): No heartbeat from core client for 30 sec - exiting 11:47:01 (712): No heartbeat from core client for 30 sec - exiting 11:47:02 (712): No heartbeat from core client for 30 sec - exiting 11:47:03 (712): No heartbeat from core client for 30 sec - exiting 11:47:04 (712): No heartbeat from core client for 30 sec - exiting 11:47:05 (712): No heartbeat from core client for 30 sec - exiting 11:47:09 (712): No heartbeat from core client for 30 sec - exiting 11:47:10 (712): No heartbeat from core client for 30 sec - exiting 11:47:11 (712): No heartbeat from core client for 30 sec - exiting 11:47:12 (712): No heartbeat from core client for 30 sec - exiting 11:47:13 (712): No heartbeat from core client for 30 sec - exiting 11:47:14 (712): No heartbeat from core client for 30 sec - exiting 11:47:15 (712): No heartbeat from core client for 30 sec - exiting 11:47:16 (712): No heartbeat from core client for 30 sec - exiting 11:47:17 (712): No heartbeat from core client for 30 sec - exiting 11:47:18 (712): No heartbeat from core client for 30 sec - exiting 11:47:20 (712): No heartbeat from core client for 30 sec - exiting 11:47:26 (712): No heartbeat from core client for 30 sec - exiting 11:47:27 (712): No heartbeat from core client for 30 sec - exiting 11:47:31 (712): No heartbeat from core client for 30 sec - exiting 11:47:32 (712): No heartbeat from core client for 30 sec - exiting 11:47:33 (712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:47:34 (712): No heartbeat from core client for 30 sec - exiting 11:47:35 (712): No heartbeat from core client for 30 sec - exiting 11:47:36 (712): No heartbeat from core client for 30 sec - exiting 11:47:37 (712): No heartbeat from core client for 30 sec - exiting 11:47:40 (712): No heartbeat from core client for 30 sec - exiting 11:47:44 (712): No heartbeat from core client for 30 sec - exiting 11:47:48 (712): No heartbeat from core client for 30 sec - exiting 11:47:53 (712): No heartbeat from core client for 30 sec - exiting 11:47:58 (712): No heartbeat from core client for 30 sec - exiting 11:48:02 (712): No heartbeat from core client for 30 sec - exiting 11:48:06 (712): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:50:24 (3648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:51:43 (3768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:51:44 (3768): No heartbeat from core client for 30 sec - exiting 17:51:46 (3768): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 22:28:05 (3504): No heartbeat from core client for 30 sec - exiting 22:28:06 (3504): No heartbeat from core client for 30 sec - exiting 22:28:07 (3504): No heartbeat from core client for 30 sec - exiting 22:28:08 (3504): No heartbeat from core client for 30 sec - exiting 22:28:09 (3504): No heartbeat from core client for 30 sec - exiting 22:28:10 (3504): No heartbeat from core client for 30 sec - exiting 22:28:11 (3504): No heartbeat from core client for 30 sec - exiting 22:28:12 (3504): No heartbeat from core client for 30 sec - exiting 22:28:13 (3504): No heartbeat from core client for 30 sec - exiting 22:28:14 (3504): No heartbeat from core client for 30 sec - exiting 22:28:15 (3504): No heartbeat from core client for 30 sec - exiting 22:28:16 (3504): No heartbeat from core client for 30 sec - exiting 22:28:17 (3504): No heartbeat from core client for 30 sec - exiting 22:28:18 (3504): No heartbeat from core client for 30 sec - exiting 22:28:19 (3504): No heartbeat from core client for 30 sec - exiting 22:28:20 (3504): No heartbeat from core client for 30 sec - exiting 22:28:21 (3504): No heartbeat from core client for 30 sec - exiting 22:28:22 (3504): No heartbeat from core client for 30 sec - exiting 22:28:23 (3504): No heartbeat from core client for 30 sec - exiting 22:28:24 (3504): No heartbeat from core client for 30 sec - exiting 22:28:25 (3504): No heartbeat from core client for 30 sec - exiting 22:28:26 (3504): No heartbeat from core client for 30 sec - exiting 22:28:27 (3504): No heartbeat from core client for 30 sec - exiting 22:28:29 (3504): No heartbeat from core client for 30 sec - exiting 22:28:32 (3504): No heartbeat from core client for 30 sec - exiting 22:28:33 (3504): No heartbeat from core client for 30 sec - exiting 22:28:34 (3504): No heartbeat from core client for 30 sec - exiting 22:28:38 (3504): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... zip error: Could not create output file (was replacing the original zip file) cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_ymfz_1900_40_007361945/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_ymfz_1900_40_007361945/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_ymfz_1900_40_007361945/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_ymfz_1900_40_007361945/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_ymfz_1900_40_007361945/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_ymfz_1900_40_007361945/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_ymfz_1900_40_007361945/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_ymfz_1900_40_007361945/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_ymfz_1900_40_007361945/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_ymfz_1900_40_007361945/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_ymfz_1900_40_007361945/dataout/atmos_restart.day after 11 attempts cpdnmonitor: cannot open input file C:\Program Files\BOINC/projects/climateprediction.net/hadcm3n_ymfz_1900_40_007361945/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
08 Sep 2011 19:31:23 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 518,400 | 3,461,315 | 6.6769 |
06 Sep 2011 14:04:07 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 492,480 | 3,288,567 | 6.6776 |
31 Aug 2011 11:55:53 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 466,560 | 3,115,267 | 6.6771 |
29 Aug 2011 05:35:22 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 440,640 | 2,942,153 | 6.6770 |
27 Aug 2011 02:11:31 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 414,720 | 2,769,515 | 6.6780 |
24 Aug 2011 04:32:19 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 388,800 | 2,596,323 | 6.6778 |
12 Aug 2011 06:53:24 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 362,880 | 2,422,948 | 6.6770 |
10 Aug 2011 04:17:15 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 336,960 | 2,250,253 | 6.6781 |
05 Aug 2011 05:24:01 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 311,040 | 2,077,440 | 6.6790 |
03 Aug 2011 02:38:43 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 285,120 | 1,904,181 | 6.6785 |
01 Aug 2011 08:52:20 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 259,200 | 1,731,171 | 6.6789 |
01 Aug 2011 08:52:20 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 233,280 | 1,557,953 | 6.6785 |
01 Aug 2011 08:52:20 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 207,360 | 1,385,332 | 6.6808 |
01 Aug 2011 08:52:20 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 181,440 | 1,212,648 | 6.6835 |
25 Jul 2011 18:52:22 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 155,520 | 1,039,461 | 6.6838 |
25 Jul 2011 16:48:34 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 129,600 | 866,422 | 6.6854 |
25 Jul 2011 14:56:27 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 103,680 | 693,623 | 6.6900 |
25 Jul 2011 13:18:08 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 77,760 | 520,509 | 6.6938 |
25 Jul 2011 13:18:08 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 51,840 | 346,497 | 6.6840 |
09 Jul 2011 11:46:21 | 283705 | 13127761 | hadcm3n_ymfz_1900_40_007361945_0 | 25,920 | 173,192 | 6.6818 |
©2024 cpdn.org