Name | famous_ucvd_799_200_006649524_6 |
Workunit | 6852896 |
Created | 16 Aug 2010, 6:04:45 UTC |
Sent | 16 Aug 2010, 6:27:22 UTC |
Report deadline | 15 Nov 2010, 13:54:33 UTC |
Received | 30 Aug 2010, 1:31:30 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1064392 |
Run time | 1 days 9 hours 35 min 4 sec |
CPU time | 1 days 9 hours 19 min 53 sec |
Validate state | Invalid |
Credit | 1,142.71 |
Device peak FLOPS | 3.40 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Signal 4 received, exiting... 00:18:08 (2728): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2712, iMonCtr=1 Model crash detected, will try to restart... 19:40:24 (220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:41:22 (1068): Can't acquire lockfile (32) - waiting 35s 19:41:57 (1068): Can't acquire lockfile (32) - exiting 19:41:57 (1068): Error: The process cannot access the file because it is being used by another process. (0x20) 19:41:57 (3856): Can't acquire lockfile (32) - waiting 35s 19:42:32 (3856): Can't acquire lockfile (32) - exiting 19:42:32 (3856): Error: The process cannot access the file because it is being used by another process. (0x20) 19:42:32 (2520): Can't acquire lockfile (32) - waiting 35s 19:43:07 (2520): Can't acquire lockfile (32) - exiting 19:43:07 (2520): Error: The process cannot access the file because it is being used by another process. (0x20) 19:43:07 (4840): Can't acquire lockfile (32) - waiting 35s 20:47:43 (4840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:48:30 (3236): Can't acquire lockfile (32) - waiting 35s 20:49:05 (3236): Can't acquire lockfile (32) - exiting 20:49:05 (3236): Error: The process cannot access the file because it is being used by another process. (0x20) 20:49:05 (1352): Can't acquire lockfile (32) - waiting 35s 20:49:40 (1352): Can't acquire lockfile (32) - exiting 20:49:40 (1352): Error: The process cannot access the file because it is being used by another process. (0x20) 20:49:40 (4036): Can't acquire lockfile (32) - waiting 35s 20:50:15 (4036): Can't acquire lockfile (32) - exiting 20:50:15 (4036): Error: The process cannot access the file because it is being used by another process. (0x20) 20:50:16 (2544): Can't acquire lockfile (32) - waiting 35s Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4000, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 19:32:45 (4048): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3668, iMonCtr=1 Model crash detected, will try to restart... Signal 4 received, exiting... 21:18:17 (3564): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3668, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3668, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 MainError: 10:11:17 PM No files match the supplied pattern. MainError: 10:11:17 PM No files match the supplied pattern. cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_ucvd_799_200_006649524/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_ucvd_799_200_006649524/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_ucvd_799_200_006649524/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_ucvd_799_200_006649524/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_ucvd_799_200_006649524/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_ucvd_799_200_006649524/dataout/ocean_restart.day after 11 attempts Model crashed: READ_FLH: I/O error tmp/pipe_dummy Sorry, too many model crashes! :-( 17:31:42 (3148): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
30 Aug 2010 02:09:04 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 346,346 | 118,764 | 0.3429 |
29 Aug 2010 22:44:51 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 336,986 | 115,468 | 0.3426 |
29 Aug 2010 21:41:57 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 327,626 | 112,162 | 0.3423 |
29 Aug 2010 20:43:16 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 318,266 | 108,999 | 0.3425 |
29 Aug 2010 08:22:04 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 308,906 | 105,829 | 0.3426 |
29 Aug 2010 07:02:15 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 299,546 | 102,525 | 0.3423 |
27 Aug 2010 02:16:54 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 290,186 | 99,219 | 0.3419 |
27 Aug 2010 00:26:50 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 280,826 | 95,912 | 0.3415 |
25 Aug 2010 07:04:05 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 271,466 | 92,620 | 0.3412 |
25 Aug 2010 06:04:50 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 262,106 | 89,317 | 0.3408 |
25 Aug 2010 05:07:29 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 252,746 | 88,074 | 0.3485 |
24 Aug 2010 05:12:18 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 243,386 | 84,741 | 0.3482 |
24 Aug 2010 04:14:39 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 234,026 | 81,466 | 0.3481 |
24 Aug 2010 03:17:06 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 224,666 | 78,186 | 0.3480 |
23 Aug 2010 06:56:45 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 215,306 | 76,540 | 0.3555 |
23 Aug 2010 06:03:39 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 205,946 | 73,230 | 0.3556 |
23 Aug 2010 05:06:07 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 196,586 | 69,943 | 0.3558 |
23 Aug 2010 04:08:18 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 187,226 | 66,632 | 0.3559 |
23 Aug 2010 03:10:20 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 177,866 | 63,316 | 0.3560 |
23 Aug 2010 02:32:51 | 1064392 | 11659496 | famous_ucvd_799_200_006649524_6 | 168,506 | 60,019 | 0.3562 |
©2024 cpdn.org