Name | famous_ug38_1399_200_006653695_3 |
Workunit | 6857067 |
Created | 10 Jun 2010, 14:02:22 UTC |
Sent | 21 Aug 2010, 7:28:18 UTC |
Report deadline | 20 Nov 2010, 14:55:29 UTC |
Received | 29 Aug 2010, 0:59:21 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1095309 |
Run time | 6 days 13 hours 37 min 33 sec |
CPU time | 6 days 11 hours 16 min 37 sec |
Validate state | Invalid |
Credit | 5,002.91 |
Device peak FLOPS | 2.78 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Signal 11 received, exiting... 10:47:05 (1008): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=488, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Signal 11 received, exiting... 12:23:47 (3508): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=488, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 13:17:09 (3820): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=488, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... 11:05:31 (1916): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 13:40:02 (2364): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 15:17:55 (1252): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 20:32:38 (3596): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 22:28:50 (732): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 00:38:27 (3016): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 01:20:32 (3868): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 03:00:42 (1892): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 03:37:17 (3928): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Signal 11 received, exiting... 13:00:12 (3384): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 16:21:12 (1132): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 16:36:05 (1180): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3576, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 22:30:43 (2680): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2548, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 23:36:39 (1976): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2548, iMonCtr=1 Model crash detected, will try to restart... Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Sorry, too many model crashes! :-( 01:52:36 (2548): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
29 Aug 2010 00:13:13 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,516,346 | 556,472 | 0.3670 |
29 Aug 2010 00:12:28 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,506,986 | 553,119 | 0.3670 |
28 Aug 2010 22:17:04 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,497,626 | 549,768 | 0.3671 |
28 Aug 2010 21:24:14 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,488,266 | 546,422 | 0.3672 |
28 Aug 2010 20:25:59 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,478,906 | 543,070 | 0.3672 |
28 Aug 2010 19:32:01 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,469,546 | 539,720 | 0.3673 |
28 Aug 2010 18:36:30 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,460,186 | 536,374 | 0.3673 |
28 Aug 2010 17:36:12 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,450,826 | 533,027 | 0.3674 |
28 Aug 2010 16:43:14 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,441,466 | 529,680 | 0.3675 |
28 Aug 2010 15:44:47 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,432,106 | 526,324 | 0.3675 |
28 Aug 2010 14:51:39 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,422,746 | 522,960 | 0.3676 |
28 Aug 2010 13:53:46 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,413,386 | 519,602 | 0.3676 |
28 Aug 2010 12:55:26 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,404,026 | 516,242 | 0.3677 |
28 Aug 2010 12:02:11 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,394,666 | 512,880 | 0.3677 |
28 Aug 2010 11:03:47 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,385,306 | 509,517 | 0.3678 |
28 Aug 2010 10:09:37 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,375,946 | 506,152 | 0.3679 |
28 Aug 2010 09:14:21 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,366,586 | 502,795 | 0.3679 |
28 Aug 2010 08:20:23 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,357,226 | 499,431 | 0.3680 |
28 Aug 2010 08:20:23 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,347,866 | 496,066 | 0.3680 |
28 Aug 2010 06:27:33 | 1095309 | 11519319 | famous_ug38_1399_200_006653695_3 | 1,338,506 | 492,702 | 0.3681 |
©2024 cpdn.org