Name | hadcm3n_p6bw_1900_40_007225788_1 |
Workunit | 7424028 |
Created | 26 Apr 2011, 15:35:44 UTC |
Sent | 27 Apr 2011, 13:39:01 UTC |
Report deadline | 27 Jul 2011, 21:06:12 UTC |
Received | 26 May 2011, 4:19:55 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 969436 |
Run time | 25 days 20 hours 18 min 3 sec |
CPU time | 17 days 10 hours 6 min 12 sec |
Validate state | Invalid |
Credit | 9,642.24 |
Device peak FLOPS | 2.33 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-apple-darwin |
Stderr | <core_client_version>6.12.26</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Error: XML file: /Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_p6bw_1900_40_007225788/datain/varlevs.xml error in XML or file could not be opened. Error converting file to netcdf: dataout/p6bwko.pib0c10 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:13:55 (382): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:54:22 (760): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:51:19 (1029): No heartbeat from core client for 30 sec - exiting 14:51:20 (1029): No heartbeat from core client for 30 sec - exiting 14:51:21 (1029): No heartbeat from core client for 30 sec - exiting 14:51:22 (1029): No heartbeat from core client for 30 sec - exiting 14:51:23 (1029): No heartbeat from core client for 30 sec - exiting 14:51:24 (1029): No heartbeat from core client for 30 sec - exiting 14:51:25 (1029): No heartbeat from core client for 30 sec - exiting 14:51:26 (1029): No heartbeat from core client for 30 sec - exiting 14:51:27 (1029): No heartbeat from core client for 30 sec - exiting 14:51:28 (1029): No heartbeat from core client for 30 sec - exiting 14:51:29 (1029): No heartbeat from core client for 30 sec - exiting 14:51:31 (1029): No heartbeat from core client for 30 sec - exiting 14:51:32 (1029): No heartbeat from core client for 30 sec - exiting 14:51:33 (1029): No heartbeat from core client for 30 sec - exiting 14:51:34 (1029): No heartbeat from core client for 30 sec - exiting 14:51:35 (1029): No heartbeat from core client for 30 sec - exiting 14:51:36 (1029): No heartbeat from core client for 30 sec - exiting 14:51:37 (1029): No heartbeat from core client for 30 sec - exiting 14:51:38 (1029): No heartbeat from core client for 30 sec - exiting 14:51:39 (1029): No heartbeat from core client for 30 sec - exiting 14:51:40 (1029): No heartbeat from core client for 30 sec - exiting 14:51:41 (1029): No heartbeat from core client for 30 sec - exiting 14:51:42 (1029): No heartbeat from core client for 30 sec - exiting 14:51:43 (1029): No heartbeat from core client for 30 sec - exiting 14:51:44 (1029): No heartbeat from core client for 30 sec - exiting 14:51:45 (1029): No heartbeat from core client for 30 sec - exiting 14:51:46 (1029): No heartbeat from core client for 30 sec - exiting 14:51:47 (1029): No heartbeat from core client for 30 sec - exiting 14:51:48 (1029): No heartbeat from core client for 30 sec - exiting 14:51:49 (1029): No heartbeat from core client for 30 sec - exiting 14:51:50 (1029): No heartbeat from core client for 30 sec - exiting 14:51:51 (1029): No heartbeat from core client for 30 sec - exiting 14:51:52 (1029): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 02:20:45 (7335): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 137705) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=63352, iMonCtr=1 Model crash detected, will try to restart... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 137705) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=63352, iMonCtr=1 Model crash detected, will try to restart... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 137705) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=63352, iMonCtr=1 Model crash detected, will try to restart... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 137705) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=63352, iMonCtr=1 Model crash detected, will try to restart... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 137705) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=63352, iMonCtr=1 Model crash detected, will try to restart... execl(/Library/Application Support/BOINC Data/projects/climateprediction.net/hadcm3n_um_6.07_i686-apple-darwin, 137705) failed! Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=63352, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
25 May 2011 15:06:33 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 803,520 | 1,475,684 | 1.8365 |
24 May 2011 18:30:14 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 777,600 | 1,427,054 | 1.8352 |
23 May 2011 06:06:09 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 751,680 | 1,379,488 | 1.8352 |
22 May 2011 11:04:51 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 725,760 | 1,331,249 | 1.8343 |
19 May 2011 14:28:12 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 699,840 | 1,282,931 | 1.8332 |
18 May 2011 15:03:24 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 673,920 | 1,233,079 | 1.8297 |
17 May 2011 17:37:19 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 648,000 | 1,184,913 | 1.8286 |
16 May 2011 22:19:42 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 622,080 | 1,137,328 | 1.8283 |
16 May 2011 02:44:07 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 596,160 | 1,089,377 | 1.8273 |
15 May 2011 06:35:33 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 570,240 | 1,040,685 | 1.8250 |
14 May 2011 09:55:21 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 544,320 | 991,959 | 1.8224 |
13 May 2011 12:30:15 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 518,400 | 941,885 | 1.8169 |
12 May 2011 15:02:20 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 492,480 | 892,394 | 1.8120 |
11 May 2011 18:59:56 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 466,560 | 844,284 | 1.8096 |
11 May 2011 00:23:19 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 440,640 | 797,358 | 1.8095 |
10 May 2011 07:50:26 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 414,720 | 752,246 | 1.8139 |
09 May 2011 14:01:35 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 388,800 | 705,603 | 1.8148 |
08 May 2011 18:49:39 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 362,880 | 658,393 | 1.8144 |
08 May 2011 00:59:11 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 336,960 | 612,901 | 1.8189 |
07 May 2011 03:39:01 | 969436 | 12831504 | hadcm3n_p6bw_1900_40_007225788_1 | 311,040 | 566,239 | 1.8205 |
©2024 cpdn.org