Name | hadcm3n_u0t3_1980_40_007694394_1 |
Workunit | 7849502 |
Created | 23 Jan 2012, 23:06:44 UTC |
Sent | 23 Jan 2012, 23:13:55 UTC |
Report deadline | 24 Apr 2012, 6:41:06 UTC |
Received | 6 Apr 2012, 17:52:58 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1173710 |
Run time | 12 days 4 hours 24 min 4 sec |
CPU time | 12 days 3 hours 26 min 4 sec |
Validate state | Invalid |
Credit | 10,575.36 |
Device peak FLOPS | 3.28 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5576, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5136, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3272, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5872, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... 
Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5872, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5872, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5872, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5512, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5512, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
04 Apr 2012 00:23:13 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 881,280 | 1,026,643 | 1.1649 |
01 Apr 2012 19:44:43 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 855,360 | 995,986 | 1.1644 |
01 Apr 2012 02:08:21 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 829,440 | 966,719 | 1.1655 |
31 Mar 2012 18:11:27 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 803,520 | 937,693 | 1.1670 |
29 Mar 2012 01:32:04 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 777,600 | 908,839 | 1.1688 |
26 Mar 2012 01:02:35 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 751,680 | 878,572 | 1.1688 |
25 Mar 2012 16:03:18 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 725,760 | 849,703 | 1.1708 |
20 Mar 2012 01:30:46 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 699,840 | 820,371 | 1.1722 |
15 Mar 2012 00:42:15 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 673,920 | 789,741 | 1.1719 |
11 Mar 2012 22:44:45 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 648,000 | 758,620 | 1.1707 |
06 Mar 2012 02:30:14 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 622,080 | 727,419 | 1.1693 |
04 Mar 2012 19:28:58 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 596,160 | 697,315 | 1.1697 |
03 Mar 2012 16:33:21 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 570,240 | 665,508 | 1.1671 |
27 Feb 2012 00:47:59 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 544,320 | 634,426 | 1.1655 |
24 Feb 2012 18:43:08 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 518,400 | 604,498 | 1.1661 |
24 Feb 2012 11:06:18 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 492,480 | 574,445 | 1.1664 |
24 Feb 2012 01:50:59 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 466,560 | 544,565 | 1.1672 |
23 Feb 2012 17:31:22 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 440,640 | 514,567 | 1.1678 |
23 Feb 2012 09:46:01 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 414,720 | 484,601 | 1.1685 |
23 Feb 2012 00:55:21 | 1173710 | 13958056 | hadcm3n_u0t3_1980_40_007694394_1 | 388,800 | 454,871 | 1.1699 |
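
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The page does not state this, so treat it as an assumption inferred from the numbers. A minimal Python sketch that checks three rows copied from the table (the function name `sec_per_timestep` is hypothetical, chosen for illustration):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average)
ROWS = [
    (881_280, 1_026_643, 1.1649),  # 04 Apr 2012 00:23:13
    (855_360, 995_986, 1.1644),    # 01 Apr 2012 19:44:43
    (388_800, 454_871, 1.1699),    # 23 Feb 2012 00:55:21
]


def sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in ROWS:
        computed = sec_per_timestep(cpu_time_sec, timestep)
        # Show the computed ratio next to the value reported on the page.
        print(f"timestep={timestep:>7,}  computed={computed:.4f}  reported={reported:.4f}")
```

For these rows the computed ratio agrees with the reported value to within 0.0001, which is consistent with the assumed definition but does not confirm how the server rounds.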