Name | hadcm3n_4guw_1940_40_008307048_3 |
Workunit | 8458183 |
Created | 18 Aug 2013, 13:23:45 UTC |
Sent | 18 Aug 2013, 13:24:48 UTC |
Report deadline | 17 Nov 2013, 20:51:59 UTC |
Received | 30 Sep 2013, 16:52:20 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 10780 (0x00002A1C) Unknown error code |
Computer ID | 459222 |
Run time | 7 days 18 hours 40 min 45 sec |
CPU time | 7 days 16 hours 54 min 47 sec |
Validate state | Invalid |
Credit | 9,953.28 |
Device peak FLOPS | 3.27 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> (unknown error) - exit code 10780 (0x2a1c) </message> <stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10752, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10752, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10752, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12160, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11236, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10556, iMonCtr=1
Model crash detected, will try to restart...
C19:11:33 (4820): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:11:34 (4820): No heartbeat from core client for 30 sec - exiting
19:11:35 (4820): No heartbeat from core client for 30 sec - exiting
19:11:36 (4820): No heartbeat from core client for 30 sec - exiting
19:11:37 (4820): No heartbeat from core client for 30 sec - exiting
19:11:38 (4820): No heartbeat from core client for 30 sec - exiting
19:11:39 (4820): No heartbeat from core client for 30 sec - exiting
19:11:40 (4820): No heartbeat from core client for 30 sec - exiting
19:11:41 (4820): No heartbeat from core client for 30 sec - exiting
19:11:42 (4820): No heartbeat from core client for 30 sec - exiting
19:11:43 (4820): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1668, iMonCtr=1
Model crash detected, will try to restart...
17:29:47 (10028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6780, iMonCtr=1
Model crash detected, will try to restart...
</stderr_txt> ]]> |
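
The figures in the summary rows above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original page: the helper name `duration_to_seconds` is hypothetical, and the input strings are copied from the Run time, CPU time and Exit status rows. It converts the two durations to seconds, takes their ratio, and confirms that the decimal and hexadecimal forms of the exit status agree.

```python
import re

# Seconds per unit, keyed by the unit stems used on the page ("days", "hours", "min", "sec").
UNIT_SECONDS = {"day": 86_400, "hour": 3_600, "min": 60, "sec": 1}

def duration_to_seconds(text):
    """Convert a string such as '7 days 18 hours 40 min 45 sec' to whole seconds.

    Hypothetical helper for illustration; it only understands the four units above.
    """
    total = 0
    for value, unit in re.findall(r"(\d+)\s*(day|hour|min|sec)", text):
        total += int(value) * UNIT_SECONDS[unit]
    return total

# Values copied from the "Run time" and "CPU time" rows.
run_time = duration_to_seconds("7 days 18 hours 40 min 45 sec")
cpu_time = duration_to_seconds("7 days 16 hours 54 min 47 sec")

# Fraction of wall-clock run time that was spent on the CPU.
cpu_fraction = cpu_time / run_time

# The exit status is shown as decimal 10780 and as hexadecimal 0x00002A1C.
exit_status_from_hex = int("0x00002A1C", 16)

print(run_time, cpu_time, round(cpu_fraction, 4), exit_status_from_hex)
```

Under these assumptions the run time comes to 672,045 seconds and the CPU time to 665,687 seconds, so the task was on the CPU for roughly 99% of its recorded run time, and 0x2A1C is indeed 10780.
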
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
30 Sep 2013 15:54:53 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 829,440 | 662,915 | 0.7992 |
29 Sep 2013 13:30:52 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 803,520 | 640,509 | 0.7971 |
28 Sep 2013 20:19:45 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 777,600 | 619,207 | 0.7963 |
28 Sep 2013 14:28:36 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 751,680 | 598,505 | 0.7962 |
28 Sep 2013 08:39:40 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 725,760 | 577,705 | 0.7960 |
27 Sep 2013 16:37:24 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 699,840 | 556,059 | 0.7946 |
26 Sep 2013 15:44:28 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 673,920 | 533,712 | 0.7920 |
25 Sep 2013 09:23:31 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 648,000 | 513,225 | 0.7920 |
23 Sep 2013 18:07:02 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 622,080 | 492,131 | 0.7911 |
23 Sep 2013 15:18:41 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 596,160 | 469,738 | 0.7879 |
19 Sep 2013 16:21:36 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 570,240 | 448,508 | 0.7865 |
17 Sep 2013 19:58:47 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 544,320 | 426,626 | 0.7838 |
16 Sep 2013 19:54:52 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 518,400 | 405,418 | 0.7821 |
15 Sep 2013 18:16:45 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 492,480 | 384,482 | 0.7807 |
07 Sep 2013 08:13:57 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 466,560 | 363,295 | 0.7787 |
06 Sep 2013 15:44:11 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 440,640 | 342,874 | 0.7781 |
05 Sep 2013 15:50:58 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 414,720 | 322,747 | 0.7782 |
03 Sep 2013 19:10:13 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 388,800 | 302,938 | 0.7792 |
02 Sep 2013 16:38:21 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 362,880 | 282,289 | 0.7779 |
01 Sep 2013 16:37:27 | 459222 | 15925251 | hadcm3n_4guw_1940_40_008307048_3 | 336,960 | 261,975 | 0.7775 |
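
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The sketch below checks that reading against four rows copied from the table; the function name `average_sec_per_timestep` is hypothetical, and the rounding rule is an inference from the data rather than something the page states.

```python
# (timestep, cpu_time_sec, reported_average) copied from four rows of the trickle table.
ROWS = [
    (829_440, 662_915, 0.7992),  # 30 Sep 2013 15:54:53
    (803_520, 640_509, 0.7971),  # 29 Sep 2013 13:30:52
    (673_920, 533_712, 0.7920),  # 26 Sep 2013 15:44:28
    (336_960, 261_975, 0.7775),  # 01 Sep 2013 16:37:27
]

def average_sec_per_timestep(cpu_seconds, timestep):
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals (assumed rule)."""
    return round(cpu_seconds / timestep, 4)

# Compare the recomputed value with the value shown on the page for each row.
results = []
for timestep, cpu_seconds, reported in ROWS:
    computed = average_sec_per_timestep(cpu_seconds, timestep)
    results.append((timestep, computed, reported))
    print(f"{timestep:>9,}  computed={computed:.4f}  reported={reported:.4f}")

# Consecutive trickles are 25,920 timesteps apart, e.g. between the two newest rows.
timestep_delta = ROWS[0][0] - ROWS[1][0]
```

The 26 Sep row is the informative one: the unrounded quotient is about 0.79195, which rounds to 0.7920 as reported, whereas truncation would give 0.7919.
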