climateprediction.net home page
Task 16725059

Task 16725059

Name hadam3p_eu_na3k_2013_1_008807584_0
Workunit 8953562
Created 7 Jul 2014, 17:06:58 UTC
Sent 1 Aug 2014, 16:18:06 UTC
Report deadline 14 Jul 2015, 21:38:06 UTC
Received 3 Aug 2014, 16:30:54 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1298068
Run time 14 hours 27 min 44 sec
CPU time
Validate state Invalid
Credit 200.38
Device peak FLOPS 3.05 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
i686-apple-darwin
Stderr
<core_client_version>7.0.65</core_client_version>
<![CDATA[
<message>
process exited with code 25 (0x19, -231)
</message>
<stderr_txt>
17:13:29 (16978): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:07:40 (19733): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:19:32 (22360): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1
forrtl: No space left on device
forrtl: severe (38): error during write, unit 0, file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadam3p_eu_na3k_2013_1_008807584/dataout/xaakg.err
Image              PC        Routine            Line        Source             
hadrm3p_eu_um_6.0  00365662  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  0036412B  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  003289AD  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  002DA7E5  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  002D9F47  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  0031F669  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  0031C1CF  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  000212FB  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  002B9D2C  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  0000192B  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00001859  Unknown               Unknown  Unknown
Unknown            00000003  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=32555, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=39190, iMonCtr=2
Model crash detected, will try to restart...
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=61409, iMonCtr=2
Model crash detected, will try to restart...
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=66282, iMonCtr=2
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=70918, selfPID=70914, iMonCtr=1
Model crash detected, will try to restart...
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=72071, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=73269, iMonCtr=2
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=75616, iMonCtr=2
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=78242, iMonCtr=2
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=81721, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=81743, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=83181, iMonCtr=2
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=89224, selfPID=89225, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=90301, iMonCtr=2
Model crash detected, will try to restart...
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=92520, iMonCtr=2
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7441, iMonCtr=2
Model crash detected, will try to restart...
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13475, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13618, iMonCtr=2
Model crash detected, will try to restart...
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=22231, selfPID=22231, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23505, iMonCtr=2
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=25146, selfPID=25147, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25177, iMonCtr=2
Model crash detected, will try to restart...
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=26414, iMonCtr=2
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=30596, selfPID=30598, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=41113, selfPID=41114, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=43413, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=48435, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=55951, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=63992, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=75790, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=75832, iMonCtr=2
Model crash detected, will try to restart...
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=86300, selfPID=86301, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=93383, selfPID=93384, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10997, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12002, iMonCtr=2
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=19820, selfPID=19821, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=22792, selfPID=22793, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=24938, selfPID=24929, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=30481, iMonCtr=2
Model crash detected, will try to restart...
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=60992, iMonCtr=2
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Aug 2014 01:12:49 1298068 16725059 hadam3p_eu_na3k_2013_1_008807584_0 11,616 27,364 2.3557


©2024 climateprediction.net