climateprediction.net home page
Task 16725065

Task 16725065

Name hadam3p_eu_na3q_2013_1_008807590_0
Workunit 8953568
Created 7 Jul 2014, 17:06:58 UTC
Sent 1 Aug 2014, 16:18:06 UTC
Report deadline 14 Jul 2015, 21:38:06 UTC
Received 3 Aug 2014, 16:30:54 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 1 (0x00000001) Unknown error code
Computer ID 1298068
Run time 10 hours 32 min 29 sec
CPU time
Validate state Invalid
Credit 200.38
Device peak FLOPS 3.05 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
i686-apple-darwin
Stderr
<core_client_version>7.0.65</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
17:13:30 (16915): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
SIGSEGV: segmentation violation
forrtl: No space left on device
forrtl: severe (38): error during write, unit 6, file /Library/Application Support/BOINC Data/projects/climateprediction.net/hadam3p_eu_na3q_2013_1_008807590/dataout/xaakg.out
Image              PC        Routine            Line        Source             
hadrm3p_eu_um_6.0  00365662  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  0036412B  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  003289AD  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  002DA7E5  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  002D9F47  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  0031F669  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00001D4F  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00020A67  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  0002204D  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  002B9D2C  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  0000192B  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  00001859  Unknown               Unknown  Unknown
Unknown            00000003  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19731, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
zip I/O error: No space left on device

zip error: Could not create output file (../hadam3p_eu_na3q_2013_1_008807590_0_13.zip)
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=30580, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=31757, iMonCtr=2
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40631, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=43765, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=44488, selfPID=44484, iMonCtr=1
Model crash detected, will try to restart...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=44756, selfPID=44757, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=46004, selfPID=45999, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=47153, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=47418, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=48654, iMonCtr=2
Model crash detected, will try to restart...
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=58419, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=62085, iMonCtr=2
Model crash detected, will try to restart...
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=69900, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=81137, iMonCtr=2
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=86933, selfPID=86934, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=88050, iMonCtr=2
Model crash detected, will try to restart...
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=90042, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=90240, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=92579, iMonCtr=2
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=97043, selfPID=97039, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=99004, selfPID=99005, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2291, iMonCtr=2
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6820, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=10500, selfPID=10494, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11185, selfPID=11186, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11218, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12422, selfPID=12415, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12946, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=16213, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18148, selfPID=18129, iMonCtr=1
Model crash detected, will try to restart...
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20051, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20650, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23224, iMonCtr=2
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23416, iMonCtr=2
Model crash detected, will try to restart...
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24792, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=25630, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=27847, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=28556, iMonCtr=2
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=35725, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=38272, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40090, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=42620, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=52631, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=57864, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=58515, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=64489, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=68495, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=70824, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=72868, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=78085, selfPID=78075, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=78085, selfPID=78085, iMonCtr=2
Model crash detected, will try to restart...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=91964, selfPID=91965, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=92052, selfPID=92053, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=92052, selfPID=92052, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=92185, selfPID=92186, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Called boinc_finish
Signal 3 received, exiting...
Signal 3 received, exiting...
Called boinc_finish
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=99844, selfPID=99845, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8497, selfPID=8498, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8915, selfPID=8909, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=17415, selfPID=17409, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=19979, selfPID=19973, iMonCtr=1
Model crash detected, will try to restart...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=27335, selfPID=27336, iMonCtr=1
Signal 3 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Aug 2014 01:12:49 1298068 16725065 hadam3p_eu_na3q_2013_1_008807590_0 11,616 27,289 2.3493


©2024 climateprediction.net