climateprediction.net home page
Task 15502114

Task 15502114

Name hadcm3n_o5ni_2140_40_008269310_0
Workunit 8424434
Created 23 Dec 2012, 22:35:59 UTC
Sent 26 Dec 2012, 4:34:49 UTC
Report deadline 27 Mar 2013, 12:02:00 UTC
Received 23 Feb 2013, 12:39:28 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 988438
Run time 26 days 12 hours 27 min 19 sec
CPU time 24 days 20 hours 39 min 44 sec
Validate state Invalid
Credit 8,709.12
Device peak FLOPS 1.44 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
03:06:08 (25536): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
08:04:32 (25850): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Suspended CPDN Monitor - Suspend request from BOINC...
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:19:05 (8571): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Suspended CPDN Monitor - Suspend request from BOINC...
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Suspended CPDN Monitor - Suspend request from BOINC...
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Suspended CPDN Monitor - Suspend request from BOINC...
07:37:58 (15036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:37:59 (15036): No heartbeat from core client for 30 sec - exiting
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Suspended CPDN Monitor - Suspend request from BOINC...
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:59:40 (19052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Suspended CPDN Monitor - Suspend request from BOINC...
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Suspended CPDN Monitor - Suspend request from BOINC...
15:39:53 (5158): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Unable to load library hadcm3n_se_6.07_i686-pc-linux-gnu.so
dlopen error: libnsl.so.1: cannot open shared object file: No such file or directory
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	01:34:45 PM	No files match the supplied pattern.
MainError:	01:34:45 PM	No files match the supplied pattern.
MainError:	12:57:43 AM	No files match the supplied pattern.
MainError:	12:57:43 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	10:26:31 AM	No files match the supplied pattern.
MainError:	10:26:31 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	08:43:41 AM	No files match the supplied pattern.
MainError:	08:43:41 AM	No files match the supplied pattern.
MainError:	07:15:47 AM	No files match the supplied pattern.
MainError:	07:15:47 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
07:48:10 (20551): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
07:48:11 (20551): No heartbeat from core client for 30 sec - exiting
07:48:12 (20551): No heartbeat from core client for 30 sec - exiting
07:48:13 (20551): No heartbeat from core client for 30 sec - exiting
07:48:14 (20551): No heartbeat from core client for 30 sec - exiting
MainError:	07:51:41 PM	No files match the supplied pattern.
MainError:	07:51:41 PM	No files match the supplied pattern.
MainError:	03:48:58 PM	No files match the supplied pattern.
MainError:	03:48:58 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
MainError:	01:29:34 PM	No files match the supplied pattern.
MainError:	01:29:34 PM	No files match the supplied pattern.
21:06:55 (11027): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	04:40:43 PM	No files match the supplied pattern.
MainError:	04:40:43 PM	No files match the supplied pattern.
Suspended CPDN Monitor - Suspend request from BOINC...
22:02:13 (16474): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
00:52:28 (20704): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14676, selfPID=14676, iMonCtr=1
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=21250, selfPID=21250, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
cpdnmonitor: cannot open input file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/dataout/ocean_restart.day after 11 attempts
forrtl: severe (24): end-of-file during read, unit 5, file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/jobs/climate.cpdc
Image              PC        Routine            Line        Source             
hadcm3n_um_6.07_i  0848EB7D  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0848D975  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0845F3CF  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0841F90D  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0841F257  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  084399E4  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  083403FC  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0838F198  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0839BDF8  Unknown               Unknown  Unknown
libc.so.6          F7596BD6  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0804CB11  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18535, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/dataout/ocean_restart.day after 11 attempts
forrtl: severe (24): end-of-file during read, unit 5, file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/jobs/climate.cpdc
Image              PC        Routine            Line        Source             
hadcm3n_um_6.07_i  0848EB7D  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0848D975  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0845F3CF  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0841F90D  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0841F257  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  084399E4  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  083403FC  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0838F198  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0839BDF8  Unknown               Unknown  Unknown
libc.so.6          F7592BD6  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0804CB11  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18535, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/dataout/ocean_restart.day after 11 attempts
forrtl: severe (24): end-of-file during read, unit 5, file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/jobs/climate.cpdc
Image              PC        Routine            Line        Source             
hadcm3n_um_6.07_i  0848EB7D  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0848D975  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0845F3CF  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0841F90D  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0841F257  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  084399E4  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  083403FC  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0838F198  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0839BDF8  Unknown               Unknown  Unknown
libc.so.6          F7551BD6  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0804CB11  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18535, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/dataout/ocean_restart.day after 11 attempts
forrtl: severe (24): end-of-file during read, unit 5, file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/jobs/climate.cpdc
Image              PC        Routine            Line        Source             
hadcm3n_um_6.07_i  0848EB7D  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0848D975  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0845F3CF  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0841F90D  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0841F257  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  084399E4  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  083403FC  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0838F198  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0839BDF8  Unknown               Unknown  Unknown
libc.so.6          F7624BD6  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0804CB11  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18535, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/dataout/ocean_restart.day after 11 attempts
forrtl: severe (24): end-of-file during read, unit 5, file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/jobs/climate.cpdc
Image              PC        Routine            Line        Source             
hadcm3n_um_6.07_i  0848EB7D  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0848D975  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0845F3CF  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0841F90D  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0841F257  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  084399E4  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  083403FC  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0838F198  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0839BDF8  Unknown               Unknown  Unknown
libc.so.6          F75E4BD6  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0804CB11  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18535, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/dataout/ocean_restart.day after 11 attempts
forrtl: severe (24): end-of-file during read, unit 5, file /scratch/.boinc.beo-39/projects/climateprediction.net/hadcm3n_o5ni_2140_40_008269310/jobs/climate.cpdc
Image              PC        Routine            Line        Source             
hadcm3n_um_6.07_i  0848EB7D  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0848D975  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0845F3CF  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0841F90D  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0841F257  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  084399E4  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  083403FC  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0838F198  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0839BDF8  Unknown               Unknown  Unknown
libc.so.6          F7552BD6  Unknown               Unknown  Unknown
hadcm3n_um_6.07_i  0804CB11  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18535, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Jan 2013 16:45:47 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 725,760 2,089,506 2.8791
22 Jan 2013 13:30:20 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 699,840 2,018,072 2.8836
21 Jan 2013 15:50:52 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 673,920 1,945,122 2.8863
20 Jan 2013 19:56:19 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 648,000 1,873,507 2.8912
18 Jan 2013 07:24:43 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 622,080 1,800,178 2.8938
17 Jan 2013 08:44:36 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 596,160 1,723,040 2.8902
16 Jan 2013 10:28:05 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 570,240 1,647,711 2.8895
15 Jan 2013 13:05:14 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 544,320 1,571,337 2.8868
14 Jan 2013 13:35:26 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 518,400 1,494,652 2.8832
13 Jan 2013 13:10:06 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 492,480 1,412,764 2.8687
12 Jan 2013 12:45:07 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 466,560 1,335,210 2.8618
11 Jan 2013 14:20:27 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 440,640 1,256,719 2.8520
10 Jan 2013 15:35:44 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 414,720 1,177,923 2.8403
09 Jan 2013 18:01:21 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 388,800 1,101,527 2.8331
08 Jan 2013 20:15:04 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 362,880 1,025,569 2.8262
07 Jan 2013 19:36:48 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 336,960 950,698 2.8214
06 Jan 2013 21:43:03 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 311,040 877,175 2.8201
05 Jan 2013 23:03:53 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 285,120 803,612 2.8185
05 Jan 2013 01:09:03 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 259,200 730,489 2.8182
03 Jan 2013 06:35:27 988438 15502114 hadcm3n_o5ni_2140_40_008269310_0 233,280 657,450 2.8183


©2024 climateprediction.net