X-MimeOLE: Produced By Microsoft Exchange V6.5
Received: by onstor-exch02.onstor.net 
	id <01C882E3.F385ACA2@onstor-exch02.onstor.net>; Mon, 10 Mar 2008 12:21:33 -0700
MIME-Version: 1.0
Content-Type: multipart/alternative;
	boundary="----_=_NextPart_001_01C882E3.F385ACA2"
References: <BB375AF679D4A34E9CA8DFA650E2B04E03E9A693@onstor-exch02.onstor.net><BB375AF679D4A34E9CA8DFA650E2B04E08C0FA4F@onstor-exch02.onstor.net> <20080310121552.58b53407@ripper.onstor.net>
Content-class: urn:content-classes:message
Subject: RE: system config reset
Date: Mon, 10 Mar 2008 12:19:49 -0700
Message-ID: <BB375AF679D4A34E9CA8DFA650E2B04E04228F64@onstor-exch02.onstor.net>
X-MS-Has-Attach: 
X-MS-TNEF-Correlator: 
Thread-Topic: system config reset
Thread-Index: AciC4yg/xwqJ3Ke4RYy6URMSoBkyLQAAI2HD
From: "Raj Kumar" <raj.kumar@onstor.com>
To: "Andy Sharp" <andy.sharp@onstor.com>
Cc: "Chris Vandever" <chris.vandever@onstor.com>,
	"Larry Scheer" <larry.scheer@onstor.com>

This is a multi-part message in MIME format.

------_=_NextPart_001_01C882E3.F385ACA2
Content-Type: text/plain;
	charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

I am positive I committed my changes before exiting. Even now I can
see the changes. Should I try to reset again and capture everything?

From /etc/onstor/initial-config option 3:
>
> Current Settings:
>    Node Name: g11r10
>    Date & Time: Mon Mar 10 11:40:47 PDT 2008
>    Network Settings:
>       Mgmt port 1 IP: 10.2.10.11 NETMASK: 255.255.0.0
>       Mgmt port 2 IP: address NETMASK: netmask
>       Current default route: 10.2.0.1
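A small sketch for checking a pasted "Current Settings" screen for fields that still hold placeholder text. In the output above, Mgmt port 2 shows the literal words "address" and "netmask", which may be related to the "address: Host name lookup failure" and "Failed to bring up eth1" lines in the boot log quoted further down. The function name and the placeholder list are assumptions for illustration, not part of the initial-config script.

```python
import re

# Assumed placeholder words, taken from the "Mgmt port 2" line in this thread.
PLACEHOLDERS = {"address", "netmask"}

# Matches e.g. "Mgmt port 1 IP: 10.2.10.11 NETMASK: 255.255.0.0"
PORT_LINE = re.compile(r"Mgmt port (\d+) IP:\s*(\S+)\s+NETMASK:\s*(\S+)")


def unconfigured_ports(settings_text):
    """Return the management port numbers whose IP or netmask is a placeholder."""
    ports = []
    for line in settings_text.splitlines():
        # search() skips any leading "> " quoting left by the mail client.
        match = PORT_LINE.search(line)
        if match:
            port, ip, mask = match.groups()
            if ip in PLACEHOLDERS or mask in PLACEHOLDERS:
                ports.append(int(port))
    return ports


SAMPLE = """\
>       Mgmt port 1 IP: 10.2.10.11 NETMASK: 255.255.0.0
>       Mgmt port 2 IP: address NETMASK: netmask
"""

print(unconfigured_ports(SAMPLE))
```

If the list is non-empty after a commit, that port was probably never given real values.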


________________________________

From: Andy Sharp
Sent: Mon 3/10/2008 12:15 PM
To: Raj Kumar
Cc: Chris Vandever; Larry Scheer
Subject: Re: system config reset



Cougar doesn't currently use FTI, so you're saved.

I don't think you committed your changes during the initial config
menus.  You can log your telnet session using the screen command in
order to capture everything.  If you're not familiar, stop by and I'll
fill you in.

Cheers,

a

PS, meanwhile, I'll try it on my cougar to see what happens.


On Mon, 10 Mar 2008 12:09:35 -0700 "Raj Kumar" <raj.kumar@onstor.com>
wrote:

> I assume that's why clusterDB and cluster.conf are not created since
> the cluster services didn't start?
>
> To make progress, should I just copy the pmtab and start pm?
>
> _____________________________________________
> From: Chris Vandever
> Sent: Monday, March 10, 2008 12:07 PM
> To: Raj Kumar; Larry Scheer; Andy Sharp
> Subject: RE: system config reset
>
> Actually, it does.  That went in with the FTI changes so we don't
> start the bulk of the apps until we're configured.  We just start
> what's needed for nfxsh to run so we can do the config.  That
> explains the missing log entries (the apps weren't started because
> they weren't in pmtab), and it explains why we didn't see the message
> I expected from ClusterCtrl_Init().
>
> ChrisV
>
> _____________________________________________
> From: Raj Kumar
> Sent: Monday, March 10, 2008 12:04 PM
> To: Chris Vandever; Larry Scheer; Andy Sharp
> Subject: RE: system config reset
>
> Does pmtab get wiped out during config reset? I didn't think so.
>
> g11r10:~# cat /onstor/etc/pmtab
> initwait: /onstor/bin/elog
> initwait: /onstor/bin/sscccc
>
> g11r10:~#
> g11r10:~# ps ax | grep onstor
>  4403 ?        Ss     0:00 /onstor/bin/sshd
>  4591 ?        Ss     0:00 /onstor/bin/pm
>  4603 ?        S      0:12 /onstor/bin/elog
>  4612 ?        S      0:00 /onstor/bin/sscccc
>  7025 ?        Ss     0:00 /bin/sh /onstor/bin/emrscron -g stats
>  7335 ?        S      0:00 /bin/sh /onstor/bin/support.sh -e nfxsh_connect  -g stats -s --
>  9325 ?        Ss     0:00 /bin/sh /onstor/bin/emrscron -g h_res_stats
>  9357 pts/0    R+     0:00 grep onstor
> g11r10:~#
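The comparison between pmtab and the process list can be sketched as below. The "tag: path" line format is inferred only from the two pmtab lines shown above, and the function names are hypothetical.

```python
def pmtab_apps(pmtab_text):
    """Return the executable paths listed in pmtab.

    Assumes each entry looks like "initwait: /onstor/bin/elog", which is
    inferred from the two lines shown in this thread.
    """
    apps = []
    for line in pmtab_text.splitlines():
        line = line.strip()
        if not line or ":" not in line:
            continue
        _tag, path = line.split(":", 1)
        apps.append(path.strip())
    return apps


def running_commands(ps_text):
    """Return the set of executables from `ps ax` output (first word of COMMAND)."""
    commands = set()
    for line in ps_text.splitlines():
        fields = line.split(None, 4)  # PID TTY STAT TIME COMMAND
        if len(fields) == 5 and fields[0].isdigit():
            commands.add(fields[4].split()[0])
    return commands


PMTAB = "initwait: /onstor/bin/elog\ninitwait: /onstor/bin/sscccc\n"
PS = """\
 4403 ?        Ss     0:00 /onstor/bin/sshd
 4591 ?        Ss     0:00 /onstor/bin/pm
 4603 ?        S      0:12 /onstor/bin/elog
 4612 ?        S      0:00 /onstor/bin/sscccc
 9357 pts/0    R+     0:00 grep onstor
"""

# pmtab entries that have no matching running process
missing = [app for app in pmtab_apps(PMTAB) if app not in running_commands(PS)]
print("in pmtab but not running:", missing)
```

With the output pasted above, everything listed in pmtab is running; the point is that pmtab itself lists only two apps.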
>
> _____________________________________________
> From: Chris Vandever
> Sent: Monday, March 10, 2008 12:01 PM
> To: Raj Kumar; Larry Scheer; Andy Sharp
> Subject: RE: system config reset
>
> What apps does ps show running?
>
> Based on the elog, clustering hasn't even started (but as evidenced by
> the second reboot, there are more messages missing from the elog than
> IN the elog).  The messages that look like they're from clustering are
> actually from libcluster being called by an app that starts prior to
> clustering.  I see the 'system config reset' at 10:24 with the initial
> reboot at 10:28.  The only app that made it to the log is elog:
>
> Mar 10 10:24:37 g11r10 : 0:0:eventd:CRITICAL: Process-EVENT Node: Name
> 'local', State Down, Msg 'Node going down for reboot! ('system config
> reset' issued from nfxsh).'
> Mar 10 10:28:10 g11r10 pm: /onstor/bin/elog: finished initialization.
> Mar 10 10:30:20 g11r10 : 0:0:cluster2:ERROR: Cluster_RetrieveConfig:
> Cluster cfg file /onstor/conf/cluster.conf missing or corrupted or
> node intentionally removed from cluster, defaulting to standalone
> mode, err 0
>
> There's another boot at 10:35, and based on the subsequent reboot we
> made it at least as far as sscccc (which is at the end of pmtab just
> before sendmail):
>
> Mar 10 10:35:28 g11r10 pm: /onstor/bin/elog: finished initialization.
> Mar 10 10:36:28 g11r10 : 0:0:cluster2:ERROR: Cluster_RetrieveConfig:
> Cluster cfg file /onstor/conf/cluster.conf missing or corrupted or
> node intentionally removed from cluster, defaulting to standalone
> mode, err 0
> Mar 10 10:38:00 g11r10 pm: pm_terminate: child 1732
> (/onstor/bin/sscccc) terminated
> Mar 10 10:38:01 g11r10 pm: pm_terminate: child 1713 (/onstor/bin/elog)
> terminated
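One way to pull this boot timeline out of an elog file, as a rough sketch. It only knows the two pm message shapes quoted above ("finished initialization" and "pm_terminate ... terminated") and expects each log entry on a single line, so wrapped mail text would need to be rejoined first. The function name is made up.

```python
import re

# The two pm message shapes seen in this thread; other formats are not handled.
STARTED = re.compile(
    r"^(\w{3}\s+\d+ [\d:]{8}) \S+ pm: (\S+): finished initialization\.")
STOPPED = re.compile(
    r"^(\w{3}\s+\d+ [\d:]{8}) \S+ pm: pm_terminate: child (\d+) \((\S+)\) terminated")


def pm_events(log_lines):
    """Yield (timestamp, event, app) tuples for pm start/stop messages."""
    for line in log_lines:
        m = STARTED.match(line)
        if m:
            yield m.group(1), "started", m.group(2)
            continue
        m = STOPPED.match(line)
        if m:
            yield m.group(1), "terminated", m.group(3)


LOG = [
    "Mar 10 10:28:10 g11r10 pm: /onstor/bin/elog: finished initialization.",
    "Mar 10 10:35:28 g11r10 pm: /onstor/bin/elog: finished initialization.",
    "Mar 10 10:38:00 g11r10 pm: pm_terminate: child 1732 (/onstor/bin/sscccc) terminated",
    "Mar 10 10:38:01 g11r10 pm: pm_terminate: child 1713 (/onstor/bin/elog) terminated",
]

for event in pm_events(LOG):
    print(event)
```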
>
> ChrisV
> _____________________________________________
> From: Raj Kumar
> Sent: Monday, March 10, 2008 11:41 AM
> To: Chris Vandever; Larry Scheer; dl-Cougar
> Subject: RE: system config reset
>
> Yes, those messages are in the elog after the 2nd config reset attempt
> (I didn't see them after the last attempt, though). The elogs are at
> /n/newcorevol/defect_22743
>
> From /etc/onstor/initial-config option 3:
>
> Current Settings:
>    Node Name: g11r10
>    Date & Time: Mon Mar 10 11:40:47 PDT 2008
>    Network Settings:
>       Mgmt port 1 IP: 10.2.10.11 NETMASK: 255.255.0.0
>       Mgmt port 2 IP: address NETMASK: netmask
>       Current default route: 10.2.0.1
>
>
> Pending changes:
>
> Press 'Enter' to continue...
>
> _____________________________________________
> From: Chris Vandever
> Sent: Monday, March 10, 2008 11:37 AM
> To: Raj Kumar; Larry Scheer; dl-Cougar
> Subject: RE: system config reset
>
> Can I get the full elogs?  They should contain a message like the
> following:
>
> Cluster_RetrieveConfig: Cluster cfg file cluster.conf missing or
> corrupted or node intentionally removed from cluster, defaulting to
> standalone mode, err 0
>
> When cluster_contrl starts it should create the missing cluster.conf
> file UNLESS it is unable to get an IP address for the local host.
> Then, it will complain:
>
> ClusterCtrl_InitUbik: fail to find any IP address
>
> And it will exit.  So, the question is, what happened to the IP
> address?
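The startup behaviour described above can be summarised in a short sketch. This is not the cluster_contrl source; the function name, its arguments, and the returned action strings are invented for illustration, and only the two log message names come from this thread.

```python
def cluster_startup_actions(conf_exists, local_ips):
    """Model the startup decisions described in the mail (illustrative only).

    conf_exists: whether cluster.conf is present and readable.
    local_ips:   IP addresses found for the local host (may be empty).
    Returns the ordered list of actions and log messages.
    """
    if conf_exists:
        # Assumption: an existing file is simply used as-is.
        return ["use existing cluster.conf"]

    # Missing file: the node defaults to standalone mode and logs it.
    actions = ["log: Cluster_RetrieveConfig (defaulting to standalone mode)"]

    if not local_ips:
        # No local IP: complain and exit without creating the file.
        actions.append("log: ClusterCtrl_InitUbik: fail to find any IP address")
        actions.append("exit")
    else:
        actions.append("create cluster.conf")
    return actions


# Missing cluster.conf and no usable IP address
print(cluster_startup_actions(False, []))
# Missing cluster.conf but an address is available
print(cluster_startup_actions(False, ["10.2.10.11"]))
```

In this model, a missing cluster.conf together with no InitUbik complaint in the log would suggest the process never started at all.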
>
> ChrisV
>
> _____________________________________________
> From: Raj Kumar
> Sent: Monday, March 10, 2008 11:24 AM
> To: Larry Scheer; dl-Cougar
> Subject: RE: system config reset
>
> I couldn't cut and paste all those screens, but I did set all of
> those values before exiting the script.
>
> _____________________________________________
> From: Larry Scheer
> Sent: Monday, March 10, 2008 11:23 AM
> To: Raj Kumar; dl-Cougar
> Subject: RE: system config reset
>
> Going strictly by the information you provided, it is because you
> reset the configuration and exited the configuration script without
> setting any configuration information. You have no IP address,
> hostname, default route, etc.
>
> _____________________________________________
> From: Raj Kumar
> Sent: Monday, March 10, 2008 11:00 AM
> To: dl-Cougar
> Subject: FW: system config reset
>
> Any idea?
>
> _____________________________________________
> From: Raj Kumar
> Sent: Monday, March 10, 2008 10:51 AM
> To: dl-QA
> Subject: system config reset
>
> Hi,
>
> I did a config reset on cougar soak g11r10 (already tried twice).
> After the reset, the filer's services don't come up because the
> Cluster DB and cluster.conf are missing. Any ideas?
>
> g11r10:~# ls -l /onstor/conf/
> total 1433
> -rw-r--r-- 1 root root 693561 Feb 19 20:43 R4.0.0.0-021908.bom
> -rw-r--r-- 1 root root 693237 Feb 14 14:41 R4.0.0.0DBG-021408.bom
> lrwxrwxrwx 1 root root     19 Feb 20 13:11 current.bom -> R4.0.0.0-021908.bom
> -rw-r--r-- 1 root root   2046 Feb  6 16:05 emrs_client.pem
> -rw-r--r-- 1 root root   1363 Feb  6 16:05 emrs_server.crt
> drwx------ 2 root root  12288 Feb  7 20:00 lost+found
> lrwxrwxrwx 1 root root     22 Feb 20 13:06 previous.bom -> R4.0.0.0DBG-021408.bom
> -rw-r--r-- 1 root root  53742 Feb  6 16:05 sdm-devcap
> g11r10:~#
>
>      1. Configure Administrative Settings
>
>
>      2. Configure Network Settings
>
>
>      3. Display Current Settings
>
>
>      4. Commit Changes
>
>
>      5. Help
>
>
>      6. Copy Configuration Files From Secondary Flash
>
>
>      7. Exit
>
>
>     Enter Selection: 7
>
> Value Entered is 7
>
> .
> Setting up networking....
> Configuring network interfaces...SIOCADDRT: Network is unreachable
> run-parts: /etc/network/if-up.d/addroutes exited with return code 7
> address: Host name lookup failure
> ifconfig: `--help' gives usage information.
> Failed to bring up eth1.
> done.
> Starting portmap daemon....
> INIT: Entering runlevel: 2
> Starting system log daemon: syslogd.
> Starting kernel log daemon: klogd.
> Starting portmap daemon...Already running..
> Starting automounter: loading autofs4 kernel module, no automount maps
> defined.
> Setting NIS domainname to: NASgateway.
> Starting NIS services: ypserv yppasswdd ypxfrd ypbind.
> Starting MTA: exim4.
> * ALERT: exim paniclog /var/log/exim4/paniclog has non-zero size, mail
> system possibly broken
> Starting internet superserver: inetd.
> Starting OpenBSD Secure Shell server: sshd.
> Starting NFS common utilities: statd.
> Starting NTP server: ntpd.
> Starting deferred execution scheduler: atd.
> Starting periodic command scheduler: crond.
> Starting ONStor services: mgmtbus/onstor/bin/emrscron -f
>  pm.
>
> OnStor GNU/Linux 4.0 g11r10 duart0
>
> g11r10 login: Mar 10 10:46:46 g11r10 : 0:0:cluster2:ERROR:
> cluster_iUpdateRecordData: no reply bck -1
> Mar 10 10:46:47 g11r10 : 0:0:cluster2:ERROR: cluster_getRecordIdByKey:
> no reply bck -1
> Mar 10 10:46:47 g11r10 : 0:0:nfxsh:NOTICE: cmd[0]: elog display
> enable : status[11]
> Mar 10 10:47:36 g11r10 : 0:0:cluster2:ERROR: cluster_iGetRecordData:
> no reply bck -1
> Mar 10 10:48:09 g11r10 last message repeated 3 times
> Mar 10 10:48:19 g11r10 last message repeated 2 times
> Mar 10 10:48:29 g11r10 : 0:0:cluster2:ERROR: cluster_getRecordIdByKey:
> no reply bck -1
> Mar 10 10:48:29 g11r10 : 0:0:cluster2:ERROR: cluster_iGetRecordData:
> no reply bck -1
> Mar 10 10:48:39 g11r10 last message repeated 2 times
> Mar 10 10:48:50 g11r10 : 0:0:cluster2:ERROR: cluster_getRecordIdByKey:
> no reply bck -1
> Mar 10 10:48:51 g11r10 : 0:0:cluster2:ERROR: cluster_iGetRecordData:
> no reply bck -1
> Mar 10 10:49:01 g11r10 last message repeated 2 times
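For a quick count of how often each cluster2 error shows up in console output like the above, the syslog "last message repeated N times" lines need to be folded back into the preceding message. A minimal sketch, assuming each entry has been rejoined onto one line; the function name is hypothetical.

```python
import re
from collections import Counter

ERROR = re.compile(r"cluster2:ERROR:\s*(\w+):")
REPEAT = re.compile(r"last message repeated (\d+) times")


def count_cluster_errors(lines):
    """Count cluster2 ERROR messages per function, honouring syslog repeats."""
    counts = Counter()
    last = None  # function name of the most recent message, if it was an error
    for line in lines:
        m = ERROR.search(line)
        if m:
            last = m.group(1)
            counts[last] += 1
            continue
        m = REPEAT.search(line)
        if m:
            if last is not None:
                counts[last] += int(m.group(1))
            continue
        last = None  # some other message; a later repeat would refer to it
    return counts


CONSOLE = [
    "Mar 10 10:47:36 g11r10 : 0:0:cluster2:ERROR: cluster_iGetRecordData: no reply bck -1",
    "Mar 10 10:48:09 g11r10 last message repeated 3 times",
    "Mar 10 10:48:19 g11r10 last message repeated 2 times",
    "Mar 10 10:48:29 g11r10 : 0:0:cluster2:ERROR: cluster_getRecordIdByKey: no reply bck -1",
]

for name, total in count_cluster_errors(CONSOLE).items():
    print(name, total)
```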
>
> Thanks.
>
> --kumar :-)
>



------_=_NextPart_001_01C882E3.F385ACA2--
