Subject: RE: Initial config wiki page
Date: Wed, 13 Feb 2008 20:13:50 -0700
Message-ID: <BB375AF679D4A34E9CA8DFA650E2B04E03E9A609@onstor-exch02.onstor.net>
In-Reply-To: <20080213185847.31fa978d@ripper.onstor.net>
From: "Chris Vandever" <chris.vandever@onstor.com>
To: "Andy Sharp" <andy.sharp@onstor.com>,
	"Charissa Willard" <charissa.willard@onstor.com>
Cc: "Jan Seidel" <jan.seidel@onstor.com>,
	"Ian Brown" <ian.brown@onstor.com>,
	"Larry Scheer" <larry.scheer@onstor.com>

Check the defects Jan and I have entered this week (#22323 and #22297)
for the details of what files were supposedly copied.

One of the /etc/hostname.sc* files contained the correct IP address on
the original flash, and none of the /etc/hostname.sc* files contained it
on the secondary flash after the attempted config copy.
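A quick way to tell the two cases apart when checking a flash by hand is to
look at the first token of each hostname.sc* file. The sketch below is only an
illustration: classify_hostname_line and the enum names are hypothetical, and
it assumes a static entry begins with "inet" as in a stock OpenBSD
hostname.if file, which may not match what the filer actually writes.

```c
#include <assert.h>
#include <ctype.h>
#include <string.h>

/* Hypothetical classification of the first line of an /etc/hostname.sc* file. */
enum hostname_kind {
    HOSTNAME_EMPTY,   /* nothing but whitespace */
    HOSTNAME_DHCP,    /* first token is "dhcp": address comes from the dhcp server */
    HOSTNAME_STATIC,  /* first token is "inet": assumed static address entry */
    HOSTNAME_OTHER    /* anything else */
};

enum hostname_kind
classify_hostname_line(const char *line)
{
    size_t len;

    assert(line != NULL);

    /* Skip leading whitespace. */
    while (isspace((unsigned char)*line))
        line++;
    if (*line == '\0')
        return HOSTNAME_EMPTY;

    /* Length of the first whitespace-delimited token. */
    len = strcspn(line, " \t\r\n");
    if (len == 4 && strncmp(line, "dhcp", 4) == 0)
        return HOSTNAME_DHCP;
    if (len == 4 && strncmp(line, "inet", 4) == 0)
        return HOSTNAME_STATIC;
    return HOSTNAME_OTHER;
}
```

On a correctly copied secondary flash, at least one hostname.sc* file would be
expected to classify as HOSTNAME_STATIC rather than HOSTNAME_DHCP.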

Jan has since installed sub5 on the secondary flash and was able to get
enough files copied correctly from nfxsh to boot properly.  I can't say
that sub5 works perfectly, but it at least limps, which is more than I
can say for sub7.  :(

Regarding elog.conf, you can only delete it if you can confirm the
behavior of elog doesn't change without it (log level, logging to the
console, etc.).  You will need to check this not only in a single node
environment, but also in a cluster.  You may also need to check the
behavior after a reboot.  I'm not sure when the code reads the info from
elog.conf and when it reads it from the clusDb.

ChrisV

-----Original Message-----
From: Andy Sharp
Sent: Wednesday, February 13, 2008 6:59 PM
To: Charissa Willard
Cc: Chris Vandever; Jan Seidel; Ian Brown; Larry Scheer
Subject: Re: Initial config wiki page

The list of files, which hasn't really shrunk much for BSD lately, is
contained amongst

nfx-tree/Tools/config-files
nfx-tree/Tools/Linux-config-files
nfx-tree/Tools/OpenBSD-config-files

Which are installed to /onstor/lib on the filer.

It won't/can't copy a file that doesn't exist; that's why it didn't
copy hostname.sc2, even though the file was/is in the list.

It would be pretty hard to "copy wrong" because it's just tar given a
list of files.  But then again.... ~:^)
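To see up front which entries in a list would be skipped for that reason, a
check along these lines could be run before the copy. This is a minimal sketch
and not the actual nfxsh code: count_missing and path_exists are hypothetical
names, the list is assumed to be whitespace-separated like the old
sys_config_files string, and paths are tested exactly as given, so the caller
would need to chdir to the flash root first.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>
#include <unistd.h>

/* Returns nonzero if the path exists (hypothetical helper using access(2)). */
int
path_exists(const char *path)
{
    return access(path, F_OK) == 0;
}

/*
 * Walks a whitespace-separated list of paths and counts the entries for
 * which exists() returns zero. Each such entry is written to report if
 * report is not NULL. Entries too long for the local buffer cannot be
 * checked and are counted as missing.
 */
size_t
count_missing(const char *list, int (*exists)(const char *), FILE *report)
{
    char path[1024];
    size_t missing = 0;
    const char *p = list;

    assert(list != NULL && exists != NULL);

    for (;;) {
        size_t len;
        int present = 0;

        p += strspn(p, " \t\n");      /* skip separators */
        if (*p == '\0')
            break;
        len = strcspn(p, " \t\n");    /* length of this entry */

        if (len < sizeof(path)) {
            memcpy(path, p, len);
            path[len] = '\0';
            present = exists(path);
        }
        if (!present) {
            missing++;
            if (report != NULL)
                fprintf(report, "not present, would be skipped: %.*s\n",
                    (int)len, p);
        }
        p += len;
    }
    return missing;
}
```

For example, calling count_missing(list, path_exists, stderr) from the root of
the primary flash would name hostname.sc2 if it is in the list but not on
disk.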

It's highly likely that the list of config files is not
correct/complete.  Please hit us with errors so they can be remedied.

Cheers,

a

PS There is this mysterious file elog.conf which I think we took off
the list semi-recently.  Anybody have an objection to that?


On Wed, 13 Feb 2008 18:26:21 -0800 "Charissa Willard"
<charissa.willard@onstor.com> wrote:

> I was just listing the files for the /etc directory that include the
> files for initial configuration. I think Larry was consolidating the
> list of all files, so now I don't know where all of them are anymore
> :-(. I'll need to search for the complete list when I get a chance.
>
> The /etc/hostname.sc1 contains "dhcp" initially so that the initial
> config process will attempt to get the configuration from the dhcp
> server first. If the filer is not configured, this will be the case.
>
> From the Initial Config wiki page, under "Design":
>
> /etc/rc script will be called by init process during boot up.
>
> The rc script checks if the NG (NAS Gateway) is initialized. If the NG
> is in the Uninitialized or Initializing state, it will perform the
> following operations:
>
> *	Deletes /usr/local/agile/conf/nasgwayinit.conf [File that
> contains initial configuration values obtained via dhcp].
> *	Writes *dhcp* entry to /etc/hostname.sc1
> and /etc/hostname.sc2. This allows NG to query IP address from the
> dhcp server.
> *	Copies /usr/local/agile/etc/pmtab.init as
> /usr/local/agile/etc/pmtab.
> *	Creates /etc/passwd file from /etc/master.passwd file.
> *	Creates /etc/mail/aliases file for sendmail.
>
> rc then calls /etc/netstart script. netstart script will run
> /sbin/dhclient since the /etc/hostname.<interface> file contains
> *dhcp* entry.
>
> dhclient will then call /dbin/dhclient-script based on the result of
> the dhcp server query. Below are the possible reasons for
> dhclient calling dhclient-script.
>
> ________________________________
>
> From: Chris Vandever
> Sent: Wednesday, February 13, 2008 6:10 PM
> To: Charissa Willard; Jan Seidel; Andy Sharp
> Subject: RE: Initial config wiki page
>
> That's really old code because /usr/local/agile has been deprecated,
> and agile.conf no longer exists.  The command should copy not only
> the files that are common between bsd and linux (which, off the top
> of my head, should also include /onstor/conf/cluster.conf and
> /onstor/conf/cluster.db.DB*), but also the files that are
> OS-specific.  I know that system upgrade and compact flash utilities
> were new for cougar in sub7, but they shouldn't have broken the bsd
> versions, which it appears they have.  :-(  I believe Andy was
> working in this area recently, so perhaps he can shed some light
> there.
>
> Note that /etc/hostname.sc2 was not copied, and /etc/hostname.sc1 did
> not appear to have been copied correctly as it contained "dhcp" rather
> than the correct IP addr and netmask.
>
> I've suggested to Jan that he install sub5 to get the config copied
> via FTI, as that's what I did yesterday and it seemed to work.
>
> ChrisV
>
> ________________________________
>
> From: Charissa Willard
> Sent: Wednesday, February 13, 2008 5:48 PM
> To: Jan Seidel
> Cc: Chris Vandever
> Subject: RE: Initial config wiki page
>
> The "system copy config" command in /ssc-nfxsh/cmd_system.c calls the
> do_copy_config function to copy the config files in the /etc directory
> to the secondary flash. The files are specified in the
> sys_config_files string declared in the cmd_flash.c file.
>
> int
> do_copy_config(int argc, char **argv)
> {
>     return copy_to_secondary(sys_config_files, FALSE);
> }
>
> /* List of system configuration files to copy by the "copy config" command */
> char *sys_config_files =
>     "usr/local/agile/conf/agile.conf etc/hosts "
>     "etc/mygate etc/hostname.sc1 etc/hostname.sc2 etc/master.passwd "
>     "etc/passwd etc/pwd.db etc/spwd.db etc/myname";
>
> Which files listed above that exist on the primary flash were not
> copied to the secondary flash?
>
> ________________________________
>
> From: Jan Seidel
> Sent: Wednesday, February 13, 2008 5:02 PM
> To: Charissa Willard
> Cc: Chris Vandever
> Subject: RE: Initial config wiki page
>
> Hello Charissa,
>
> The 'system config copy' didn't copy all the hostname files. We didn't
> modify the nasgwayinit.status file, because we don't know what other
> files might be missing.
>=20
> See below the output of the system config copy command:
>
> eng25 diag> system config copy
> Mounted secondary disk
> Starting to copy the files to the secondary disk
> etc/hosts
> etc/mygate
> etc/hostname.sc1
> etc/master.passwd
> etc/passwd
> etc/pwd.db
> etc/spwd.db
> etc/myname
> onstor/conf/cluster.conf
> onstor/conf/cluster.db.DB0
> onstor/conf/cluster.db.DBSYS1
> etc/resolv.conf
> etc/ssh/ssh_host_dsa_key.pub
> etc/ssh/ssh_host_dsa_key
> etc/ssh/ssh_host_key.pub
> etc/ssh/ssh_host_key
> etc/ssh/ssh_host_rsa_key
> etc/ssh/ssh_host_rsa_key.pub
> etc/localtime
> etc/syslog.conf
> etc/ntp.conf
> etc/ntp/step-tickers
> onstor/etc/elog.conf
> onstor/etc/syslog.conf
> onstor/etc/snmpd.conf
> Unmounting /mnt1/onstor/conf...done.
> Unmounting /mnt1/tmp...done.
> Unmounting /mnt1/var...done.
> Unmounting /mnt1/...done.
> eng25 diag>
>
> ________________________________
>
> From: Charissa Willard
> Sent: Wednesday, February 13, 2008 4:45 PM
> To: Jan Seidel
> Cc: Chris Vandever
> Subject: Initial config wiki page
>
> http://wiki.onstor.net/wiki/SW:Initial_Configuration
>
> NAS Gateway can be in one of the following states:
>
> 1. Uninitialized - initial parameters have not been configured yet, or
> the "system config reset" command has been issued
> 2. Initializing - another user is currently initializing the filer and
> the initialization process is in progress
> 3. Initialized - filer is already configured
>
> These states are stored in the /usr/local/agile/conf/nasgwayinit.status
> file
>
