AF:
NF:0
PS:10
SRH:1
SFN:
DSR:
MID:<20081029141802.2cd412e2@ripper.onstor.net>
CFG:
PT:0
S:andy.sharp@onstor.com
RQ:
SSV:onstor-exch02.onstor.net
NSV:
SSH:
R:<john.rogers@onstor.com>,<maxim.kozlovsky@onstor.com>,<chris.vandever@onstor.com>,<ed.kwan@onstor.com>,<dl-mightydog-alert@onstor.com>
MAID:1
X-Sylpheed-Privacy-System:
X-Sylpheed-Sign:0
SCF:#mh/Mailbox/sent
RMID:#imap/andys@onstor.net@onstor-exch02.onstor.net/INBOX	0	2779531E7C760D4491C96305019FEEB5175A6CF9AC@exch1.onstor.net
X-Sylpheed-End-Special-Headers: 1
Date: Wed, 29 Oct 2008 14:20:36 -0700
From: Andrew Sharp <andy.sharp@onstor.com>
To: "John Rogers" <john.rogers@onstor.com>
Cc: Maxim Kozlovsky <maxim.kozlovsky@onstor.com>, "Chris Vandever"
 <chris.vandever@onstor.com>, "Ed Kwan" <ed.kwan@onstor.com>,
 "dl-mightydog-alert" <dl-mightydog-alert@onstor.com>
Subject: Re: exim on dogfood
Message-ID: <20081029142036.648607d6@ripper.onstor.net>
In-Reply-To: <2779531E7C760D4491C96305019FEEB5175A6CF9AC@exch1.onstor.net>
References: <2779531E7C760D4491C96305019FEEB5175A6CF9AB@exch1.onstor.net>
	<2779531E7C760D4491C96305019FEEB5175A6CF9AC@exch1.onstor.net>
Organization: Onstor
X-Mailer: Sylpheed-Claws 2.6.0 (GTK+ 2.8.20; x86_64-pc-linux-gnu)
Mime-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit

Zee plan:

1.  Examine various config files wrt what I think they should be,
what they are on the secondary CF, and what they are on my cougar.

/etc/mailname
/etc/aliases
/etc/hosts
/etc/network/interfaces
/etc/udev/rules.d/z25.persistent-net.rules
/etc/resolv.conf
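
A rough sketch of how the step 1 comparison could be scripted, assuming
the two sets of files are reachable as directory trees (for example the
secondary CF mounted somewhere, or a copy pulled from the cougar).  The
function name and the REF_ROOT/LIVE_ROOT arguments are made up for
illustration; only the file list comes from above.

```shell
#!/bin/sh
# Sketch only: compare the step 1 config files under two root directories.
# The two roots are hypothetical, e.g. a mounted secondary CF vs. "" for /.

CONFIG_FILES="
/etc/mailname
/etc/aliases
/etc/hosts
/etc/network/interfaces
/etc/udev/rules.d/z25.persistent-net.rules
/etc/resolv.conf
"

# compare_configs REF_ROOT LIVE_ROOT
# Prints one status line per file, plus a unified diff for files that differ.
# Returns 0 if every file is present and identical, 1 otherwise.
compare_configs() {
    ref_root=$1
    live_root=$2
    status=0
    for f in $CONFIG_FILES; do
        if [ ! -e "$ref_root$f" ] || [ ! -e "$live_root$f" ]; then
            echo "MISSING $f"
            status=1
        elif cmp -s "$ref_root$f" "$live_root$f"; then
            echo "SAME    $f"
        else
            echo "DIFFER  $f"
            # diff exits non-zero when files differ; that is expected here.
            diff -u "$ref_root$f" "$live_root$f" || true
            status=1
        fi
    done
    return $status
}
```

Something like "compare_configs /mnt/secondary-cf ''" would then compare
the mounted copy against the running system (the mount point is a
placeholder, not a real path on dogfood).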


2.  If I find anything that I think I want to change, run that past you
first and, pending approval, make those changes and see if things clear
up.  Possibly with a

/etc/init.d/exim4 stop; killall exim4; /etc/init.d/exim4 start

thrown in to clear things out.

3.  If no-joy on above, then retrofit changes from perforce checkins
30682 and 31009 to the cfmon files and exim4-rm-frozen scripts (not
bothering with the C changes).
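
For reference, a minimal sketch of the kind of frozen-message cleanup
involved.  This is not the contents of the exim4-rm-frozen scripts from
those checkins; it assumes exiqgrep -z -i prints one message id per
line and that ids have the usual exim 4 shape (6-6-2 alphanumerics),
which is exactly the output format still to be verified.  The function
names and the DRY_RUN variable are made up for illustration.

```shell
#!/bin/sh
# Sketch only: list frozen exim messages and optionally remove them.
# Assumption: "exiqgrep -z -i" prints one frozen message id per line.

# filter_msgids: read lines on stdin and pass through only those that
# look like exim 4 message ids (xxxxxx-yyyyyy-zz); anything else is dropped.
filter_msgids() {
    grep -E '^[0-9A-Za-z]{6}-[0-9A-Za-z]{6}-[0-9A-Za-z]{2}$'
}

# rm_frozen: remove every frozen message in the queue.
# With DRY_RUN=1 (the default) it only prints the commands it would run.
rm_frozen() {
    exiqgrep -z -i | filter_msgids | while read -r id; do
        if [ "${DRY_RUN:-1}" = 1 ]; then
            echo "exim4 -Mrm $id"
        else
            exim4 -Mrm "$id"
        fi
    done
}
```

Running rm_frozen as is only prints the removal commands; DRY_RUN=0
rm_frozen would actually delete the messages, so the dry run is worth
eyeballing first.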


On Wed, 29 Oct 2008 08:48:03 -0700 "John Rogers"
<john.rogers@onstor.com> wrote:

> The system in general is very unstable at the moment; we are in danger
> of hitting any number of defects due to the high number of procs
> running on the ssc. This seems to aggravate the rmc dropped messages.
> Mostly we are seeing dropped messages during storage device
> inquiries. Volume commands time out, requests for volume information
> time out, etc.
> 
> If you have some time today Andy, let's discuss a quick solution. I
> happen to think the old kill 'em and delete 'em method is in order.
> 
> We also need to perform the rmc tracing procedure that Chris has
> prescribed.
> 
> I also heard a suggestion that we should kill -4 ea.
> 
> If possible, I would like to put together, or be given a precise and
> comprehensive plan to gather the information we need and move the
> system into a healthier state.
> 
> 
> 
> John
> 
> -----Original Message-----
> From: John Rogers 
> Sent: Wednesday, October 29, 2008 8:22 AM
> To: Andy Sharp
> Cc: dl-mightydog-alert
> Subject: RE: exim on dogfood
> 
> See attached logs.
> 
> -----Original Message-----
> From: Andy Sharp 
> Sent: Friday, October 24, 2008 5:55 PM
> To: John Rogers
> Subject: exim on dogfood
> 
> Does dogfood still have some frozen messages on it?  I need to verify
> the output of the command
> 
> exiqgrep -z -i
> 
> which is supposed to output the message ids of frozen messages, but I
> can't verify the output format, which I need to do so that I can use
> it in some programs.  Because I can't find a machine with any frozen
> messages.
> 
> exiqgrep -z -c
> 
> should tell you if there are any frozen messages if you're not sure.
> 
> Thanks,
> 
> a
> 
