Date: Wed, 16 Jul 2008 10:57:41 -0700
From: Andrew Sharp <andy.sharp@onstor.com>
To: "Chris Vandever" <chris.vandever@onstor.com>
Cc: "Jonathan Goldick" <jonathan.goldick@onstor.com>, Maxim Kozlovsky
 <maxim.kozlovsky@onstor.com>
Subject: Re: FYI: #24639 (Mightydog: One node in 2-node cluster has cluster
 state N/A)
Message-ID: <20080716105741.013a227e@ripper.onstor.net>
In-Reply-To: <BB375AF679D4A34E9CA8DFA650E2B04E0AE2295E@onstor-exch02.onstor.net>
References: <20080715205932.3d07c381@ripper.onstor.net>
	<BB375AF679D4A34E9CA8DFA650E2B04E0AE2295E@onstor-exch02.onstor.net>
Organization: Onstor
X-Mailer: Sylpheed-Claws 2.6.0 (GTK+ 2.8.20; x86_64-pc-linux-gnu)
Mime-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit

On Wed, 16 Jul 2008 10:25:08 -0700 "Chris Vandever"
<chris.vandever@onstor.com> wrote:

> There WAS a clustering outage.  One node was marked down, and VTM
> tried to initiate a failover, but got wedged trying to get filer
> info, so NO failovers will occur, nor can vsvrs be moved.  Who knows
> what else may have hung had it tried to access the clusDb?

Clustering outage?  Interesting phrasing.  Anyway, I meant a data outage
for the clients.  The belief was that the temporary clustering outage
didn't cause any downtime for the clients.

> Interestingly, the disruption lasted only a few minutes on one node,
> but 13 minutes on the other.
> 
> I'm okay with leaving it p3, but I think we need to understand what
> happened, so we understand our exposure, which we don't.  Is it
> possible for time to go backwards if we had to transition between NTP
> servers?

I'm not sure; maybe.  I would expect to see something in the elogs
about a change in the NTP configuration, however.

> ChrisV
> 
> -----Original Message-----
> From: Andy Sharp 
> Sent: Tuesday, July 15, 2008 9:00 PM
> To: Chris Vandever
> Cc: Jonathan Goldick
> Subject: Re: FYI: #24639 (Mightydog: One node in 2-node cluster has
> cluster state N/A)
> 
> On Tue, 15 Jul 2008 18:46:03 -0700 "Chris Vandever"
> <chris.vandever@onstor.com> wrote:
> 
> > This is currently a p3 cougar.  Do we need to send it back to
> > triage, or is it acceptable to ship with it?
> > 
> > ChrisV
> 
> I think it's acceptable to leave it as is.  There was no outage or
> downtime, and it eventually fixed itself, hence the original P3
> designation awarded by triage.
