AF:
NF:0
PS:10
SRH:1
SFN:
DSR:
MID:<20080723171027.2dc72006@ripper.onstor.net>
CFG:
PT:0
S:andy.sharp@onstor.com
RQ:
SSV:onstor-exch02.onstor.net
NSV:
SSH:
R:<vikas.saini@onstor.com>
MAID:1
X-Sylpheed-Privacy-System:
X-Sylpheed-Sign:0
SCF:#mh/Mailbox/sent
RMID:#mh/Mailbox/clearcrap	0	WEBMAILyXLAMwBnuPnU000034a1@mail.onstor.com
X-Sylpheed-End-Special-Headers: 1
Date: Wed, 23 Jul 2008 17:10:40 -0700
From: Andrew Sharp <andy.sharp@onstor.com>
To: <vikas.saini@onstor.com>
Subject: Re: Defect  TED00024654 Cougar Migration, systems went into reboot
 loop for some time.
Message-ID: <20080723171040.5c103f82@ripper.onstor.net>
In-Reply-To: <WEBMAILyXLAMwBnuPnU000034a1@mail.onstor.com>
References: <WEBMAILyXLAMwBnuPnU000034a1@mail.onstor.com>
Organization: Onstor
X-Mailer: Sylpheed-Claws 2.6.0 (GTK+ 2.8.20; x86_64-pc-linux-gnu)
Mime-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit

On 23 Jul 2008 16:45:38 -0700 <vikas.saini@onstor.com> wrote:

> Headline: Cougar Migration, systems went into reboot loop for some
> time. id: TED00024654
> Note_Entry: What unit tests are run to verify that this problem is
> indeed gone...

Hi Vikas,

Here's what's going through my mind, besides a huge headache:

1. There's no clear problem; I went through the elogs for a while, and
each reboot seemed to be for a different reason.

2. We were doing a strange workflow: we were doing a migration to
cougar with the beta release, but beta was explicitly not supposed to
have to support that.  Some decisions were made along the way about
what did and didn't go into beta because of that, which doubtless had
an effect.  We did the best we could making these decisions, but it's
impossible to get it 100% right.

3. There's been a number of clear issues involved with that incident
(migrating mightydog to cougar hardware) and they've all be tracked
down and fixed, as far as I'm aware.  Correct me if you think otherwise.

4.  I didn't want to mark it NR, I wanted to resolve it, but I don't
have the time to wade through 200 checkins to see which might apply.
It's like more than 200 actually.  Will we not test migrations with the
GA release which is the one that is supposed to support it?

Please tell me what you think I should have done.  I'm just trying to
manage the bugs to get to the deadline the best I know how, without
cutting any corners or shipping a product I wouldn't want my name on.

Regards.

a


