AF:
NF:0
PS:10
SRH:1
SFN:
DSR:
MID:<20080327110805.5ca4988f@ripper.onstor.net>
CFG:
PT:0
S:andy.sharp@onstor.com
RQ:
SSV:onstor-exch02.onstor.net
NSV:
SSH:
R:<jonathan.goldick@onstor.com>
MAID:1
X-Sylpheed-Privacy-System:
X-Sylpheed-Sign:0
SCF:#mh/Mailbox/sent
RMID:#imap/andys@onstor.net@onstor-exch02.onstor.net/INBOX	0	BB375AF679D4A34E9CA8DFA650E2B04E091AA9B0@onstor-exch02.onstor.net
X-Sylpheed-End-Special-Headers: 1
Date: Thu, 27 Mar 2008 11:09:29 -0700
From: Andrew Sharp <andy.sharp@onstor.com>
To: "Jonathan Goldick" <jonathan.goldick@onstor.com>
Subject: Re: Linux kernel panic on TOT
Message-ID: <20080327110929.2ee0e162@ripper.onstor.net>
In-Reply-To: <BB375AF679D4A34E9CA8DFA650E2B04E091AA9B0@onstor-exch02.onstor.net>
References: <BB375AF679D4A34E9CA8DFA650E2B04E091AA9B0@onstor-exch02.onstor.net>
Organization: Onstor
X-Mailer: Sylpheed-Claws 2.6.0 (GTK+ 2.8.20; x86_64-pc-linux-gnu)
Mime-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit

That's a good one.  Looks like some kind of unhappy interaction between
the management bus driver and the compact flash subsystem.  Why is it
always involving CF?

I will look into it and see what I can find out.

2.6 kernels output the translated oops info right into the log file and
the console so there's no need for a separate oops file or crash dump.

But since this was a panic that took place in the interrupt handler
for the CF driver, I'm just going to guess that it didn't make it to
the log files ~:^)


On Thu, 27 Mar 2008 09:52:14 -0700 "Jonathan Goldick"
<jonathan.goldick@onstor.com> wrote:

> I was rebooting.  Not sure if an oops file is created somewhere,
> couldn't find it.  It's possible that I missed some step in upgrading
> my filer since I rarely use system upgrade.
> 
> Filer is g11r203, SSC IP 10.2.203.11
> 
> g11r203 diag> syst reb -f -y
> Data bus error, epc == ffffffff8218afe4, ra == ffffffff8204f614
> Oops[#1]:
> Cpu 0
> $ 0   : 0000000000000000 0000000014001fe1 ffffffffffffffff
> 9000000041000000
> $ 4   : 0000000000000038 a8000000049b6000 0100000000000000
> 0000000000000000
> $ 8   : a8000000049b6000 9000000000000000 ffffffff822f0198
> 681d30042b724904
> $12   : 0000000014001fe0 000000001000001f 0000000000010000
> d005080000000004
> $16   : a80000000490e620 0000000000000000 0000000000000038
> 0000000000000000
> $20   : 0000000000000001 900000f81a7084a0 0000000000000000
> ffffffff80804eb8
> $24   : 0000000000000005 ffffffff82195ec0
> 
> $28   : ffffffff822ac000 ffffffff822af9c0 0000000000000003
> ffffffff8204f614
> Hi    : 0000000000000000
> Lo    : 0000000000001340
> epc   : ffffffff8218afe4 yenta_interrupt+0x14/0x118     Not tainted
> ra    : ffffffff8204f614 handle_IRQ_event+0x6c/0xe8
> Status: 14001fe3    KX SX UX KERNEL EXL IE 
> Cause : 0080841c
> PrId  : 00040103
> Modules linked in: autofs4
> Process swapper (pid: 0, threadinfo=ffffffff822ac000,
> task=ffffffff822b02f0)
> Stack : ffffffff8204f614 0000000000000061 ffffffff822b6820
> 0000000000000038
>         a80000000490e620 fffffffffffffbff ffffffff82310000
> ffffffff8204f764
>         0100000000000000 a80000008e19f83e a800000082d36e68
> 900000008f0012e0
>         900000008f001520 ffffffff820011a4 ffffffff822afe40
> ffffffff82001840
>         0000000000000000 0000000000000008 900000f81a65fe00
> 000000000065fe00
>         900000f81a65fe80 a80000008e19f8c8 0000000000000005
> 0000000000000000
>         ff40020300085003 0c2438060000a216 0a08010100005f78
> 681d30042b724904
>         424d53ff29000000 ffffffff8212e6b4 0000000000010000
> d005080000000004
>         000000000000008d a80000008e19f83e a800000082d36e68
> 900000008f0012e0
>         900000008f001520 900000f81a7084a0 0000000000000000
> ffffffff80804eb8
>         ...
> Call Trace:
> [<ffffffff8218afe4>] yenta_interrupt+0x14/0x118
> [<ffffffff8204f614>] handle_IRQ_event+0x6c/0xe8
> [<ffffffff8204f764>] __do_IRQ+0xd4/0x160
> [<ffffffff820011a4>] plat_irq_dispatch+0x1e4/0x1f0
> [<ffffffff82001840>] ret_from_irq+0x0/0x4
> [<ffffffff8212e378>] less_than_4units+0x14/0x48
> [<ffffffff82195fd8>] mgmtbus_hard_start_xmit+0x118/0x178
> [<ffffffff821a9ea4>] dev_queue_xmit+0x30c/0x458
> [<ffffffff82220154>] relay_hard_start_xmit+0x164/0x1e0
> [<ffffffff821a9ea4>] dev_queue_xmit+0x30c/0x458
> [<ffffffff821c88dc>] ip_queue_xmit+0x21c/0x438
> [<ffffffff821dc0d0>] tcp_transmit_skb+0x500/0xa10
> [<ffffffff821dd4b0>] tcp_retransmit_skb+0x100/0x778
> [<ffffffff821e0994>] tcp_write_timer+0x3cc/0x7b8
> [<ffffffff8202efe4>] run_timer_softirq+0x164/0x258
> [<ffffffff8202a4d4>] __do_softirq+0x94/0x140
> [<ffffffff8202a610>] do_softirq+0x90/0x98
> [<ffffffff82001840>] ret_from_irq+0x0/0x4
> [<ffffffff82003848>] cpu_idle+0x20/0x68
> [<ffffffff822d4bac>] start_kernel+0x2dc/0x358
> 
> 
> Code: ffbf0000  dca30010  8c620000 <0040202d> ac620000  dca60010
> 8cc30000  90c2
> 0804  1480001f 
> Kernel panic - not syncing: Fatal exception in interrupt
> Rebooting in 5 seconds
