AF:
NF:0
PS:10
SRH:1
SFN:
DSR:
MID:<20080211180134.09f6396b@ripper.onstor.net>
CFG:
PT:0
S:andy.sharp@onstor.com
RQ:
SSV:onstor-exch02.onstor.net
NSV:
SSH:
R:<larry.scheer@onstor.com>
MAID:1
X-Sylpheed-Privacy-System:
X-Sylpheed-Sign:0
SCF:#mh/Mailbox/sent
RMID:#imap/andys@onstor.net@onstor-exch02.onstor.net/INBOX	0	BB375AF679D4A34E9CA8DFA650E2B04E056C9372@onstor-exch02.onstor.net
X-Sylpheed-End-Special-Headers: 1
Date: Mon, 11 Feb 2008 18:01:43 -0800
From: Andrew Sharp <andy.sharp@onstor.com>
To: "Larry Scheer" <larry.scheer@onstor.com>
Subject: Re: I am getting a kernel oops with bobcat linux
Message-ID: <20080211180143.0c8221d4@ripper.onstor.net>
In-Reply-To: <BB375AF679D4A34E9CA8DFA650E2B04E056C9372@onstor-exch02.onstor.net>
References: <20080211162737.1995db37@ripper.onstor.net>
	<BB375AF679D4A34E9CA8DFA650E2B04E056C9372@onstor-exch02.onstor.net>
Organization: Onstor
X-Mailer: Sylpheed-Claws 2.6.0 (GTK+ 2.8.20; x86_64-pc-linux-gnu)
Mime-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit

Yeah, this is the lovely mipsphys driver max wrote which does no sanity
checking on the passed in address from userspace.  In this case
chassisd is passing in a bogus address, most likely '00000008'.
Rendell might be able to shed some light on it.

On Mon, 11 Feb 2008 16:29:24 -0800 "Larry Scheer"
<larry.scheer@onstor.com> wrote:

> Booting Linux kernel...
> prom_init: env[0] = 'cpuclock=799988250'
> prom_init: env[1] = 'memsize=512'
> prom_init: env[2] = 'osloadoptions=mAt'
> prom_init: env[3] = 'boot=cold'
> prom_init: env[4] = 'busclock=133'
> prom_init: env[5] = 'ipaddr=10.2.11.4'
> prom_init: env[6] = 'netmask=255.255.0.0'
> prom_init: env[7] = 'macaddr0=00:07:34:02:d8:00'
> prom_init: env[8] = 'macaddr1=00:07:34:02:d8:01'
> Linux version 2.6.22-bc9k (larrys@linux-compile.onstor.net) (gcc
> version 4.1.2 20061115 (prerelease) (Debian 4.1.1-21)) #1 Mon Feb 11
> 13:26:02 PST 2008
> CPU revision is: 00003440
> FPU revision is: 00003420
> OnStor Bobcat: Board Assembly Rev. A4
> Booting from recovery prom
> Bus clock speed: 133MHz
> Determined physical RAM map:
>  memory: 03000000 @ 00000000 (ROM data)
>  memory: 0d000000 @ 03000000 (usable)
>  memory: 10000000 @ 20000000 (reserved)
>  memory: 02000000 @ 40000000 (reserved)
> Wasting 393216 bytes for tracking 12288 unused pages
> Built 1 zonelists.  Total pages: 52736
> Kernel command line: console=ttyS0,57600n8 console=ttyMM0,57600n8
> ip=dhcp
> Primary instruction cache 16kB, physically tagged, 4-way, linesize 32
> bytes.
> Primary data cache 16kB, 4-way, linesize 32 bytes.
> Secondary cache size 256K, linesize 32 bytes.
> Synthesized TLB refill handler (27 instructions).
> Synthesized TLB load handler fastpath (39 instructions).
> Synthesized TLB store handler fastpath (39 instructions).
> Synthesized TLB modify handler fastpath (38 instructions).
> PID hash table entries: 1024 (order: 10, 4096 bytes)
> Real Time Clock base address = bf000000
> Using 399.994 MHz high precision timer.
> Dentry cache hash table entries: 32768 (order: 5, 131072 bytes)
> Inode-cache hash table entries: 16384 (order: 4, 65536 bytes)
> Memory: 207872k/212992k available (2028k kernel code, 4996k reserved,
> 467k data, 112k init, 0k highmem)
> Mount-cache hash table entries: 512
> NET: Registered protocol family 16
> per_cpu_mappings: mvmemregion.va_start = c0000000
> per_cpu_mappings: marvel_base=bc000000
> bobcat_pci1_init: io_base = 0x18000000, io_size = 0x800000
> bobcat_pci1_init: mem0_base = 0x1a000000, mem0_size = 0x2000000
> bobcat_pci1_init: mem1_base = 0x19000000, mem1_size = 0x1000000
> bobcat_pci1_init: mem2_base = 0x18800000, mem2_size = 0x800000
> bobcat_pci1_init: mem_base = 0x18800000, mem_size = 0x1800000
> bobcat_pci1_init: mv_io_base = 0x0, mv_io_size = 0x0
> bobcat_pci1_init: mv_io_base = 0x18000000, mv_io_size = 0x800000
> bobcat_pci_init: pci io_v_base = 0xb8000000
> 
> bobcat_pci_init: mips_io_port_base = 0xb8000000
> 
> bobcat_pci_init: pci_probe_only=1
> Time: MIPS clocksource has been installed.
> NET: Registered protocol family 2
> IP route cache hash table entries: 2048 (order: 1, 8192 bytes)
> TCP established hash table entries: 8192 (order: 4, 65536 bytes)
> TCP bind hash table entries: 8192 (order: 3, 32768 bytes)
> TCP: Hash tables configured (established 8192 bind 8192)
> TCP reno registered
> DS1511 Real Time Clock Driver v1.1
> io scheduler noop registered
> io scheduler deadline registered (default)
> Serial: MPSC driver $Revision: 1.00 $
> mpsc_drv_probe: Adding MPSC 0
> ttyMM0 at MMIO 0xbc008000 (irq = 52) is a MPSC
> natsemi dp8381x driver, version 2.1, Sept 11, 2006
>   originally by Donald Becker <becker@scyld.com>
>   2.4.x kernel port by Jeff Garzik, Tjeerd Mulder
> natsemi eth0: NatSemi DP8381[56] at 0x19000000 (0000:01:00.0),
> 00:07:34:02:d8:00, IRQ 8, port TP.
> natsemi eth1: NatSemi DP8381[56] at 0x19800000 (0000:01:01.0),
> 00:07:34:02:d8:01, IRQ 9, port TP.
> Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2
> ide: Assuming 33MHz system bus speed for PIO modes; override with
> idebus=xx
> pd6729: Cirrus PD6729 PCI to PCMCIA Bridge at 0x18000000 on irq 0
> pd6729: PCI_COMMAND_MEMORY enabled by firmware.
> pd6729: Enabling PCI_COMMAND_MASTER.
> pd6729: ISA irqs = 10,11  polling status changes.
> pcmcia: parent PCI bridge I/O window: 0x18000000 - 0x187f0fff
> cs: IO port probe 0x18000000-0x187f0fff: clean.
> pcmcia: parent PCI bridge Memory window: 0x19000000 - 0x19ffffff
> pcmcia: parent PCI bridge Memory window: 0x18800000 - 0x188fffff
> pd6729[0]: chip revision 2 Cap 2
> pcmcia: parent PCI bridge I/O window: 0x18000000 - 0x187f0fff
> cs: IO port probe 0x18000000-0x187f0fff: clean.
> pcmcia: parent PCI bridge Memory window: 0x19000000 - 0x19ffffff
> pcmcia: parent PCI bridge Memory window: 0x18800000 - 0x188fffff
> pd6729[1]: chip revision 2 Cap 2
> i2c /dev entries driver
> MGMTBUS: using irqs 36, 38, 5
> MGMTBUS: registered net device mgmtbus @ c2000000
> rcon device will be in the hizzy @ c20000e0
> TCP cubic registered
> NET: Registered protocol family 1
> NET: Registered protocol family 17
> NET: Registered protocol family 30
> pccard: PCMCIA card inserted into slot 0
> cs: memory probe 0x19000000-0x19ffffff: excluding
> 0x19000000-0x19ffffff cs: memory probe 0x18800000-0x188fffff:
> excluding 0x18800000-0x188fffff cs: warning: no high memory space
> available! eth0: DSPCFG accepted after 0 usec.
> eth0: link up.
> eth0: Setting full-duplex based on negotiated link capability.
> eth1: DSPCFG accepted after 0 usec.
> pccard: PCMCIA card inserted into slot 1
> cs: memory probe 0x19000000-0x19ffffff: excluding
> 0x19000000-0x19ffffff cs: memory probe 0x18800000-0x188fffff:
> excluding 0x18800000-0x188fffff cs: warning: no high memory space
> available! Sending DHCP requests ., OK
> IP-Config: Got DHCP answer from 10.0.0.143, my address is 10.2.11.4
> IP-Config: Complete:
>       device=eth0, addr=10.2.11.4, mask=255.255.0.0, gw=10.2.0.1,
>      host=eng216, domain=, nis-domain=onstorlab,
>      bootserver=10.0.0.143, rootserver=10.0.0.143,
> rootpath=/var/nfsroot/bobcat
> Looking up port of RPC 100003/2 on 10.0.0.143
> Looking up port of RPC 100005/1 on 10.0.0.143
> VFS: Mounted root (nfs filesystem) readonly.
> Freeing unused kernel memory: 112k freed
> INIT: version 2.86 booting
> Starting the hotplug events dispatcher: udevd.
> Synthesizing the initial hotplug events...done.
> udevd-event[697]: udev_node_mknod: mknod(/dev/rtc, 020660, 10, 135)
> failed: File exists
> Waiting for /dev to be fully populated...done.
> Setting parameters of disc: (none).
> Activating swap...done.
> Setting the system clock..
> Cannot access the Hardware Clock via any known method.
> Use the --debug option to see the details of our search for an access
> method.
> Cleaning up ifupdown....
> Loading kernel modules...Software Watchdog Timer: 0.07 initialized.
> soft_noboot=0 soft_margin=60 sec (nowayout= 1)
> done.
> Loading device-mapper support.
> Checking file systems...fsck 1.40-WIP (14-Nov-2006)
> done.
> Setting kernel variables...done.
> Mounting local filesystems...done.
> Activating swapfile swap...done.
> Starting ONStor initial configuration utility:.
> Setting up networking....
> Configuring network interfaces...done.
> Starting portmap daemon....
> INIT: Entering runlevel: 2
> Starting system log daemon: syslogd.
> Starting kernel log daemon: klogd.
> Starting portmap daemon...Already running..
> Setting NIS domainname to: agilestorage.
> Starting NIS services: ypbind.
> Starting MTA: exim4.
> * ALERT: exim paniclog /var/log/exim4/paniclog has non-zero size, mail
> system possibly broken
> * Not starting internet superserver: no services enabled.
> Starting OpenBSD Secure Shell server: sshd.
> Starting NFS common utilities: statd.
> Starting automounter: loading autofs4 kernel module, done.
> Starting NTP server: ntpd.
> Starting deferred execution scheduler: atd.
> Starting periodic command scheduler: crond.
> Starting ONStor services: mgmtbus/onstor/bin/emrscron -f 
>  pm.
> Data bus error, epc == 83126c34, ra == 83073330
> Oops[#1]:
> Cpu 0
> $ 0   : 00000000 9000ef00 00000001 10060020
> $ 4   : b0060020 7f904a08 00000008 7f904a00
> $ 8   : 00000000 00000000 10060020 00000000
> $12   : 8e9335a0 00000008 8e559f40 00406ac0
> $16   : 8e9335a0 00000008 7f904a00 8e559f40
> $20   : 00000008 00401b10 00440000 00440000
> $24   : 00000024 2adb284c                  
> $28   : 8e558000 8e559eb8 7f904978 83073330
> Hi    : 0000033a
> Lo    : 0001e356
> epc   : 83126c34 read_mipsphys+0xc8/0x154     Not tainted
> ra    : 83073330 vfs_read+0xc0/0x160
> Status: 9000ef03    KERNEL EXL IE 
> Cause : 0080101c
> PrId  : 00003440
> Modules linked in: autofs4 softdog
> Process chassisd (pid: 2067, threadinfo=8e558000, task=8352f400)
> Stack : 00000000 00000000 00000001 00000001 83073330 83073308 8e559d34
> 832a0000
>         00000000 2aafef10 7f904e34 0040dab4 8e9335a0 ffffffe3 00000008
> 7f904a00
>         83073a50 00401b10 00440000 8302681c 00000000 2aafef10 00000000
> 0040dab4
>         00000006 7f904a00 10060020 00000000 8300a5e0 8300a5e0 00451a19
> 00451a38
>         7f80de4c 0040c00c 10060020 00000000 00000000 9000ef00 00001068
> 00000000
>         ...
> Call Trace:
> [<83126c34>] read_mipsphys+0xc8/0x154
> [<83073330>] vfs_read+0xc0/0x160
> [<83073a50>] sys_pread64+0x94/0xac
> [<8300a5e0>] stack_done+0x20/0x3c
> 
> 
> Code: 24a50008  00642021  00e52825 <8c820000> 8c830004  afa20008
> afa3000c  8f840018  00a42824 
> Data bus error, epc == 83126c34, ra == 83073330
> Oops[#2]:
> Cpu 0
> $ 0   : 00000000 9000ef00 00000001 10060020
> $ 4   : b0060020 7fc43a08 00000008 7fc43a00
> $ 8   : 00000000 00000000 10060020 00000000
> $12   : 8e5a1c80 00000008 8360bf40 00406ac0
> $16   : 8e5a1c80 00000008 7fc43a00 8360bf40
> $20   : 00000008 00401b10 00440000 00440000
> $24   : 00000024 2adb284c                  
> $28   : 8360a000 8360beb8 7fc43978 83073330
> Hi    : 0000033a
> Lo    : 0001e356
> epc   : 83126c34 read_mipsphys+0xc8/0x154     Not tainted
> ra    : 83073330 vfs_read+0xc0/0x160
> Status: 9000ef03    KERNEL EXL IE 
> Cause : 0080101c
> PrId  : 00003440
> Modules linked in: autofs4 softdog
> Process chassisd (pid: 2070, threadinfo=8360a000, task=83529000)
> Stack : 00000000 00000000 00000001 00000001 83073330 83073308 8360bd34
> 830ff5f0
>         00000000 2aafef10 7fc43e34 0040dab4 8e5a1c80 ffffffe3 00000008
> 7fc43a00
>         83073a50 00401b10 00440000 8302681c 00000000 2aafef10 00000000
> 0040dab4
>         00000006 7fc43a00 10060020 00000000 8300a5e0 8300a5e0 00451a19
> 00451a38
>         7f80de4c 0040c00c 10060020 00000000 00000000 9000ef00 00001068
> 00000000
>         ...
> Call Trace:
> [<83126c34>] read_mipsphys+0xc8/0x154
> [<83073330>] vfs_read+0xc0/0x160
> [<83073a50>] sys_pread64+0x94/0xac
> [<8300a5e0>] stack_done+0x20/0x3c
> 
> 
> Code: 24a50008  00642021  00e52825 <8c820000> 8c830004  afa20008
> afa3000c  8f840018  00a42824 
> SoftDog: Initiating system reboot.
> 
> PMON CPU 00000000 Initializing. Standby...
> EXCEPTIONPC=00000000 CONFIG=005264b0 STATUS=00400000
> Setting up SDRAM controllers
> R9K MEMORY SIZE=20000000
> SDM  REG=00000027
> SDT  REG=283f8482
> SDRAM Memory setup done.
> Clearing cache size memory...
> Cache init done.
> Initializing ECC memory.
> Decompression bss start.
> Init bss.
> 
> -----Original Message-----
> From: Andy Sharp 
> Sent: Monday, February 11, 2008 4:28 PM
> To: Larry Scheer
> Subject: Re: I am getting a kernel oops with bobcat linux
> 
> On Mon, 11 Feb 2008 15:47:49 -0800 "Larry Scheer"
> <larry.scheer@onstor.com> wrote:
> 
> > Andy, 
> >    Since submittal 6 or 7 I have seen this problem with bobcat Linux
> > on one system. Another system PM just seems to hang.
> > 
> > It may be PM or the management bus is panicking the kernel, but that
> > is just a wild ass guess on my part. Given this is a bobcat Linux it
> > is a low priority, but some folks are trying to use bobcats in lue
> > of cougars. (Svati is one.)
> > 
> > 
> > Attached is the file with the panic output. <<kernel-panic-bl>> 
> 
> 
> I can't read this attachment, application octet/stream, it's some
> bogus m$ monkey business.  Attach it as plain text or just insert it.
