> On Sat, Jan 29, 2011 at 7:25 PM, Pasi K*rkk*inen <pasik@xxxxxx
> > On Sat, Jan 29, 2011 at 07:03:16PM +0100, Jordan Pittier wrote:
> >> Thanks for your reply. LSI has indeed newer driver for the controler;
> >> but I can't "build" it, there's an error when I try to compile it [see
> >> attachement]. I will give another try in the next days.
> >> What is puzzling is that the IO errors only occurs with Xen HV. I am
> >> 100% willing to accept that the problem is the drivers, but how come
> >> the exact same kernel (the xenified one) could work fine without Xen
> >> loaded ? I am almost a noob in kernel/driver and stuff; but I thought
> >> the drivers were entirely in the kernel.
> > Yep, the driver is entirely in the kernel, but that's not the whole
> > Xen dom0 kernel does irq handling through Xen hypervisor,
> > so that might make some drivers behave in a different way baremetal vs.
> > Also remember dom0 is a *vm*, so some timing stuff might happen
> > differently on baremetal vs. dom0.
> >> I will try with the latest kernel in a few days.
> >> SLES11SP1 ships mptfusion 4.22
> >> >> * *:*
> >> > On Sat, Jan 29, 2011 at 04:27:25PM +0100, Jordan Pittier wrote:
> >> >> * *Hi,
> >> >> * *I have been tracking a bug affecting all my servers running
> Debian Squeeze
> >> >> * *for more than a month now, and I*desperately*need your help :)*
> >> >> * *I have 10 Sun v20z servers (2*66GB SCSI disk in RAID 1 ==
> mirror). 4 of
> >> >> * *them are running Debian Squeeze with the latest Xen Debian kernel
> >> >> * *(2.6.32-5-xen-amd64 ==*2.6.32-29). The rest are running Debian
> >> >> * *(2.6.26-2-xen-amd64 ==*2.6.26-26lenny1).
> >> >> * *On a Squeeze boxe, under very high IO (such as running a IO
> stress test,
> >> >> * *ie bonnie++), server starts behaving*weirdly and I see messages
> like these
> >> >> * *in kernel.log : [see attachement]. Then the server becomes
> >> >> * *unresponsive (but doesn't "freeze") and commands such as "ls" or
> >> >> * *don't work anymore. I have to do an hard reboot. After the server
> >> >> * *reboot, the RAID array seems degraded (I am using the mpt-status
> >> >> * *and starts rebuilding. After several hours, the raid array is
> >> >> * *("clean"). The raid controler is "LSI53C1030" U320, with driver
> >> >> * *MPT SPI Host driver 3.04.06". I have attached the result of
> >> >> * *None of my Lenny boxes are affected by this issue, all of my
> Squeeze boxes
> >> >> * *are.
> >> >> * *What does it have to do with Xen ? When I boot my Squeeze boxes
> >> >> * *the Xen hypervisor but the same Xen kernel, bonnie++
> >> >> * *The issue appears only with the Xen hypervisor loaded.*
> >> >> * *There is a debian bug report for this