
Re: [Xen-devel] Regression, host crash with 4.5rc1



Hi Jan,

On 11/11/2014 0:05, Jan Beulich wrote:

And these

      [  199.775209] pcieport 0000:00:03.0: AER: Multiple Corrected error received: id=0018
      [  199.775238] pcieport 0000:00:03.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=0018(Transmitter ID)
      [  199.775251] pcieport 0000:00:03.0:   device [8086:340a] error status/mask=00001100/00002000
      [  199.775255] pcieport 0000:00:03.0:    [ 8] RELAY_NUM Rollover
      [  199.775258] pcieport 0000:00:03.0:    [12] Replay Timer Timeout

hint at a problem in the system's design. 00:03.0 is the parent bridge
of 02:00.0 (and from what I can tell that's the only device behind that
bridge), and hence the above messages can only reasonably have
their origin at the passed through VGA device.
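
For reference, the status/mask pair in the log can be decoded by hand. The following is only a minimal sketch: the bit names are taken from the AER Correctable Error Status register layout in the PCI Express Base Specification, and the helper name decode_correctable is made up for illustration (it is not part of any kernel or Xen tooling).

```python
# Bit positions in the PCIe AER Correctable Error Status / Mask registers.
# Names follow the PCI Express Base Specification; bits not listed here
# are reported as reserved/unknown.
CORRECTABLE_BITS = {
    0: "Receiver Error",
    6: "Bad TLP",
    7: "Bad DLLP",
    8: "REPLAY_NUM Rollover",
    12: "Replay Timer Timeout",
    13: "Advisory Non-Fatal Error",
    14: "Corrected Internal Error",
    15: "Header Log Overflow",
}


def decode_correctable(value_hex):
    """Return (bit, name) pairs for every bit set in a hex register value.

    Hypothetical helper, for illustration only.
    """
    value = int(value_hex, 16)
    return [
        (bit, CORRECTABLE_BITS.get(bit, "Reserved/unknown"))
        for bit in range(32)
        if (value >> bit) & 1
    ]


# Values copied from the "error status/mask=" field of the log above.
status, mask = "00001100/00002000".split("/")
print("status:", decode_correctable(status))
print("mask:  ", decode_correctable(mask))
```

The two status bits match the [ 8] and [12] lines the kernel printed, both Data Link Layer replay conditions. The mask has only bit 13 set, so neither reported error is masked.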


Okay, I did a bisection and was not able to correlate the above error message with the problem I'm seeing. I'm not saying it's unrelated, but I had plenty of successful test runs with that error present.

Took me about a week (sometimes it takes as much as 6 hours to produce the error), but bisect narrowed it down to this commit:

http://xenbits.xen.org/gitweb/?p=xen.git;a=commit;h=9a727a813e9b25003e433b3dc3fa47e621f9e238

What do you think?

Thanks!

Steve



 

