[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: [Xen-devel] XEN 4.0 + 2.6.31.13 pvops kernel : system crashes on starting 155th domU



> [    0.000000] nr_irqs_gsi: 48
> [    0.500076] NR_IRQS:5120 nr_irqs:944

That looks to be different from the previous bootup:
[    0.000000] NR_IRQS:4352 nr_irqs:256

>> YA: I think you may have mixed up the post by John. He has a different 
>> dual core machine
>> and he is trying the same edits as me. He was getting nr_irqs as 256 
>> while I was getting 944.
>> [    0.000000] nr_irqs_gsi: 48
>> [    0.500270] NR_IRQS:5120 nr_irqs:944
>> Also it did not matter if I changed the VECTORS to 1024 instead of the 
>> original 256.

>
> Also, as earlier it crashes on the same number of domU (154). I didnât
> mention earlier, this a dual core Nehalem machine  -- 2 (sockets) * 4 
> cores
> per CPU * 2 (hyperthreading)

Lots of logical CPUs, weird that your nr_irqs initially was that much lower.

>> No, as mentioned above, my machine has the larger number of CPUs while 
>> John McCullough
>> machine has lesser number of CPUs.

Anyhow, you mentioned that you narrowed it down to not being enough IRQs
- how did you find that out? Was there an kernel message when you
  started the 155th guest?

>>> yes, as I mentioned in an email, right after I started up the 154th 
>>> guest there was
>>> a log message printed out as below:
>>> Kernel Panic - not syncing: No available IRQs to bind to: increase 
>>> nr_irqs!

Oh, also you say that the /proc/interrupts showed the number descending
from 255 down to 89. With this it should have started at 944 and gone
down to 49? Which roughly means 175 guests?

>> Yes, the interrupts started counting down from 944 as I increased the 
>> number of domUs;
>> I am attaching the /proc/interrupts log file for three cases (no domU, 
>> after 1 domU,
>> and then after 150 domU).  The machine crashed after the 154th domU was 
>> started.

Thank you all for helping out to debug this. As I mentioned at the XEN 
summit, once I do fix these issues Iâd really like to create  wiki stub for 
any one else trying to do this. It looks like in the original 2.6.18 kernel 
people did indeed try this, but I guess some of the edits/changes didnât 
make it to the 2.6.31 pvops branch....

Attachment: interrupts-after1domU
Description: Binary data

Attachment: interrupts-after150domUs
Description: Binary data

Attachment: interrupts-beforestarting-domUs
Description: Binary data

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel

 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.