Linux DomU freezes and dies under heavy memory shuffling
Hi!

All of a sudden (but only after a few days of running normally), on a stock Ubuntu 18.04 (Bionic with 4.15.0 kernel) DomU I'm seeing Microsoft's .NET runtime go into a heavy GC cycle and then freeze and die as shown below. This is under stock Xen 4.14.0 on a pretty unremarkable x86_64 box made by Supermicro.

I would really appreciate any thoughts on the subject, or at least directions in which I should go to investigate this. At this point this part of Xen is a bit of a mystery to me, but I'm very much willing to learn ;-)

From my completely uneducated guess it feels like some kind of an issue between DomU shuffling memory much more than normal and Xen somehow getting unhappy about that:

[376900.874560] watchdog: BUG: soft lockup - CPU#0 stuck for 23s! [dotnet:3518]
[376900.874764] Kernel panic - not syncing: softlockup: hung tasks
[376900.874793] CPU: 0 PID: 3518 Comm: dotnet Tainted: G L 4.15.0-112-generic #113-Ubuntu
[376900.874824] Hardware name: Xen HVM domU, BIOS 4.14.0 12/15/2020
[376900.874847] Call Trace:
[376900.874860] <IRQ>
[376900.874874] dump_stack+0x6d/0x8e
[376900.874892] panic+0xe4/0x254
[376900.874911] watchdog_timer_fn+0x21e/0x230
[376900.874928] ? watchdog+0x30/0x30
[376900.874947] __hrtimer_run_queues+0xdf/0x230
[376900.874970] hrtimer_interrupt+0xa0/0x1d0
[376900.874989] xen_timer_interrupt+0x20/0x30
[376900.875008] __handle_irq_event_percpu+0x44/0x1a0
[376900.875031] handle_irq_event_percpu+0x32/0x80
[376900.875053] handle_percpu_irq+0x3d/0x60
[376900.875071] generic_handle_irq+0x28/0x40
[376900.875090] __evtchn_fifo_handle_events+0x172/0x190
[376900.875112] evtchn_fifo_handle_events+0x10/0x20
[376900.875133] __xen_evtchn_do_upcall+0x49/0x80
[376900.875156] xen_evtchn_do_upcall+0x2b/0x50
[376900.875177] xen_hvm_callback_vector+0x90/0xa0
[376900.875197] </IRQ>
[376900.875211] RIP: 0010:smp_call_function_single+0xdc/0x100
[376900.875230] RSP: 0018:ffffaaa3c1807c20 EFLAGS: 00000202 ORIG_RAX: ffffffffffffff0c
[376900.875261] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
[376900.875288] RDX: 0000000000000001 RSI: 0000000000000003 RDI: 0000000000000003
[376900.875314] RBP: ffffaaa3c1807c70 R08: fffffffffffffffc R09: 0000000000000002
[376900.875341] R10: 0000000000000040 R11: 0000000000000000 R12: ffff8e0ab2c1de70
[376900.875368] R13: 0000000000000000 R14: ffffffff95a7ecd0 R15: ffffaaa3c1807d08
[376900.875396] ? flush_tlb_func_common.constprop.10+0x230/0x230
[376900.875424] ? flush_tlb_func_common.constprop.10+0x230/0x230
[376900.875449] ? unmap_page_range+0xbbc/0xd00
[376900.875470] smp_call_function_many+0x1cc/0x250
[376900.875491] ? smp_call_function_many+0x1cc/0x250
[376900.875513] native_flush_tlb_others+0x3c/0xf0
[376900.875534] flush_tlb_mm_range+0xae/0x110
[376900.875552] tlb_flush_mmu_tlbonly+0x5f/0xc0
[376900.875574] arch_tlb_finish_mmu+0x3f/0x80
[376900.875592] tlb_finish_mmu+0x23/0x30
[376900.875610] unmap_region+0xf7/0x130
[376900.875629] do_munmap+0x276/0x450
[376900.875647] vm_munmap+0x69/0xb0
[376900.875664] SyS_munmap+0x22/0x30
[376900.875682] do_syscall_64+0x73/0x130
[376900.875701] entry_SYSCALL_64_after_hwframe+0x41/0xa6
[376900.875721] RIP: 0033:0x7f05ad52dd59
[376900.875737] RSP: 002b:00007f05a8037150 EFLAGS: 00000246 ORIG_RAX: 000000000000000b
[376900.875765] RAX: ffffffffffffffda RBX: 000056517e2a08c0 RCX: 00007f05ad52dd59
[376900.875791] RDX: 0000000000000000 RSI: 0000000000006a00 RDI: 00007f05aad8f000
[376900.875818] RBP: 0000000000006a00 R08: 0000000000020b18 R09: 0000000000000000
[376900.875844] R10: 0000000000020ad0 R11: 0000000000000246 R12: 0000000000000001
[376900.875870] R13: 0000000000000000 R14: 000056517eb02300 R15: 00007f05aad8f000

Thanks,
Roman.
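P.S. In case it helps anyone triage similar reports across several DomUs, here is a minimal sketch for pulling the soft-lockup header fields (CPU, stuck time, process name, PID) out of a saved console log. The function name `find_soft_lockups` and the dict keys are just illustrative choices, and the pattern assumes the message format seen in the 4.15 kernel log above; other kernel versions may word the line differently.

```python
import re

# Matches the watchdog's soft-lockup line as it appears in the log above, e.g.
# "[376900.874560] watchdog: BUG: soft lockup - CPU#0 stuck for 23s! [dotnet:3518]"
SOFT_LOCKUP = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+watchdog: BUG: soft lockup - "
    r"CPU#(?P<cpu>\d+) stuck for (?P<secs>\d+)s! "
    r"\[(?P<comm>[^:\]]+):(?P<pid>\d+)\]"
)


def find_soft_lockups(log_text):
    """Return one dict per soft-lockup report found in log_text."""
    events = []
    for m in SOFT_LOCKUP.finditer(log_text):
        events.append({
            "timestamp": float(m.group("ts")),    # seconds since boot
            "cpu": int(m.group("cpu")),           # CPU that was stuck
            "stuck_seconds": int(m.group("secs")),
            "comm": m.group("comm"),              # process name
            "pid": int(m.group("pid")),
        })
    return events
```

Feeding it the text of a saved console log returns a list that can then be grouped by `comm` or `cpu` to see whether the lockups always hit the same process.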