[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] "CPU N still not dead..." messages during microcode update stage of boot when smt=0


  • To: Andy Smith <andy@xxxxxxxxxxxxxx>
  • From: Jan Beulich <JBeulich@xxxxxxxx>
  • Date: Mon, 22 Jul 2019 09:16:08 +0000
  • Accept-language: en-US
  • Arc-authentication-results: i=1; mx.microsoft.com 1;spf=pass smtp.mailfrom=suse.com;dmarc=pass action=none header.from=suse.com;dkim=pass header.d=suse.com;arc=none
  • Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=hbVyaBzImDh218bQFFKnw4paz/0Mzsp/qtnKxUxPqMc=; b=ctu1VUt//NmV0uSl6sZbwc91PWOahfBD7vkkqmeHixP3Dcu3ENAS1hjDYmTZ3GkE9lyXAhSoachwYRWxAd9dRtHtjwov+f8vZpQzH0a4oGcYf8yuNWXlZe0lD2KakIXX9rZ71jhbPYgOfWDdTW0HnT0K2LLkz0scPewUppSWZuF8NXswubR8enPSkO31B/tf0ZX+j0rJ5ncnkikqREnti+ONf4IdT7ulvtqJo513yuCUVyHpc284UgIx5HNNL33YYwelKyKDsyLtz8vW4BBm3VUk8zSpRvv0Enh5jipMA0vTmXkIv/eHhX5tbBVa+8Dby7Sivz4C0ayz/wW+66wFTg==
  • Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=jt7Ptsyu598LTls00qX/bQ7aI664FRLfcC0qy++hAkcD6oiDaL4JyBU3nNMwn1utMCXt4Mcgq+UL+aaKu7dPX4nGmcBEALqtwbBKXZPbE0OPVQ6bMm4DlMy5C6nY0gUabCYcuvoxbsn3wimIojXGaPsqthVKhicVgSTwopLYiXGUdV7++fCYOcUPn01ShQ7ENHEeKouPHfC7wdxSqItcmLhXK8/2jTD+zy5B+OtyuMoV5/0zUsX1NQ8byvnXiD+Prtt9rQzFoB7z4FfNIAnnMNNuVdPN21xEHSLT+oPJBrEewwHBNqxscHEaxEd4ljHAqdgjTaqG+OKMt69Ysbfnxw==
  • Authentication-results: spf=none (sender IP is ) smtp.mailfrom=JBeulich@xxxxxxxx;
  • Cc: xen-devel <xen-devel@xxxxxxxxxxxxxxxxxxxx>
  • Delivery-date: Mon, 22 Jul 2019 09:16:40 +0000
  • List-id: Xen developer discussion <xen-devel.lists.xenproject.org>
  • Thread-index: AQHVP//515Jyq7gbbE+1g3GDqkihQ6bWXCcA
  • Thread-topic: [Xen-devel] "CPU N still not dead..." messages during microcode update stage of boot when smt=0

On 21.07.2019 22:06, Andy Smith wrote:
> Hi,
> 
> My first time using smt=0 on hypervisor command line so not sure how
> many versions and different pieces of hardware this happens with,
> but I noticed this during the microcode update stage of boot:
> 
> (XEN) HVM: HAP page sizes: 4kB, 2MB, 1GB
> (XEN) Adding cpu 1 to runqueue 0
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) CPU 1 still not dead...
> (XEN) Removing cpu 1 from runqueue 0
> (XEN) microcode: CPU2 updated from revision 0x2000057 to 0x200005e, date = 
> 2019-04-02
> (XEN) Adding cpu 2 to runqueue 0
> (XEN) Adding cpu 3 to runqueue 0
> (XEN) Removing cpu 3 from runqueue 0
> (XEN) microcode: CPU4 updated from revision 0x2000057 to 0x200005e, date = 
> 2019-04-02
> (XEN) Adding cpu 4 to runqueue 0
> (XEN) Adding cpu 5 to runqueue 0
> (XEN) Removing cpu 5 from runqueue 0
> (XEN) microcode: CPU6 updated from revision 0x2000057 to 0x200005e, date = 
> 2019-04-02
> (XEN) Adding cpu 6 to runqueue 0
> (XEN) Adding cpu 7 to runqueue 0
> (XEN) Removing cpu 7 from runqueue 0
> (XEN) microcode: CPU8 updated from revision 0x2000057 to 0x200005e, date = 
> 2019-04-02
> (XEN) Adding cpu 8 to runqueue 0
> (XEN) Adding cpu 9 to runqueue 0
> (XEN) Removing cpu 9 from runqueue 0
> (XEN) microcode: CPU10 updated from revision 0x2000057 to 0x200005e, date = 
> 2019-04-02
> (XEN) Adding cpu 10 to runqueue 0
> (XEN) Adding cpu 11 to runqueue 0
> (XEN) Removing cpu 11 from runqueue 0
> (XEN) microcode: CPU12 updated from revision 0x2000057 to 0x200005e, date = 
> 2019-04-02
> (XEN) Adding cpu 12 to runqueue 0
> (XEN) Adding cpu 13 to runqueue 0
> (XEN) Removing cpu 13 from runqueue 0
> (XEN) microcode: CPU14 updated from revision 0x2000057 to 0x200005e, date = 
> 2019-04-02
> (XEN) Adding cpu 14 to runqueue 0
> (XEN) Adding cpu 15 to runqueue 0
> (XEN) Removing cpu 15 from runqueue 0
> (XEN) Brought up 8 CPUs
> (XEN) Parked 8 CPUs
> 
> It doesn't happen with smt=1 and it also doesn't happen when SMT is
> disabled in the BIOS.
> 
> Boot does continue normally after this point.
> 
> Is this expected? 4.12.

"Expected" isn't the right word. I've noticed this too on one or two
occasions, and I can't (yet) explain what's going on there, the more
that so far (including your report) this is only ever for the first
CPU to get re-offlined. Something to be looked into as time permits.

Jan
_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxxx
https://lists.xenproject.org/mailman/listinfo/xen-devel

 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.