[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH v4 05/21] IOMMU/x86: restrict IO-APIC mappings for PV Dom0


  • To: Roger Pau Monné <roger.pau@xxxxxxxxxx>
  • From: Jan Beulich <jbeulich@xxxxxxxx>
  • Date: Wed, 4 May 2022 11:32:51 +0200
  • Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=suse.com; dmarc=pass action=none header.from=suse.com; dkim=pass header.d=suse.com; arc=none
  • Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=qQ/tWqPXcQF2o2dkJ8KHWyYx75+mzdxX5FgcKh1v/Fc=; b=hPmdY8KKolCcgvffgrvs2KVczXum3JmFaWIqGA6116ZOiw6n91bFk3MbjE+67ChgZivheyGUOmkXOnIhEOC218xUH/Bcxm+YEwkOnkJyLH+CgR0b0a5GS5oR4drK+5SoLSlhUwDk9XAEI3y+QDAYRg1k3uJNpn91NUKYWDnI+fH58KP1kr+cjLb4nw+Osmj3oG7o8ZtDC2goW5FuiPuilxYu8ULA+YRLK553zS55aN3aUL/dqlTrEyUBhUQUE59J59D5xLsXlmzlyJb//mthibd/KoE9p7Nj9QAfIRV/tR7w1jMRDtMk+5pbawrgZ5lSISioW5uxvSVEeC2qihizTg==
  • Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=DAjJvvNxReMUyN4Grc0eOympn80EAirPFWe/RUoOrzR1JtC4LIVJQAoeZ2OLXCzYe/pwltlcgg8ddZruEqGgrzwyr597Nd8hvuqMOHedtrqvi93Y2QF8hO+P+nQqygKetiTQXjqReOaeB1m74B3rXTK8byGFrPw8X7H4rwj9dWU5nvtQk+FPiMatYh5nTG3km+ysKj6UHmHbRJqYke1x+md1dZI3tyyV/dgPwxd5m1f9CLS+4dJaz1hPOmUpZnaA2AM22AXMJZ3U+05S5fpndY3n13d+L/IiX2ujWz2f0EQMzR968i7gzOHG+BrMZ5CcYdDS0jp+fYgJemv+E+Yuaw==
  • Authentication-results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=suse.com;
  • Cc: "xen-devel@xxxxxxxxxxxxxxxxxxxx" <xen-devel@xxxxxxxxxxxxxxxxxxxx>, Andrew Cooper <andrew.cooper3@xxxxxxxxxx>, Paul Durrant <paul@xxxxxxx>
  • Delivery-date: Wed, 04 May 2022 09:33:07 +0000
  • List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On 03.05.2022 16:50, Jan Beulich wrote:
> On 03.05.2022 15:00, Roger Pau Monné wrote:
>> On Mon, Apr 25, 2022 at 10:34:23AM +0200, Jan Beulich wrote:
>>> While already the case for PVH, there's no reason to treat PV
>>> differently here, though of course the addresses get taken from another
>>> source in this case. Except that, to match CPU side mappings, by default
>>> we permit r/o ones. This then also means we now deal consistently with
>>> IO-APICs whose MMIO is or is not covered by E820 reserved regions.
>>>
>>> Signed-off-by: Jan Beulich <jbeulich@xxxxxxxx>
>>> ---
>>> [integrated] v1: Integrate into series.
>>> [standalone] v2: Keep IOMMU mappings in sync with CPU ones.
>>>
>>> --- a/xen/drivers/passthrough/x86/iommu.c
>>> +++ b/xen/drivers/passthrough/x86/iommu.c
>>> @@ -275,12 +275,12 @@ void iommu_identity_map_teardown(struct
>>>      }
>>>  }
>>>  
>>> -static bool __hwdom_init hwdom_iommu_map(const struct domain *d,
>>> -                                         unsigned long pfn,
>>> -                                         unsigned long max_pfn)
>>> +static unsigned int __hwdom_init hwdom_iommu_map(const struct domain *d,
>>> +                                                 unsigned long pfn,
>>> +                                                 unsigned long max_pfn)
>>>  {
>>>      mfn_t mfn = _mfn(pfn);
>>> -    unsigned int i, type;
>>> +    unsigned int i, type, perms = IOMMUF_readable | IOMMUF_writable;
>>>  
>>>      /*
>>>       * Set up 1:1 mapping for dom0. Default to include only conventional 
>>> RAM
>>> @@ -289,44 +289,60 @@ static bool __hwdom_init hwdom_iommu_map
>>>       * that fall in unusable ranges for PV Dom0.
>>>       */
>>>      if ( (pfn > max_pfn && !mfn_valid(mfn)) || xen_in_range(pfn) )
>>> -        return false;
>>> +        return 0;
>>>  
>>>      switch ( type = page_get_ram_type(mfn) )
>>>      {
>>>      case RAM_TYPE_UNUSABLE:
>>> -        return false;
>>> +        return 0;
>>>  
>>>      case RAM_TYPE_CONVENTIONAL:
>>>          if ( iommu_hwdom_strict )
>>> -            return false;
>>> +            return 0;
>>>          break;
>>>  
>>>      default:
>>>          if ( type & RAM_TYPE_RESERVED )
>>>          {
>>>              if ( !iommu_hwdom_inclusive && !iommu_hwdom_reserved )
>>> -                return false;
>>> +                perms = 0;
>>>          }
>>> -        else if ( is_hvm_domain(d) || !iommu_hwdom_inclusive || pfn > 
>>> max_pfn )
>>> -            return false;
>>> +        else if ( is_hvm_domain(d) )
>>> +            return 0;
>>> +        else if ( !iommu_hwdom_inclusive || pfn > max_pfn )
>>> +            perms = 0;
>>>      }
>>>  
>>>      /* Check that it doesn't overlap with the Interrupt Address Range. */
>>>      if ( pfn >= 0xfee00 && pfn <= 0xfeeff )
>>> -        return false;
>>> +        return 0;
>>>      /* ... or the IO-APIC */
>>> -    for ( i = 0; has_vioapic(d) && i < d->arch.hvm.nr_vioapics; i++ )
>>> -        if ( pfn == PFN_DOWN(domain_vioapic(d, i)->base_address) )
>>> -            return false;
>>> +    if ( has_vioapic(d) )
>>> +    {
>>> +        for ( i = 0; i < d->arch.hvm.nr_vioapics; i++ )
>>> +            if ( pfn == PFN_DOWN(domain_vioapic(d, i)->base_address) )
>>> +                return 0;
>>> +    }
>>> +    else if ( is_pv_domain(d) )
>>> +    {
>>> +        /*
>>> +         * Be consistent with CPU mappings: Dom0 is permitted to establish 
>>> r/o
>>> +         * ones there, so it should also have such established for IOMMUs.
>>> +         */
>>> +        for ( i = 0; i < nr_ioapics; i++ )
>>> +            if ( pfn == PFN_DOWN(mp_ioapics[i].mpc_apicaddr) )
>>> +                return rangeset_contains_singleton(mmio_ro_ranges, pfn)
>>> +                       ? IOMMUF_readable : 0;
>>
>> If we really are after consistency with CPU side mappings, we should
>> likely take the whole contents of mmio_ro_ranges and d->iomem_caps
>> into account, not just the pages belonging to the IO-APIC?
>>
>> There could also be HPET pages mapped as RO for PV.
> 
> Hmm. This would be a yet bigger functional change, but indeed would further
> improve consistency. But shouldn't we then also establish r/w mappings for
> stuff in ->iomem_caps but not in mmio_ro_ranges? This would feel like going
> too far ...

FTAOD I didn't mean to say that I think such mappings shouldn't be there;
I have been of the opinion that e.g. I/O directly to/from the linear
frame buffer of a graphics device should in principle be permitted. But
which specific mappings to put in place can imo not be derived from
->iomem_caps, as we merely subtract certain ranges after initially having
set all bits in it. Besides ranges not mapping any MMIO, even something
like the PCI ECAM ranges (parts of which we may also force to r/o, and
which we would hence cover here if I followed your suggestion) are
questionable in this regard.

Jan




 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.