
Re: [PATCH v4 02/11] vpci: cancel pending map/unmap on vpci removal



On 18.11.2021 16:21, Oleksandr Andrushchenko wrote:
> On 18.11.21 17:16, Jan Beulich wrote:
>> On 18.11.2021 16:11, Oleksandr Andrushchenko wrote:
>>> On 18.11.21 16:35, Jan Beulich wrote:
>>>> On 18.11.2021 15:14, Oleksandr Andrushchenko wrote:
>>>>> On 18.11.21 16:04, Roger Pau Monné wrote:
>>>>>> Indeed. In the physdevop failure case this comes from a hypercall
>>>>>> context, so maybe you could do the mapping in place using hypercall
>>>>>> continuations if required. Not sure how complex that would be,
>>>>>> compared to just deferring to the guest entry point and then dealing
>>>>>> with the possible cleanup on failure.
>>>>> This will solve one part of the equation:
>>>>>
>>>>> pci_physdev_op
>>>>>           pci_add_device
>>>>>               init_bars -> modify_bars -> defer_map -> raise_softirq(SCHEDULE_SOFTIRQ)
>>>>>           iommu_add_device <- FAILS
>>>>>           vpci_remove_device -> xfree(pdev->vpci)
>>>>>
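As the patch subject suggests, the fix for this first part is to cancel the
deferred map before vpci_remove_device() frees pdev->vpci. A minimal sketch
of that ordering only; it assumes a vpci_cancel_pending() helper like the one
proposed further down in this thread and elides everything else
vpci_remove_device() has to do:

#include <xen/pci.h>
#include <xen/vpci.h>
#include <xen/xmalloc.h>

void vpci_remove_device(struct pci_dev *pdev)
{
    /*
     * Cancel any map/unmap deferred from modify_bars() for this device
     * first, so no vCPU is left with pending work referencing ...
     */
    vpci_cancel_pending(pdev);

    /* ... the vPCI state which is freed right below. */
    xfree(pdev->vpci);
    pdev->vpci = NULL;
}
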
>>>>> But what about the other case, e.g. vpci_process_pending being triggered
>>>>> in parallel with a PCI device de-assign?
>>>> Well, that's again in hypercall context, so using hypercall continuations
>>>> may again be an option. Of course at the point a de-assign is initiated,
>>>> you "only" need to drain requests (for that device, but that's unlikely
>>>> to be worthwhile optimizing for), while ensuring no new requests can be
>>>> issued. Again, for the device in question, but here this is relevant -
>>>> a flag may want setting to refuse all further requests. Or maybe the
>>>> register handling hooks may want tearing down before draining pending
>>>> BAR mapping requests; without the hooks in place no new such requests
>>>> can possibly appear.
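
To make the draining variant more concrete, a rough sketch along those lines
is below. This is not the actual patch: vpci_remove_handlers() is a made-up
name standing in for the register hook teardown mentioned above,
map_pending/pdev are the per-vCPU fields from this series, and it relies on
the caller turning -ERESTART into a hypercall continuation as suggested
earlier in the thread.

#include <xen/pci.h>
#include <xen/sched.h>
#include <xen/vpci.h>

/*
 * True if some vCPU of the owning domain still has a deferred
 * map/unmap queued for this device.
 */
static bool vpci_pending_for_pdev(const struct pci_dev *pdev)
{
    const struct vcpu *v;

    for_each_vcpu ( pdev->domain, v )
        if ( v->vpci.map_pending && v->vpci.pdev == pdev )
            return true;

    return false;
}

/* Called from the de-assign / removal (hypercall) path. */
static int vpci_drain_pending(struct pci_dev *pdev)
{
    /* Tear down the register hooks first: no new requests can appear. */
    vpci_remove_handlers(pdev);            /* made-up helper name */

    /* Drain what is already queued by retrying via a continuation. */
    return vpci_pending_for_pdev(pdev) ? -ERESTART : 0;
}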
>>> This can probably be solved even more easily, as we were talking about
>>> pausing all vCPUs:
>> I have to admit I'm not sure. It might be easier, but it may also be
>> less desirable.
>>
>>> void vpci_cancel_pending(const struct pci_dev *pdev)
>>> {
>>>       struct domain *d = pdev->domain;
>>>       struct vcpu *v;
>>>       int rc;
>>>
>>>       while ( (rc = domain_pause_except_self(d)) == -ERESTART )
>>>           cpu_relax();
>>>
>>>       if ( rc )
>>>           printk(XENLOG_G_ERR
>>>                  "Failed to pause vCPUs while canceling vPCI map/unmap for %pp %pd: %d\n",
>>>                  &pdev->sbdf, pdev->domain, rc);
>>>
>>>       for_each_vcpu ( d, v )
>>>       {
>>>           if ( v->vpci.map_pending && (v->vpci.pdev == pdev) )
>>>
>>> This will prevent all vCPUs but the current one from running, making it
>>> impossible for vpci_process_pending to run in parallel with any hypercall.
>>> So, even without locking in vpci_process_pending, the above should
>>> be enough.
>>> The only concern here is that domain_pause_except_self may return an
>>> error code which we somehow need to handle...
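
The loop body is cut off above; presumably it would continue roughly as
below. A sketch only: it assumes the deferred ranges still live in the
upstream v->vpci.mem rangeset in this series, uses rangeset_destroy() from
xen/rangeset.h, and finishes with the unpause counterpart of the call above.

    for_each_vcpu ( d, v )
    {
        if ( v->vpci.map_pending && (v->vpci.pdev == pdev) )
        {
            /* Drop the deferred range set and clear the pending state. */
            rangeset_destroy(v->vpci.mem);
            v->vpci.mem = NULL;
            v->vpci.map_pending = false;
        }
    }

    domain_unpause_except_self(d);
}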
>> Not just this. The -ERESTART handling isn't appropriate this way
>> either.
> Are you talking about cpu_relax()?

I'm talking about that spin-waiting loop as a whole.

>>   For the moment I can't help thinking that draining would
>> be preferable over canceling.
> Given that cancellation is only going to happen on an error path or
> on device de-assign/remove, I think this can be acceptable.
> Any reason why not?

It would seem to me that the correctness of a draining approach is
going to be easier to prove than that of a canceling one, where I
expect races to be a bigger risk. Especially for something that gets
executed infrequently, if ever (error paths in particular), knowing
that things work correctly from testing alone isn't typically possible.

Jan
