|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: [PATCH 1/9] AMD/IOMMU: redo awaiting of command completion
On 09.06.2021 12:36, Andrew Cooper wrote:
> On 09/06/2021 10:26, Jan Beulich wrote:
>> The present abuse of the completion interrupt does not only stand in the
>> way of, down the road, using it for its actual purpose, but also
>> requires holding the IOMMU lock while waiting for command completion,
>> limiting parallelism and keeping interrupts off for non-negligible
>> periods of time. Have the IOMMU do an ordinary memory write instead of
>> signaling an otherwise disabled interrupt (by just updating a status
>> register bit).
>>
>> Signed-off-by: Jan Beulich <jbeulich@xxxxxxxx>
>> Reviewed-by: Paul Durrant <paul@xxxxxxx>
>
> While I agree with the direction of the patch, some of the details could
> do with improvement.
>
>>
>> --- a/xen/drivers/passthrough/amd/iommu_cmd.c
>> +++ b/xen/drivers/passthrough/amd/iommu_cmd.c
>> @@ -20,6 +20,9 @@
>> #include "iommu.h"
>> #include "../ats.h"
>>
>> +#define CMD_COMPLETION_INIT 0
>> +#define CMD_COMPLETION_DONE 1
>> +
>> static void send_iommu_command(struct amd_iommu *iommu,
>> const uint32_t cmd[4])
>> {
>> @@ -49,28 +52,31 @@ static void send_iommu_command(struct am
>> static void flush_command_buffer(struct amd_iommu *iommu,
>> unsigned int timeout_base)
>> {
>> + static DEFINE_PER_CPU(uint64_t, poll_slot);
>> + uint64_t *this_poll_slot = &this_cpu(poll_slot);
>> + paddr_t addr = virt_to_maddr(this_poll_slot);
>> uint32_t cmd[4];
>> s_time_t start, timeout;
>> static unsigned int __read_mostly threshold = 1;
>>
>> - /* RW1C 'ComWaitInt' in status register */
>> - writel(IOMMU_STATUS_COMP_WAIT_INT,
>> - iommu->mmio_base + IOMMU_STATUS_MMIO_OFFSET);
>> -
>> - /* send an empty COMPLETION_WAIT command to flush command buffer */
>> - cmd[3] = cmd[2] = 0;
>> - set_field_in_reg_u32(IOMMU_CMD_COMPLETION_WAIT, 0,
>> + ACCESS_ONCE(*this_poll_slot) = CMD_COMPLETION_INIT;
>> +
>> + /* send a COMPLETION_WAIT command to flush command buffer */
>> + cmd[0] = addr;
>> + set_field_in_reg_u32(IOMMU_CONTROL_ENABLED, cmd[0],
>> + IOMMU_COMP_WAIT_S_FLAG_MASK,
>> + IOMMU_COMP_WAIT_S_FLAG_SHIFT, &cmd[0]);
>
> set_field_in_reg_u32() is a disaster of a function - both in terms of
> semantics, and code gen - and needs to be purged from the code.
>
> It is a shame we don't have a real struct for objects in the command
> buffer, but in lieu of that, this is just
>
> cmd[0] = addr | IOMMU_COMP_WAIT_S_FLAG_MASK;
>
> which is the direction that previous cleanup has gone.
>
> There are no current users of IOMMU_COMP_WAIT_S_FLAG_SHIFT, and ...
>
>> + cmd[1] = addr >> 32;
>> + set_field_in_reg_u32(IOMMU_CMD_COMPLETION_WAIT, cmd[1],
>> IOMMU_CMD_OPCODE_MASK,
>> IOMMU_CMD_OPCODE_SHIFT, &cmd[1]);
>> - set_field_in_reg_u32(IOMMU_CONTROL_ENABLED, 0,
>> - IOMMU_COMP_WAIT_I_FLAG_MASK,
>> - IOMMU_COMP_WAIT_I_FLAG_SHIFT, &cmd[0]);
>
> ... this drops the final use of IOMMU_COMP_WAIT_I_FLAG_SHIFT, so both
> should be dropped.
>
> As for IOMMU_CMD_OPCODE_SHIFT, that can't be dropped yet, but it would
> still be better to use
>
> cmd[1] = (addr >> 32) | MASK_INSR(IOMMU_CMD_COMPLETION_WAIT,
> IOMMU_CMD_COMPLETION_WAIT);
>
> in the short term.
Okay, this conversion has indeed saved a single
and $0x0FFFFFFF, %eax
But we're down by two set_field_in_reg_u32() now; only some 30 left.
Jan
|
![]() |
Lists.xenproject.org is hosted with RackSpace, monitoring our |