[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [patch 09/10] PCI/MSI: Provide pci_msix_expand_vectors[_at]()



Hi Thomas,
On 11/26/2021 5:25 PM, Thomas Gleixner wrote:
Provide a new interface which allows to expand the MSI-X vector space if
the underlying irq domain implementation supports it.

Signed-off-by: Thomas Gleixner <tglx@xxxxxxxxxxxxx>
---
  drivers/pci/msi/msi.c |   41 +++++++++++++++++++++++++++++++++++++++++
  include/linux/pci.h   |   13 +++++++++++++
  2 files changed, 54 insertions(+)

--- a/drivers/pci/msi/msi.c
+++ b/drivers/pci/msi/msi.c
@@ -1025,6 +1025,47 @@ int pci_alloc_irq_vectors_affinity(struc
  EXPORT_SYMBOL(pci_alloc_irq_vectors_affinity);
/**
+ * pci_msix_expand_vectors_at - Expand MSI-X interrupts for a device
+ *
+ * @dev:       PCI device to operate on
+ * @at:                Allocate at MSI-X index. If @at == PCI_MSI_EXPAND_AUTO
+ *             the function expands automatically after the last
Not sure why some of these changes related to PCI_MSIX_EXPAND_AUTO and num_descs did not make it to the 'msi' branch.
Is this intentional?
+ *             active index.
+ * @nvec:      Number of vectors to allocate
+ *
+ * Expand the MSI-X vectors of a device after an initial enablement and
+ * allocation.
+ *
+ * Return: 0 if the allocation was successful, an error code otherwise.
+ */
+int pci_msix_expand_vectors_at(struct pci_dev *dev, unsigned int at, unsigned 
int nvec)
+{
+       struct msi_device_data *md = dev->dev.msi.data;
+       struct msi_range range = { .ndesc = nvec, };
+       unsigned int max_vecs;
+       int ret;
+
+       if (!pci_msi_enable || !dev || !dev->msix_enabled || !md)
+               return -ENOTSUPP;
+
+       if (!pci_msi_domain_supports_expand(dev))
+               return -ENOTSUPP;
+
+       max_vecs = pci_msix_vec_count(dev);
+       if (!nvec || nvec > max_vecs)
+               return -EINVAL;
+
+       range.first = at == PCI_MSIX_EXPAND_AUTO ? md->num_descs : at;
+
+       if (range.first >= max_vecs || nvec > max_vecs - range.first)
+               return -ENOSPC;
+
+       ret = msix_setup_interrupts(dev, dev->msix_base, &range, NULL, NULL, 
true);
+       return ret <= 0 ? ret : -ENOSPC;;
+}
+EXPORT_SYMBOL_GPL(pci_msix_expand_vectors_at);
+
I am having trouble fully comprehending how this expansion scheme would work..

For instance, say:
1. Driver requests for 5 vectors:
pci_enable_msix_range(dev, NULL, 5, 5)
=>num_descs = 5

2. Driver frees vectors at index 1,2:
range = {1, 2, 2};
pci_msi_teardown_msi_irqs(dev, range)
=>num_descs = 3; Current active vectors are at index: 0, 3, 4

3. Driver requests for 3 more vectors using the new API:
pci_msix_expand_vectors(dev, 3)
=>range.first = 3 => It will try to allocate index 3-5, but we already have 3,4 active? Ideally, we would want index 1,2 and 5 to be allocated for this request right?

Could you please let me know what I am missing?

With the 'range' approach, the issue is that we are trying to allocate contiguous indexes. Perhaps, we also need to check if all the indexes in the requested range are available, if not, find a contiguous range large enough to accommodate the request. But there will be fragmentation issues if we choose to go with this way...

I had a version of the dynamic MSI-X patch series (which never got sent out). For the expansion, I had the following: pci_add_msix_irq_vector(pdev): On each invocation, add 1 MSI-X vector to the device and return the msi-x index assigned by the kernel (using a bitmap) Correspondingly, pci_free_msix_irq_vector(pdev, irq) frees all the allocated resources associated with MSI-X interrupt with Linux IRQ number 'irq'. I had issues when trying to dynamically allocate more than 1 interrupt because I didn't have a clean way to communicate to the driver what indexes were assigned in the current allocation.

-Megha




 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.