Good brainer. ... I am geting back to few basics.
I will leave HVM and emulation aside :)
1. When PCI Passthrough support 'is not enabled' , how does domU access
PCI device ?
I suppose they continue to communicate using pcifront/back split
drivers and now dom0's
drivers are used ( right ? )
2. PCI-passthrough enables domU to own the PCI device. Now dom0 can no
longer use that device .
If domU has mapped address space of the required PCI device , why
need it talk to dom0 any further ?
What kind of hand shake is involved ( setup and tear down as you have
mentioned ) between domU and
domU " in passthrough case" ?
3. DomU discovers all PCI devices through Xenstore.
Mark Williamson wrote:
I have been trying to understand PCI -passthrough support. Please
correct me if I am wrong in my following inferences.
1. Device emulation and pass through are both implemented using split
drivers.
I'm going to be pedantic now ;-)
Device "emulation" is really what we do for fully virtualised (HVM)
guests: the device models provided by QEmu emulate real world devices
in terms of their responses to particular port IOs, mmapped IO
operations, etc. This isn't done using a front / back model because
the guest is just using it's normal drivers for the "real" devices.
The virtual devices used by the PV drivers are implemented using split
drivers, though, as you say.
The PCI passthrough support for PV guest is also implemented using a
split driver that implements the functions of the PCI bus in order to
give the guest the information it
Which is this information we are referring to here ?
requires to talk to its assigned device(s).
How are devices "assigned" to domU ? I am specifically talking about
late binding.
Having obtained this information, communication with the device is
possible directly using IO ports, memory IO regions, and DMA.
2. But, in case of emulation drivers of dom0 are used where as in case
of passthrough ( as the name suggests ) native drivers in domU are
used .
For true emulation (qemu device model), a userspace process in dom0
handles modelling a "real" device and then issues IO using normal
userspace APIs. These get serviced by the dom0 kernel using the
normal device driver.
For PV drivers, the frontend driver in the domU kernel issues requests
which are picked up by the backend driver in dom0's kernel, which then
issues requests into the IO stack. Again this uses the normal device
driver in dom0 to talk to the actual device, it's just that the
request is made using a kernel-internal API rather than a userspace
API (which results in slightly different actions being taken).
3. dom0 provides a virtual PCI device { an interface for device-OPs and
status of this virtual device} to domU and through associated event
channel domU makes
"synchronous" use of this device.
domU uses this for control plane operations, but for most work it can
talk to its PCI device directly without going through dom0.
===
Queries:
1. What i am really not so sure about is ... passthrough case
Will there be requirement to map the address space of this PCI
device in domU ? Will the page which was being shared so-far
{xen_pci_sharedinfo}
for emulation , be "flipped" ( transferred ) into domU ?
xen_pci_sharedinfo - is that the page used to talk to the PCI backend
from pcifront? If so, then no, that's just used for dom0-domU
communications.
Well As I read in the xen0linux code , xen_pci_sharedinfo contains
xen_pci_op where front end specifies which
operation is to be done on which pci device. And then it keeps checking
the status of this device through another flag in
xen_pci_sharedinfo. I infer here that dom0's drivers are being used. Can
you point me towards a code path where domU actually
uses its own drivers having owned the PCI device ?
In order to map the address space of the PCI device directly, the domU
is given permissions to map the IO memory regions of that device into
it's page tables. I think this is now possible to do using a grant
table operation...
It is also given permission to access certain IO port ranges so that
it can use the device's port IO interfaces.
2. Well , Having read the code for linux (dom0,domU) I see that
there are split
device drivers for PCI. (pci front and PCIback). Which are normally
communicating over xenbus. which looks almost like other split
drivers. How exactly then passthrough enables use of domU's drivers ?
The key thing to understand is that the pcifront / pciback is
basically just used for setup and teardown, not for the actual IO.
The real IO is done directly by the domU without going through dom0.
For the block and net drivers, *all* IO goes through dom0.
3. And if passthrough support isnt provided how will communication
between pcifront-pciback be different ? ( I guess netbsd , freebsd do
not have passthrough support yet )
I'm not entirely clear what you're asking here, but I'll take a stab
at it:
if pcifront (in domU) and pciback (in dom0) aren't available then
passthrough won't work. The dom0 has to support the backend functions
of PCI passthrough and the domU has to know how to talk to it. It's
also implicit that they're using the same interface version to talk to
each other - I'm not sure whether that's frozen stable or not.
So *if* NetBSD lacks pciback support, it can't pass PCI devices to
guests that do. Similarly, *if* it lacks pcifront support, it can't
have devices passed to it.
4. What restricts other domUs from accessing PCI device given to other
domU via passrthrough support.
There are some restrictions on what can be done in PCI config space to
prevent a guest fouling things up. These need to be relaxed for some
awkward devices, though.
For the device IO itself, domUs are only allowed to map mmio regions
and access io ports that are relevant to their device. It's possible
for these to overlap with those for other devices, in which case
you're trusting the domU to be well behaved. More crucially, though,
giving a domain a device with DMA capabilities is equivalent to giving
it the ability to subvert the entire machine. DMA can't be sandboxed
on most current hardware, so if you give DMA rights to a VM it's
automatically just as trusted as dom0 with respect to not fooling
about with other domains, hardware, etc.
Cheers,
Mark
Thanks,
Sanket
_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel
|