OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

virtio-dev message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


Subject: Re: [virtio-dev] On doorbells (queue notifications)


On Wed, 15 Jul 2020 18:25:14 +0100
Alex BennÃe <alex.bennee@linaro.org> wrote:

> 
> Cornelia Huck <cohuck@redhat.com> writes:
> 
> > On Wed, 15 Jul 2020 16:47:32 +0100
> > Stefan Hajnoczi <stefanha@redhat.com> wrote:
> >
> >> On Wed, Jul 15, 2020 at 02:29:04PM +0100, Alex BennÃe wrote:
> >> > Stefan Hajnoczi <stefanha@redhat.com> writes:  
> >> > > On Tue, Jul 14, 2020 at 10:43:36PM +0100, Alex BennÃe wrote:  
> >> > >> Finally I'm curious if this is just a problem avoided by the s390
> >> > >> channel approach? Does the use of messages over a channel just avoid the
> >> > >> sort of bouncing back and forth that other hypervisors have to do when
> >> > >> emulating a device?  
> >> > >
> >> > > What does "bouncing back and forth" mean exactly?  
> >> > 
> >> > Context switching between guest and hypervisor.  
> >> 
> >> I have CCed Cornelia Huck, who can explain the lifecycle of an I/O
> >> request on s390 channel I/O.
> >
> > Having read through this thread, I think this is mostly about
> > notifications?
> 
> Yes - as I understand it they are the only things that really cause a
> context switch between guest/hypervisor/host.
> 
> > These are not using channel programs (which are only
> > used for things like feature negotiation, or emulating reading/writing
> > a config space, which does not really exist for channel devices.)
> >
> > First, I/O and interrupts are highly abstracted on s390; much of the
> > register accesses or writes done on other architectures is just not
> > seen on s390.
> >
> > Traditionally, I/O interrupts on s390 are tied to a subchannel; you
> > have a rather heavyweight process for that:
> >
> > guest								host
> >
> > 					put status into subchannel
> > 					queue interrupt
> > open up for I/O interrupt
> > 					store some data into lowcore
> > 					do PSW swap
> > interrupt handler called
> > read from lowcore
> > call tsch for subchannel
> > 					store subchannel status into
> > 					control block
> > process control block
> > look at subchannel indicators
> > virtio queue processing
> >
> > This is only used for configuration change notifications, or for very
> > old legacy virtio implementations.
> >
> > There's an alternative mechanism not tied to a subchannel, called
> > 'adapter interrupts'. (It is even used to implement MSI-X on s390x,
> > which is why only virtio-pci devices using MSI-X are supported on
> > s390x.) It uses two-staged indicators: a global indicator to show
> > whether any secondary indicator is set, and secondary indicators (which
> > are per virtqueue in the virtio case.)
> >
> > guest								host
> >
> > 					set queue indicator(s)
> > 					set global indicator
> > 					queue interrupt iff global
> > 					indicator had not been set
> > open up for I/O interrupt
> > 					store some data into lowcore
> > 					do PSW swap
> > interrupt handler called
> > read from lowcore
> > look at indicators
> > virtio queue processing
> >
> > This has less context switches than traditional I/O interrupts; but I
> > think the main benefit comes from the ability to batch notifications:
> > as long as the guest is still processing indicators, the host does not
> > need to notify again, it can just set indicators (which is why the
> > guest always needs to do two passes at processing.) We can already
> > batch per-device indicators with the classic approach, but adapter
> > interrupts allow to batch even across many devices.
> 
> Thanks for the explanation.
> 
> I'm curious why the data that's going to be read from lowcore isn't
> loaded before the guest opens up (is this the same as unmasking?) for

You mean stored and not loaded, or?

> the interrupt? Is this because the host has to set up the guest IRQ
> itself?
> 

Hi Alex! IMHO Connie provided a detailed jet simplified and a little
confusing description  of the process of taking an IO interrupt on s390,
which is also called the interruption action.

A prerequisite for a CPU accepting an I/O interruption request is of
course  the CPU being open for it (controls: PSW, CR6). And yes this is
the masking/unmasking. The unmasking may or may not happen at the point
indicated in the ascii figures by Connie, what is important the cpu is
unmasked at that point. Right after the interruption action the
execution resumes at the interruption handler, whose address was read
(as a part of the interruption action) from the lowcore.

In that sense, there is only one interrupt handler for IO, as there
is only one new PSW slot in the lowcore. To figure out what sort of
event or events correspond to the interruption. This IO interrupt handler
looks at the so called IO interruption code. The IO interruption code
tells us if this is a subchannel associated, or an adapter IO
interruption.

If subchannel associated then, the interruption code also tells us which
subchannel is asking for attention.

If adapter interruption, further information is found (e.g.
interruption subclass) that may allow us (the guest) to limit the amount
of processing needed in order to figure out what events are associated
with this interruption. We may not need to scan all the indicator bits
(used by the
guest).

The interruption code is in turn stored by the interruption action,
might be executed by the hypervisor (is executed by the hypervisor for
subchannel interrupts, and may or may not be for adapter interrupts), and
must not happen if the cpu can not take the interruption, because it is
masked.

Regarding the number of context switches, if adapter interrupts are used,
if everything goes well even host->guest queue notifications that involve
an interrupt are done without getting a VCPU out of SIE (roughly
corresponds to VM EXIT) thanks to the mechanism
called GISA. But that is very s390 specific.

Regards,
Halil




[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]