Subject: Re: [virtio-dev] [PATCH] snd: Add virtio sound device specification


On Tue, 2019-11-12 at 12:09 +0100, Jean-Philippe Brucker wrote:
> Hi,
> 
> On Mon, Nov 11, 2019 at 03:20:55PM +0000, Liam Girdwood wrote:
> > > > > +struct virtio_snd_ctl_msg {
> > > > > +    /* device-read-only data */
> > > > > +    le32 request_code;
> > > > > +    u8 request_payload[];
> > > > 
> > > > How do I know how much data to read here? Is the size embedded in
> > > > request_code?
> > > 
> > > Yes, you are right. Actual request/response length can be easily
> > > derived from a request_code.
> > > 
> > 
> > Nak, please use a separate size and type code, otherwise we will have
> > real problems in the future when we come to add new features (with new
> > codes or bigger sizes).
> 
> Other virtio devices rely on the buffer length already provided by virtio
> for this. For example, see the virtio-net control virtqueue
> (virtio_net_ctrl*), which is extended with new commands from time to time.
> Most recently, the "receive-side scaling" feature introduces a large
> structure as payload to virtio_net_ctrl.
> 

Sorry, not following: what does this have to do with combining type and
size into request_code? Or are you saying the size is encoded elsewhere
in the virtio transport data?
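
Or do you mean something like the virtio-net control vq layout, roughly
(I'm sketching this from memory of the spec, so the details may be off):

struct virtio_net_ctrl {
    /* device-read-only */
    u8 class;
    u8 command;
    u8 command_specific_data[];
    /* device-writable */
    u8 ack;
};

i.e. no size field at all, with the device taking the payload length from
the buffer/descriptor lengths supplied by the transport?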

> [...]
> > > > > +\begin{lstlisting}
> > > > > +struct virtio_snd_pcm_xfer {
> > > > > +    le32 stream;
> > > > > +    u8 data[];
> > > > > +    le32 status;
> > > > > +    le32 actual_length;
> > > > 
> > > > Not following this, is actual_length the size of data[]? If so, it
> > > > must precede data[].
> > > 
> > > No, the actual_length field is supposed to be used by the device side
> > > to report the actual number of bytes read from/written to a buffer.
> 
> By the way, the written size is already provided by virtio (the len field
> in the used descriptor), but not the read size.
> 
> > > In a real-world scenario, if an I/O request contains N bytes, a device
> > > can play/capture *up to* N bytes. Thus, it's required to report this
> > > length back to a driver.
> > > 
> > 
> > This is not how PCM audio buffering works. I don't copy extra padding
> > if I don't need to, I only copy when my buffer is full. See the need
> > for period size in earlier comments.
> 
> [...]
> > > > 2) Zero copy of audio data and stream positions. Essential for any
> > > > low latency audio use cases (if supported by HW, hypervisor, VM
> > > > framework).
> > > 
> > > What do you mean by zero-copy? Shared memory between device and
> > > driver sides?
> > 
> > Yes, but I see this is now a separate thread as it is used by others
> > too.
> 
> The virtio transport already permits zero-copy, because one buffer can be
> described with a chain of descriptors, each pointing to a different area
> in memory. The driver can split the virtio_snd_pcm_xfer into header and
> data:
> 
>     virtqueue
>     desc ring
>     |       |
>     +-------+
>     |       |---------> header (copied)
>     +- - - -+
>     |       |---------> PCM data (mapped)
>     +-------+
>     |       |
> 
> So it might be a good idea to put all the metadata (stream, status,
> actual_length) at the beginning of the virtio_snd_pcm_xfer struct,
> otherwise the driver will need three descriptors instead of two to
> achieve this.

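So the reordering you suggest would look something like this (just
sketching to check I follow):

struct virtio_snd_pcm_xfer {
    le32 stream;
    le32 status;
    le32 actual_length;
    u8 data[];
};

with one descriptor covering the first 12 bytes of metadata (copied) and
separate descriptor(s) mapping data[] directly.
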
This would be a good improvement: it's less copying and would likely
improve the user experience. However, the buffer pointer still suffers
latency because it's queued (the same applies to stream positions going
the other way).

Low latency zero (software) copy audio needs to work like this:

1) The guest buffer's SG PHY pages are shared with the audio device driver
and/or DSP FW at stream init.

2) The device driver or DSP FW programs the DMAC IP with the SG PHY pages
of the guest buffer.

3) DMA is started and audio data is copied directly to the audio endpoint
(I2S, HDA, SDW, etc.) or directly to the DSP.

At this point, audio can be rendered by guests very close to the DMA R/W
positions (low latency) if they know the HW pointer position accurately.
The guest also needs to be able to read the stream HW position from the
device driver or DSP using a low-latency method:

4) A common timestamp between the audio HW/DSP and all guests can be used
and sent over virtio. This allows guests to interpolate any drift and
estimate the audio position with good accuracy using a common clock.

OR

5) Common stream status between the audio HW/DSP and all guests (when
supported by HW). The device driver can export an RO page of stream data
to each guest, which the DSP FW can update with true realtime rendering
positions (Intel HW supports this feature); see the rough sketch below.
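
For 5), the kind of per-stream record I have in mind (field names purely
illustrative, not from any spec) would be roughly:

struct virtio_snd_pcm_hw_status {
    /* DSP/device-writable, guest read-only */
    le64 hw_position;   /* current DMA R/W position in the ring, in bytes */
    le64 timestamp;     /* common-clock time at which hw_position was taken */
    le32 buffer_bytes;  /* ring buffer size, in bytes */
    le32 period_bytes;  /* period size, in bytes */
};

Combined with the common clock in 4), a guest can then estimate the
current position as hw_position + (now - timestamp) * byte_rate without
any virtqueue round trip.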

Thanks

Liam


