virtio-comment message



Subject: Re: [virtio-dev] Re: [PATCH v11] virtio-net: support inner header hash



On 2023/3/21 22:49, Heng Qi wrote:


On 2023/3/21 3:34 PM, Michael S. Tsirkin wrote:
On Tue, Mar 21, 2023 at 11:56:14AM +0800, Heng Qi wrote:

On 2023/3/21 3:43 AM, Michael S. Tsirkin wrote:
On Mon, Mar 20, 2023 at 07:18:40PM +0800, Heng Qi wrote:
1. Currently, a received encapsulated packet has an outer and an inner header, but the virtio device is unable to calculate the hash for the inner header. Multiple flows with the same outer header but different inner headers are steered to the
same receive queue. This results in poor receive performance.

To address this limitation, a new feature, VIRTIO_NET_F_HASH_TUNNEL, has been introduced, which enables the device to advertise the capability to calculate the hash over the inner packet header. Compared with hashing only the outer header, this restores good receive performance.
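
A minimal sketch of the idea described above, with hypothetical structure and function names; nothing here is taken from the spec or from the patch:

    /* Select which header feeds the RSS hash. With inner header hash
     * negotiated and a tunnel recognized, the inner 5-tuple is hashed,
     * so flows that share one outer tunnel header still spread across
     * receive queues. */
    #include <stdbool.h>
    #include <stdint.h>

    struct five_tuple {
        uint32_t saddr, daddr;
        uint16_t sport, dport;
        uint8_t  proto;
    };

    struct parsed_pkt {
        bool has_tunnel;          /* e.g. a GRE/VXLAN-GPE header was found */
        struct five_tuple outer;  /* outer header fields */
        struct five_tuple inner;  /* encapsulated (inner) header fields */
    };

    /* stand-in for the device's real RSS hash (e.g. Toeplitz) */
    static uint32_t rss_hash(const struct five_tuple *t)
    {
        return t->saddr ^ t->daddr ^
               ((uint32_t)t->sport << 16 | t->dport) ^ t->proto;
    }

    static uint32_t select_hash(const struct parsed_pkt *p, bool inner_hash)
    {
        if (inner_hash && p->has_tunnel)
            return rss_hash(&p->inner);
        return rss_hash(&p->outer);
    }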
So this would be a very good argument; however, the cost, it would seem,
is that we have to keep extending this indefinitely as new tunneling
protocols come to light.
But I believe that in fact we don't, at least not for this argument:
the standard way to address this is actually by propagating entropy
from the inner to the outer header.
Yes, we don't argue with this.
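
A tiny sketch of what propagating entropy means in practice, assuming a VXLAN-style encapsulator that folds a hash of the inner flow into the outer UDP source port; the function name and port range are illustrative only:

    #include <stdint.h>

    /* The encapsulating endpoint derives the outer UDP source port from a
     * hash of the inner flow, so plain outer-header RSS already spreads
     * distinct inner flows across receive queues. */
    static uint16_t outer_udp_sport(uint32_t inner_flow_hash)
    {
        return (uint16_t)(49152 + inner_flow_hash % 16384);  /* 49152..65535 */
    }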

So I'd maybe reorder the commit log and give the explanation 2 below
then say "for some legacy systems
including entropy in IP header
as done in modern protocols is not practical, resulting in
bad performance under RSS".
I agree. But it is not necessarily a legacy system: some scenarios need to
connect multiple tunnels, and for compatibility they will not use the
optional fields, or they will choose an older tunnel protocol.
compatibility ... with legacy systems, no?


2. The same flow can traverse through different tunnels, resulting in the encapsulated packets being spread across multiple receive queues (refer to the figure below). However, in certain scenarios, it becomes necessary to direct these encapsulated packets of the same flow to a single receive queue. This facilitates the processing of the flow by the same CPU to improve performance (warm caches, less locking, etc.).

        client1                   client2
           |                         |
           |       +-------+         |
           +------>|tunnels|<--------+
                   +-------+
                      | |
                      | |
                      v v
              +-----------------+
              | processing host |
              +-----------------+
"necessary" is too strong a word, I feel.
All this is, is an optimization, and we don't really know how strong it is,
even.

Here's how I understand this:

Imagine two clients client1 and client2 talking to each other.
A copy of all packets is sent to a processing host over a virtio device.
Two directions of the same flow between two clients might be
encapsulated in two different tunnels, with current RSS
strategies they would land on two arbitrary, unrelated queues.
As an optimization, some hosts might wish to make sure both directions
of the encapsulated flow land on the same queue.


Is this a good summary?
I think yes.


Now that things are beginning to be clearer, I kind of begin to agree with
Jason's suggestion that this is extremely narrow. And what if I want
one direction on queue1 and the other on queue2, e.g. adjacent numbers for
I don't understand why we need this; can you point out some usage scenarios?
If traffic is predominantly UDP, each queue can be processed in
parallel. If you need to look at the other side of the flow once
in a while, you can find it by doing ^1.

I'm not sure I fully follow you, but let me try to answer. When we place traffic in one direction on a certain queue, it means we have already calculated the hash, so we can record the five-tuple and the queue number. When traffic in the other direction arrives, we can match it against the recorded information and place it on the ^1 queue.
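
A rough sketch of that scheme; the table, its size, and the helper names are all hypothetical, and insertion/eviction of entries is elided:

    #include <stdint.h>

    struct five_tuple {
        uint32_t saddr, daddr;
        uint16_t sport, dport;
        uint8_t  proto;
    };

    struct flow_entry {
        struct five_tuple key;
        uint16_t queue;
        int used;
    };

    static struct flow_entry flows[1024];   /* recorded forward flows */

    static int same_tuple(const struct five_tuple *a, const struct five_tuple *b)
    {
        return a->saddr == b->saddr && a->daddr == b->daddr &&
               a->sport == b->sport && a->dport == b->dport &&
               a->proto == b->proto;
    }

    /* Steer a packet: if its reversed tuple was already recorded, put it
     * on the partner queue (queue ^ 1); otherwise fall back to the queue
     * chosen by the RSS hash (recording that choice is not shown here). */
    static uint16_t steer(const struct five_tuple *t, uint16_t rss_queue)
    {
        struct five_tuple rev = {
            .saddr = t->daddr, .daddr = t->saddr,
            .sport = t->dport, .dport = t->sport,
            .proto = t->proto,
        };

        for (unsigned i = 0; i < 1024; i++)
            if (flows[i].used && same_tuple(&flows[i].key, &rev))
                return flows[i].queue ^ 1;   /* e.g. 4 <-> 5, 6 <-> 7 */

        return rss_queue;
    }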


the same flow? If enough people agree this is needed we can accept this,
but did you at all consider using something programmable like BPF for this?
I think the problem is that our virtio device cannot support eBPF. We can also ask Alvaro and Parav whether their virtio devices can support eBPF offloading.
:)
This isn't ebpf, more like classic bpf. Just math done on packets,
no tables.

We would also really like to use simple BPF offloading, which would be cool. But it still takes time, for example to support parsing of BPF instructions on devices such as FPGAs, which they cannot easily do today. Few devices support it right now; I only see support for the netronome iNIC in the kernel:

   #git grep XDP_SETUP_PROG_HW
   drivers/net/ethernet/netronome/nfp/nfp_net_common.c:    case XDP_SETUP_PROG_HW:
   drivers/net/netdevsim/bpf.c:    if (bpf->command == XDP_SETUP_PROG_HW && !ns->bpf_xdpoffload_accept) {
   drivers/net/netdevsim/bpf.c:    if (bpf->command == XDP_SETUP_PROG_HW) {
   drivers/net/netdevsim/bpf.c:    case XDP_SETUP_PROG_HW:
   include/linux/netdevice.h:      XDP_SETUP_PROG_HW,
   net/core/dev.c: xdp.command = mode == XDP_MODE_HW ? XDP_SETUP_PROG_HW : XDP_SETUP_PROG;


Note that this is eBPF hardware offloading, which is much more complicated than what we propose now. For hash calculation, a simple classic BPF or something like P4 would be sufficient. The point is to allow the user to customize the hash calculation.

If this is too flexible for the hardware, it would still be better to consider a more general hash calculation pipeline (XOR, swap, hash masks, hash key customization) like:

https://docs.napatech.com/r/Feature-Set-N-ANL10/Hash-Value-Generation
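
For what it's worth, a rough sketch of what such a configurable pipeline could look like (field mask, optional source/destination swap, keyed Toeplitz hash); the knob names are invented for this example and the page above is only the inspiration:

    #include <stddef.h>
    #include <stdint.h>

    struct hash_cfg {
        const uint8_t *key;   /* RSS hash key, at least data length + 4 bytes */
        uint32_t field_mask;  /* mask applied to each selected 32-bit word */
        int swap_src_dst;     /* optionally swap source and destination words */
    };

    /* Standard Toeplitz hash: for every set input bit, XOR in the 32-bit
     * key window starting at that bit position. */
    static uint32_t toeplitz(const uint8_t *key, const uint8_t *data, size_t len)
    {
        uint32_t result = 0;
        uint32_t window = (uint32_t)key[0] << 24 | (uint32_t)key[1] << 16 |
                          (uint32_t)key[2] << 8  | key[3];

        for (size_t i = 0; i < len; i++)
            for (int b = 7; b >= 0; b--) {
                if (data[i] & (1u << b))
                    result ^= window;
                window = (window << 1) | ((key[i + 4] >> b) & 1);
            }
        return result;
    }

    /* words[] holds the selected header fields, e.g. {saddr, daddr, ports, vni} */
    static uint32_t pipeline_hash(const struct hash_cfg *cfg, const uint32_t words[4])
    {
        uint32_t w[4];
        uint8_t buf[16];

        for (int i = 0; i < 4; i++)
            w[i] = words[i] & cfg->field_mask;

        if (cfg->swap_src_dst) {           /* 0<->1: addresses, 2<->3: ports */
            uint32_t t = w[0]; w[0] = w[1]; w[1] = t;
            t = w[2]; w[2] = w[3]; w[3] = t;
        }

        for (int i = 0; i < 4; i++) {      /* serialize big-endian for hashing */
            buf[4 * i]     = (uint8_t)(w[i] >> 24);
            buf[4 * i + 1] = (uint8_t)(w[i] >> 16);
            buf[4 * i + 2] = (uint8_t)(w[i] >> 8);
            buf[4 * i + 3] = (uint8_t)w[i];
        }
        return toeplitz(cfg->key, buf, sizeof(buf));
    }

A direction-independent hash could additionally be obtained by XOR-ing the source and destination words before hashing, which is the kind of XOR step such a pipeline exposes.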

Thanks


