
Re: [Xen-devel] [PATCH net-next] xen-netback: improve guest-receive-side flow control



On Thu, Nov 28, 2013 at 04:05:52PM +0000, Paul Durrant wrote:
> > -----Original Message-----
> > From: Wei Liu [mailto:wei.liu2@xxxxxxxxxx]
> > Sent: 28 November 2013 15:51
> > To: Paul Durrant
> > Cc: xen-devel@xxxxxxxxxxxxx; Wei Liu; Ian Campbell; David Vrabel
> > Subject: Re: [PATCH net-next] xen-netback: improve guest-receive-side flow
> > control
> > 
> > On Thu, Nov 28, 2013 at 01:11:12PM +0000, Paul Durrant wrote:
> > > The flow control code relies on a double pass of the skb: first to count
> > > the number of ring slots needed to supply sufficient grant references, and
> > > a second to actually consume those references and build the grant copy
> > > operations. It transpires that, in some scenarios, the initial count and
> > > the number of references consumed differ and, after this happens a number
> > > of times, flow control is completely broken.
> > 
> > Please add blank lines between paragraphs in later versions.
> > 
> > > This patch simplifies things by making a very quick static analysis of the
> > > minimal number of ring slots needed for an skb and then only stopping the
> > > queue if that minimal number of slots is not available. The worker thread
> > > does a worst-case analysis and only processes what will definitely fit,
> > > leaving remaining skbs on the internal queue. This patch also gets rid of
> > > the drop that occurs if an skb won't fit (because that doesn't matter - we
> > > always internally queue *at least* what we can put in the shared ring) and
> > > adds some hysteresis to waking the external queue. (It only gets woken
> > > when at least half the shared ring is available.)
> > 
> > I think this is the right direction to go. However, we've tripped over
> > this several times, so is it possible to test it with XenRT?
> > 
> 
> That's on its way. It'll be a day or so to get results back though.
> 

Good, thanks.

> > > Without this patch I can trivially stall netback permanently by just doing
> > > a large guest-to-guest file copy between two Windows Server 2008R2 VMs on
> > > a single host.
> > > Signed-off-by: Paul Durrant <paul.durrant@xxxxxxxxxx>
> > > Cc: Wei Liu <wei.liu2@xxxxxxxxxx>
> > > Cc: Ian Campbell <ian.campbell@xxxxxxxxxx>
> > > Cc: David Vrabel <david.vrabel@xxxxxxxxxx>
> > > ---
[...]
> > > -
> > > - /* Allow xenvif_start_xmit() to peek ahead in the rx request
> > > -  * ring.  This is a prediction of what rx_req_cons will be
> > > -  * once all queued skbs are put on the ring.
> > > -  */
> > > - RING_IDX rx_req_cons_peek;
> > > + bool rx_event;
> > >
> > 
> > What is this for? Reading through the code, my impression is that this is
> > a flag indicating that an RX interrupt has been triggered. Why is it
> > useful? Why is skb_queue_empty not sufficient?
> > 
> 
> The thread is kicked in two cases now: when a new packet is added and
> when the frontend sends an event (e.g. when new requests are posted).
> This is an explicit flag for the latter. I didn't want to risk missing
> a queue wake in the case where the internal queue was fully drained,
> leaving a shared ring that was more than half full, which I think is
> what would happen if I didn't have a flag such as this to control the
> thread wake.

OK, I get it. Better to add a comment for this flag.
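Something along these lines is what I have in mind -- only a rough, untested
sketch against my reading of your patch (xenvif_have_rx_work() is just an
illustrative name, and I'm assuming the usual vif->wq / vif->rx_queue fields):

static bool xenvif_have_rx_work(struct xenvif *vif)
{
        /* Either the frontend has posted new requests (rx_event) or
         * the backend has queued new skbs internally. */
        return vif->rx_event || !skb_queue_empty(&vif->rx_queue);
}

int xenvif_kthread(void *data)
{
        struct xenvif *vif = data;

        while (!kthread_should_stop()) {
                wait_event_interruptible(vif->wq,
                                         xenvif_have_rx_work(vif) ||
                                         kthread_should_stop());

                /* Clear the event before draining so a frontend kick
                 * that races with us is not lost. */
                vif->rx_event = false;

                /* ... put as many queued skbs on the shared ring as
                 * will definitely fit; leave the rest queued ... */
        }

        return 0;
}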

> 
> > >   /* Given MAX_BUFFER_OFFSET of 4096 the worst case is that each
> > >    * head/fragment page uses 2 copy operations because it
> > > @@ -198,8 +193,6 @@ void xenvif_xenbus_fini(void);
> > >
[...]
> > > -         count += nr_frags + 1;
> > > +         max_slots_needed = 2;
> > 
> > I'm afraid this starting point is not correct. Consider an SKB with a
> > very large linear buffer: you might need more than 2 slots to fit that
> > in, right?
> > 
> 
> I was wondering about that. Would a roundup of mtu to the nearest 4k boundary 
> be sufficient?
> 

I think we can reuse the logic in xenvif_gop_skb(); there's a snippet there
that calculates how many slots are necessary for the linear area. We just
need to avoid pulling requests from the ring.
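Something like this, say (untested sketch, and xenvif_count_linear_slots()
is just a name I'm making up here), would give a worst-case bound for the
linear area without touching the ring:

/* Worst case: the head may start at any offset within a page, so count
 * every PAGE_SIZE chunk it spans. */
static unsigned int xenvif_count_linear_slots(const struct sk_buff *skb)
{
        unsigned long offset = offset_in_page(skb->data);

        return DIV_ROUND_UP(offset + skb_headlen(skb), PAGE_SIZE);
}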

> > > +         for (i = 0; i < skb_shinfo(skb)->nr_frags; i++) {
> > > +                 struct page *page;
> > >
> > > -         __skb_queue_tail(&rxq, skb);
> > > +                 page = skb_frag_page(&skb_shinfo(skb)->frags[i]);
> > > +                 max_slots_needed += 1 << compound_order(page);
> > 
> > This estimation might reserve more slots than necessary.
> > 
> 
> Yes, it needs to be a worst-case estimate - but it's not too bad.
> 
> > The removal of xenvif_count_skb_slots() is OK because you're now
> > estimating minimum slots, but the actual number of slots consumed needs
> > to be accurate. Any reason you don't want to use xenvif_gop_skb()?
> > 
> 
> I can't. xenvif_gop_skb() pulls requests off the ring and if there aren't 
> enough there it will crash'n'burn as far as I can tell. So, I need an upper 
> bound on the number of slots before I call it.
> 

I get what you mean. We can at least use some of the snippet in it
without pulling requests. See above.
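I.e. something along these lines in the worker, before xenvif_gop_skb() is
called, so it can never pull more requests than the frontend has posted
(again just a sketch, reusing the illustrative helper above):

unsigned int max_slots_needed = xenvif_count_linear_slots(skb);
int i;

for (i = 0; i < skb_shinfo(skb)->nr_frags; i++) {
        struct page *page = skb_frag_page(&skb_shinfo(skb)->frags[i]);

        /* Worst case: every page of a compound frag needs its own slot. */
        max_slots_needed += 1 << compound_order(page);
}

/* Only proceed if the frontend has already posted enough requests;
 * otherwise leave the skb on the internal queue for later. */
if (vif->rx.sring->req_prod - vif->rx.req_cons < max_slots_needed)
        break;

/* Now xenvif_gop_skb() can safely consume up to max_slots_needed. */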

Wei.

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel


 

