Xen project Mailing List

Re: TCP wait_for transmit question

To: Balraj Singh <balraj.singh@xxxxxxxxxxxx>

From: Anil Madhavapeddy <anil@xxxxxxxxxx>

Date: Sat, 14 Jul 2012 17:10:42 +0100

Cc: Mirage List <cl-mirage@xxxxxxxxxxxxxxx>, Haris Rotsos <Charalampos.Rotsos@xxxxxxxxxxxx>

List-id: MirageOS development <cl-mirage.lists.cam.ac.uk>

Interesting; so this check is also clamped to the TX MSS (Tcp.Pcb.write_available) and not to the max_size of the application buffer. This is probably a good time to nail down the semantics of all these different modules, particularly as vchan/shmem will be coming along shortly. Channel: buffered I/O, manual flush required Flow: unbuffered I/O, will be triggered immediately Tcp.Pcb: buffered if delay writes are used, unbuffered with nodelay The TCP Nagle's buffer is necessary since only it knows if there are TX packets in flight, whereas the Channel module doesnt... -anil On Thu, Jul 12, 2012 at 09:06:40PM +0100, Balraj Singh wrote: > My mistake, yes User_buffer.write should block if the buffer is full, > but it doesn't - it always succeeds. I can't remember why, but I put > the check for buffer fullness in Flow.write (maybe there was something > similar there earlier and I just modified it). Flow.write calls > Tcp.Pcb.write which then calls User_buffer.write. For now if you use > Flow.write it should work as required. But I agree that it should be > moved to User_buffer.write, which should then block (or fail) if the > buffer is full. > > Here is the current write from flow.ml and the particular check to see > if the buffer has room is Tcp.Pcb.write_available: > > let rec write t view = > let vlen = Bitstring.bitstring_length view / 8 in > match Tcp.Pcb.write_available t with > |0 -> (* block for window to open *) > Tcp.Pcb.write_wait_for t 1 >> > write t view > |len when len < vlen -> (* do a short write *) > let v' = Bitstring.subbitstring view 0 (len * 8) in > Tcp.Pcb.write t v' >> > write t (Bitstring.subbitstring view (len*8) ((vlen-len)*8)) > |len -> (* full write *) > Tcp.Pcb.write t view > > > > On Thu, Jul 12, 2012 at 1:08 AM, Anil Madhavapeddy <anil@xxxxxxxxxx> wrote: > > On 12 Jul 2012, at 01:03, Haris Rotsos wrote: > >>> > >>> Haris, was your test case just calling Tcp.Pcb.write continuously and > >>> finding that > >>> it ran out of memory? > >> > >> yes. And by the way, because the code is written over the ns3 > >> simulation platform, I think the a call that that pushed packets to a > >> network interface will never block. The simulation hasn't got this > >> requirement fixed yet. > >> > >> This I guess in the case of xen or unix, will be handled with more > >> care as the write may block and create naturally a context switch in > >> the thread scheduler. > > > > Definitely; the Xen Netif has a fixed set of rings slots, and the Unix > > backend > > just uses (slow) blocking tuntap I/O. Both apply backpressure as a result. > > > >> > >>> > >>> In that case, it may be a regression that is the same problem as the ARP > >>> race > >>> condition (the OS.Netif.write is now too asynchronous). > >> > >> why would that affect the arp functionality? > > > > Only because our ARP support is super-minimal, and doesn't have a retransmit > > timer or anything. So you get one ARP query transmitted, and if it is lost > > for > > any reason, we never retransmit. I saw a few cases where we got stuck as a > > result. It's a quick job to add a retransmission timer to make it more > > RFC-compliant > > though. > > > > -anil > -- Anil Madhavapeddy http://anil.recoil.org

©2013 Xen Project, A Linux Foundation Collaborative Project. All Rights Reserved.
Linux Foundation is a registered trademark of The Linux Foundation.
Xen Project is a trademark of The Linux Foundation.