[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-API] [XCP-1.1] High OVS cpu load and unresponsive host network while VMPR archive phase is running



On Thursday 02 August 2012 23:46:18 George Shuklin wrote:
> In product environment I saw that behavior few times. ovs-* processes
> starts to consume lot of cpu (over 100%) and start to cause packets drops.
> 
> That usually happens with 'hacked' customer VMs (sudden spike of
> outgoing traffic, cpu, and in few cases we assisted in research, actual
> trojans running on server because of some stupid php misconfiguration in
> yet another phpbb/cms/durpal/etc).

We have no customer VMs there, and we watch the vm traffic. Nothing unusual. 
The 
archive phase is running. It's 100% reproducible.

I suppose that, in my case, it has something to do with with the OpenFlow 
controller (Citrix DVS Controller) we tried to evaluate. Currently we do some 
tests in an testing environment to work out the problem.

But by the way, what do you do to protect your production environment against 
crashing caused by flooding the network? IIRC Jesse Gross told something about 
some work on patches preventing a single vm from being able to render the 
network unresponsive, maybe a year ago. What's the state?

> I'm not sure wat exactly happens, but my hypothesis is that it related
> to amount of flows. Then trojan starts to flood out traffic to different
> servers (smtp/www spam/etc) it cause lots of new connections..
> 
> On 01.08.2012 04:08, Ben Pfaff wrote:
> > Christian Fischer writes:
> >> On Tuesday 31 July 2012 18:08:18 Ben Pfaff wrote:
> >>> Christian Fischer
> >>> 
> >>> writes:
> >>>> We have no tagged vlans here, all physical switch ports running access
> >>>> mode. I wouldn't say that network load is increased when this happens,
> >>>> 15 kpps. Network performance could be poor due either a vswitch issue
> >>>> (runs at 180% CPU load if the vswitch log don't lie) or high load
> >>>> on/cheep hardware of the customer shared backup storage. I've never
> >>>> seen this stuff.
> >>> 
> >>> 180% CPU load is impossible for OVS 1.0.1, which has only a
> >>> single procsss with a single thread.
> >> 
> >> Yes, that's right, but we run OVS 1.4.2
> >> 
> >> XCP build: 1.1.0-50674c
> >> OVS build: 1.4.2
> >> NICs: BCM5709 Gigabit TOE iSCSI Offload
> >> OVS NIC bonding: active/active
> > 
> > Only the as-yet-unreleased post-1.8.0 Open vSwitch has more than
> > one process, and it still doesn't have multiple threads.
> > 
> > I suppose ovsdb-server and ovs-vswitchd could both go crazy at
> > the same time, but I haven't had any reports of that.
> > 
> > What process(es) add up to 180%?
> > 
> > _______________________________________________
> > Xen-api mailing list
> > Xen-api@xxxxxxxxxxxxx
> > http://lists.xen.org/cgi-bin/mailman/listinfo/xen-api
> 
> _______________________________________________
> Xen-api mailing list
> Xen-api@xxxxxxxxxxxxx
> http://lists.xen.org/cgi-bin/mailman/listinfo/xen-api


_______________________________________________
Xen-api mailing list
Xen-api@xxxxxxxxxxxxx
http://lists.xen.org/cgi-bin/mailman/listinfo/xen-api


 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.