[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Xen-users] Apache CGI Performance Big Degration in Dom0 vs. Native


  • To: xen-users@xxxxxxxxxxxxxxxxxxx, xen-devel@xxxxxxxxxxxxxxxxxxx
  • From: 志波 唐 <hunter_tzb@xxxxxxxxxxxx>
  • Date: Wed, 23 May 2007 17:15:11 +0800 (CST)
  • Delivery-date: Thu, 24 May 2007 10:06:23 -0700
  • Domainkey-signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com.cn; h=X-YMail-OSG:Received:X-Mailer:Date:From:Subject:To:MIME-Version:Content-Type:Content-Transfer-Encoding:Message-ID; b=5+0H7JOUNyY1XTUgqNPwHgOnlD+5T+ZQEOX50X7M7zRNdmYGBEcoSJ8ndyPbE85pJIzGkmneTj4qaDLAtodVWVlPP5FCABlySL0r77P4jtEC0zTzqKBcW08OKAjLoVpQPDbOtPuk4pFnZbBNC6y4f6u3sQUxcnltLI625JFXolQ=;
  • List-id: Xen user discussion <xen-users.lists.xensource.com>

Hi there,

I ran a test on an Apache server, the workload is a helloworld.c compiled cgi, 
very simple. OS is SLES 10. The stress tool is ab (apache bench).

The performance looks big degration from native to Dom0:

Running in prefork mode:

                                       Native                              Dom0
Performance(request/s)   3700                                 . 650
CPU%                             75%                                  99%



Running in worker mode:
                                       Native                              Dom0
Performance(request/s)     1750.                               769
CPU%                             32%                                 26%


Also running XenOprof and Oprof as below clock cycles data:

Domain 0, worker:
CPU: Core 2, speed 2666.75 MHz (estimated)
Counted CPU_CLK_UNHALTED events (Clock cycles when not halted) with a unit mask 
of 0x00 (Unhalted core cycles) count 100000
samples  %        image name               app name                 symbol name
71855     4.6699  xen-syms-pae-3.0.4_13138-0.33 xen-syms-pae-3.0.4_13138-0.33 
get_page_from_l1e
71812     4.6672  xen-syms-pae-3.0.4_13138-0.33 xen-syms-pae-3.0.4_13138-0.33 
handle_exception
58634     3.8107  xen-syms-pae-3.0.4_13138-0.33 xen-syms-pae-3.0.4_13138-0.33 
put_page_from_l1e
56301     3.6591  xen-syms-pae-3.0.4_13138-0.33 xen-syms-pae-3.0.4_13138-0.33 
do_update_va_mapping
49543     3.2199  vmlinux-2.6.16.46-0.10-xenpae vmlinux-2.6.16.46-0.10-xenpae 
_spin_lock

Domain 0, Prefork:
CPU: Core 2, speed 2666.75 MHz (estimated)
Counted CPU_CLK_UNHALTED events (Clock cycles when not halted) with a unit mask 
of 0x00 (Unhalted core cycles) count 100000
samples  %        image name               app name                 symbol name
492796   20.0510  xen-syms-pae-3.0.4_13138-0.33 xen-syms-pae-3.0.4_13138-0.33 
do_update_va_mapping
268506   10.9251  xen-syms-pae-3.0.4_13138-0.33 xen-syms-pae-3.0.4_13138-0.33 
ptwr_do_page_fault
81074     3.2988  xen-syms-pae-3.0.4_13138-0.33 xen-syms-pae-3.0.4_13138-0.33 
test_all_events
72424     2.9468  xen-syms-pae-3.0.4_13138-0.33 xen-syms-pae-3.0.4_13138-0.33 
get_page_from_l1e
66056     2.6877  xen-syms-pae-3.0.4_13138-0.33 xen-syms-pae-3.0.4_13138-0.33 
handle_exception
61031     2.4832  xen-syms-pae-3.0.4_13138-0.33 xen-syms-pae-3.0.4_13138-0.33 
do_mmu_update

Native, Worker:
CPU: Core 2, speed 2667.14 MHz (estimated)
Counted CPU_CLK_UNHALTED events (Clock cycles when not halted) with a unit mask 
of 0x00 (Unhalted core cycles) count 100000
samples  %        app name                 symbol name
46048     5.8715  vmlinux-2.6.16.46-0.10-bigsmp kmap_atomic
41587     5.3027  vmlinux-2.6.16.46-0.10-bigsmp copy_page_range
39759     5.0696  vmlinux-2.6.16.46-0.10-bigsmp unmap_vmas
38722     4.9373  vmlinux-2.6.16.46-0.10-bigsmp page_fault
30638     3.9066  vmlinux-2.6.16.46-0.10-bigsmp page_remove_rmap
29481     3.7591  vmlinux-2.6.16.46-0.10-bigsmp __handle_mm_fault


Native Prefork:
CPU: Core 2, speed 2667.14 MHz (estimated)
Counted CPU_CLK_UNHALTED events (Clock cycles when not halted) with a unit mask 
of 0x00 (Unhalted core cycles) count 100000
samples  %        app name                 symbol name
47505     6.9430  vmlinux-2.6.16.46-0.10-bigsmp kmap_atomic
41182     6.0189  vmlinux-2.6.16.46-0.10-bigsmp page_fault
36425     5.3236  vmlinux-2.6.16.46-0.10-bigsmp copy_page_range
34533     5.0471  vmlinux-2.6.16.46-0.10-bigsmp unmap_vmas
32155     4.6996  vmlinux-2.6.16.46-0.10-bigsmp __handle_mm_fault
30848     4.5085  vmlinux-2.6.16.46-0.10-bigsmp find_get_page
28589     4.1784  vmlinux-2.6.16.46-0.10-bigsmp page_remove_rmap

On pure HTML workload, I find little performance gap between domain 0 and 
Native, and neither so much Memory and Page operation.
Is the Domain 0 performance for CGI degration because of Xen Memory and Page 
ineffiencency? Any hints or experience on this?


thanks in advance

Rgds
Hunter Tang


      ___________________________________________________________ 
抢注雅虎免费邮箱3.5G容量,20M附件! 
http://cn.mail.yahoo.com

_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users


 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.