Hello,
I have consistently seen poor performance when a domain is serving a file from a locally-attached storage device over the xenbridge "network" to another, client domain on the same host. I have reduced the problem to the following very simple scenario involving two domUs: one client and one server. For my purpose the only difference is that server has an SSD mounted (as a block device) at /mnt. Each has 1 vcpu and 512mb RAM on a 4-hyperthreaded-core machine (shows as 8 "cores" in dom0).
server: eth1 IP address 192.168.62.110
client: eth1 IP address 192.168.62.202
(in the following, I am executing "echo 3 > /proc/sys/vm/drop_caches" on dom0 before each command shown)
First to test the speed of tearing through a random gigabyte of data I put there for the purpose: server# time cat /mnt/randgig > /dev/null ~4s (4 seconds, times here are averages over several runs, dropping caches between)
Now let's test the speed of the "network" between client and server without interference from the disk
server# dd if=/dev/zero bs=4096 count=262144 | nc -lvv -p 3500 -q0
client# time nc 192.168.62.110 3500 > /dev/null ~3.5s
Finally, let's actually transfer data from disk to the client server# dd if=/dev/zero bs=4096 count=262144 | nc -lvv -p 3500 -q0
client# time nc 192.168.62.110 3500 > /dev/null ~18.8s
So you see, it is much slower to both read from disk and transfer over the (xenbridge) network than to do either alone, even though (in theory) I have enough processors (4 or 8 depending on how you count) to do all the work.
If I move the client to a different (identically configured) host attached by 1Gbit ethernet through a switch, I get these revised times:
transfer a gig of /dev/zero from server to client: 9.5s instead of 3.5
transfer a gig of /mnt/randgig from server to client: 14.2s instead of 18.8s !!
This further confirms that there is some bad interaction between disk and network i/o scheduling, presumably in the dom0 backend but I am not sure how to tell for sure.
I have tried every combination of # of vcpus, pinning vcpus, etc on both domUs and dom0. I have also tried the experiment with dom0 as the server; the main difference is that the performance is worse in all cases but still better if the client is on a different host.
So in summary my questions are:
1) why is it so much slower to transfer a file from disk over the xenbridge network than either reading from the disk or sending bytes over the network alone?
2) what can I do about it?
I have searched in vain for any hint of this problem, except that the Xen documentation says somewhere I should pin and fix the number of dom0 cpus when doing I/O-intensive work in the guests, but I have tried this to no avail.
I would appreciate any insights.
Best,
Joe Whitney
_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users
|