xen-bugs

To: xen-bugs@xxxxxxxxxxxxxxxxxxx
Subject: [Xen-bugs] [Bug 790] New: There is a CPU fatal trap each time I create and shut down a non-privileged domain with 4096 MB or more
From: bugzilla-daemon@xxxxxxxxxxxxxxxxxxx
Date: Tue, 10 Oct 2006 04:32:43 -0700
Delivery-date: Tue, 10 Oct 2006 04:33:37 -0700
Envelope-to: www-data@xxxxxxxxxxxxxxxxxx
List-help: <mailto:xen-bugs-request@lists.xensource.com?subject=help>
List-id: Xen Bugzilla <xen-bugs.lists.xensource.com>
List-post: <mailto:xen-bugs@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-bugs>, <mailto:xen-bugs-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-bugs>, <mailto:xen-bugs-request@lists.xensource.com?subject=unsubscribe>
Reply-to: bugs@xxxxxxxxxxxxxxxxxx
Sender: xen-bugs-bounces@xxxxxxxxxxxxxxxxxxx
http://bugzilla.xensource.com/bugzilla/show_bug.cgi?id=790

           Summary: There is a CPU fatal trap each time I create and
                    shut down a non-privileged domain with 4096 MB or more
           Product: Xen
           Version: unstable
          Platform: x86-64
        OS/Version: Linux
            Status: NEW
          Severity: major
          Priority: P2
         Component: Hypervisor
        AssignedTo: xen-bugs@xxxxxxxxxxxxxxxxxxx
        ReportedBy: guillaume.thouvenin@xxxxxxxx
                CC: guillaume.thouvenin@xxxxxxxx


I'm using Xen version 3.0-unstable (guill@[unknown]) (gcc version 4.1.2
20060901 (prerelease) (Debian 4.1.1-13)) Mon Oct  9 13:22:35 CEST 2006
 Latest ChangeSet: Mon Oct 02 18:04:56 2006 +0100 11686:f426f6e646eb. 
I downloaded this version from the xen-3.0.3-testing.hg Mercurial repository.

The machine has 4 Intel Xeon processors (dual core with Hyper-Threading, so
Xen sees 16 logical CPUs) and 14 GB of RAM.

Here is more information about the system:

  (XEN) Command line: /boot/xen-3.0-unstable.gz watchdog dom0_mem=5242880 com1=19200,8n1
  (XEN) Physical RAM map: 
  (XEN)  0000000000000000 - 0000000000099000 (usable)
  (XEN)  0000000000099000 - 00000000000a0000 (reserved)
  (XEN)  00000000000e0000 - 0000000000100000 (reserved)
  (XEN)  0000000000100000 - 00000000e7f8dac0 (usable)
  (XEN)  00000000e7f8dac0 - 00000000e7f98880 (ACPI data)
  (XEN)  00000000e7f98880 - 00000000e8000000 (reserved)
  (XEN)  00000000fec00000 - 0000000100000000 (reserved)
  (XEN)  0000000100000000 - 0000000398000000 (usable)
  (XEN) System RAM: 14335MB (14679192kB)
  (XEN) Xen heap: 13MB (13860kB)
  (XEN) found SMP MP-table at 00099140
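
As a sanity check on the figures above, the "System RAM" line can be recomputed from the usable ranges of the physical RAM map. This is only a sketch: it assumes 4 KiB pages with partial pages discarded, and it assumes that a dom0_mem value without a suffix is given in kilobytes. The helper name usable_kb is made up for illustration.

```python
# Usable ranges copied from the "(XEN) Physical RAM map" lines above.
USABLE = [
    (0x0000000000000000, 0x0000000000099000),
    (0x0000000000100000, 0x00000000E7F8DAC0),
    (0x0000000100000000, 0x0000000398000000),
]

PAGE = 4096  # assumption: 4 KiB pages, partial pages are not counted


def usable_kb(ranges, page=PAGE):
    """Sum the page-aligned part of each usable range, in kB."""
    total = 0
    for start, end in ranges:
        first = -(-start // page) * page  # round the start up to a page
        last = (end // page) * page       # round the end down to a page
        total += max(0, last - first)
    return total // 1024


kb = usable_kb(USABLE)
print(kb, "kB =", kb // 1024, "MB")  # 14679192 kB = 14335 MB

# Assumption: dom0_mem without a suffix is in kB, so this is 5 GiB for dom0.
DOM0_MEM_KB = 5242880
print(DOM0_MEM_KB / 1024 ** 2, "GiB")
```

Under those assumptions the result agrees with the 14335MB (14679192kB) reported by Xen.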

Steps to Reproduce the bug:

  1) I create a domU with a 2.6.16-29-xenU xenolinux kernel that is patched
with support for SAS. The memory given to the domain is 4096M; the problem
doesn't occur if I give less than 4096M of memory to the domain.

  2) I reboot (or shut down) the domain using 'xm reboot dom', and the system
crashes with the following message:

  (XEN) Watchdog timer detects that CPU12 is stuck!
  (XEN) ----[ Xen-3.0-unstable  x86_64  debug=y  Not tainted ]----
  (XEN) CPU:    12
  (XEN) RIP:    e010:[<ffff830000191235>] percpu_traps_init+0x3bf/0x1183
  (XEN) RFLAGS: 0000000000000286   CONTEXT: hypervisor
  (XEN) rax: ffff8300002cd180   rbx: ffff830000f8b380   rcx: ffff8300002cd180
  (XEN) rdx: 0000000000000004   rsi: 0000000000000004   rdi: 0000000000000005
  (XEN) rbp: ffff830000fa7d18   rsp: ffff830000fa7c78   r8:  0000000000f8b380
  (XEN) r9:  0000000000000206   r10: ffff8300001aae30   r11: 0000000000000283
  (XEN) r12: ffff8300002cd180   r13: 00000000001830d1   r14: ffff828403c7a0a8
  (XEN) r15: ffffffff805e64e0   cr0: 000000008005003b   cr4: 00000000000026f0
  (XEN) cr3: 00000000e152b000   cr2: 0000000000558000
  (XEN) ds: 0000   es: 0000   fs: 0000   gs: 0000   ss: e018   cs: e010
  (XEN) Xen stack trace from rsp=ffff830000fa7c78:
  (XEN)    ffff82840024c890 ffff82840024c890 0000000000000000 ffff830000fa7cc8
  (XEN)    0000000100fa7ca8 ffff830000f8b380 0000000000000286 ffff830000fa7f28
  (XEN)    ffff830000fa7ce8 ffff83000013ac3f 00000000001830d1 0000014300000008
  (XEN)    0000000000000000 ffff830000fa7e58 ffff8300002cd180 ffff830000f8b380
  (XEN)    ffffffff805e8d60 ffff830000fa7f28 ffffffff805e8d60 ffff8300001c8020
  (XEN)    ffff830000fa7e58 ffff83000010d700 ffff830000f8b380 000000000013f614
  (XEN)    00000000001829c3 0000000100fa7f28 ffff830000f8b380 00000000001829c3
  (XEN)    ffff830000fa7e58 ffff83000013754b ffff830000fa7f28 ffff830000fa7f28
  (XEN)    ffff830000fa7f28 ffff830000fa7f28 ffff830000fa7f28 ffff830000fa7f28
  (XEN)    ffff830000fa7f28 ffff830000fa7f28 0000000100007ff0 ffffffff805e64e0
  (XEN)    0000000100fa7e08 ffff8300002ec080 ffff8300001bc4a0 ffff8300001bc4a0
  (XEN)    ffff8300001bc4a0 ffff8300001bc4a0 ffff88013f614000 0000000000000000
  (XEN)    ffffffff805e7cf0 ffff88013f614000 ffff830000fa7e58 0000000100136212
  (XEN)    00000001829c3001 000000000013f614 ffff8300001c8120 ffff8300001c8028
  (XEN)    ffffffff805e8d60 ffff830000fa7f28 ffffffff805e8d60 ffff8300001c8020
  (XEN)    ffff830000fa7f08 ffff83000011157b ffff8300001f3000 ffff8300001c8028
  (XEN)    ffff8300001c8030 ffff830000fa7e98 ffff8300001c8028 0000000200000003
  (XEN)    aaaaaaaaaaaaaaaa aaaaaaaaaaaaaaaa aaaaaaaaaaaaaaaa aaaaaaaaaaaaaaaa
  (XEN)    aaaaaaaaaaaaaaaa aaaaaaaaaaaaaaaa aaaaaaaaaaaaaaaa aaaaaaaaaaaaaaaa
  (XEN)    ffff830000000012 ffff8300002ec080 ffffffff805e20e0 0000000000000000
  (XEN) Xen call trace:
  (XEN)    [<ffff830000191235>] percpu_traps_init+0x3bf/0x1183
  (XEN)    [<ffff83000010d700>] do_grant_table_op+0xea8/0x1908
  (XEN)    [<ffff83000011157b>] do_multicall+0x1bb/0x2f0
  (XEN)    [<ffff830000183582>] syscall_enter+0xa2/0xfc
  (XEN)
  (XEN) ************************************
  (XEN) CPU12 FATAL TRAP 2 (nmi), ERROR_CODE 0000.
  (XEN) System shutting down -- need manual reset.
  (XEN) ************************************
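
For comparing several crashes, the key fields of a dump like the one above (stuck CPU, RIP symbol and trap type) can be pulled out with a few regular expressions. This is a minimal sketch, not a tool shipped with Xen; the function name summarize and the dictionary keys are invented here, and the patterns are only checked against the lines quoted in this report.

```python
import re

# Excerpt of the crash dump above (only the lines this sketch reads).
LOG = """\
(XEN) Watchdog timer detects that CPU12 is stuck!
(XEN) RIP:    e010:[<ffff830000191235>] percpu_traps_init+0x3bf/0x1183
(XEN) CPU12 FATAL TRAP 2 (nmi), ERROR_CODE 0000.
"""

STUCK = re.compile(r"Watchdog timer detects that CPU(\d+) is stuck")
RIP = re.compile(
    r"RIP:\s+\w+:\[<([0-9a-f]+)>\]\s+(\w+)\+0x([0-9a-f]+)/0x([0-9a-f]+)"
)
TRAP = re.compile(r"CPU(\d+) FATAL TRAP (\d+) \((\w+)\), ERROR_CODE ([0-9a-f]+)")


def summarize(log):
    """Return the stuck CPU, the RIP symbol/offset and the trap, if present."""
    out = {}
    m = STUCK.search(log)
    if m:
        out["stuck_cpu"] = int(m.group(1))
    m = RIP.search(log)
    if m:
        out["rip"] = int(m.group(1), 16)
        out["symbol"] = m.group(2)
        out["offset"] = int(m.group(3), 16)
    m = TRAP.search(log)
    if m:
        out["trap"] = int(m.group(2))
        out["trap_name"] = m.group(3)
    return out


print(summarize(LOG))
```

The summary only restates what the console printed; it says nothing about whether the symbol resolved for the RIP is meaningful.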

I can reproduce the bug.

Sometimes the message is different and looks like this:

  (XEN) Watchdog timer detects that CPU8 is stuck!
  (XEN) ----[ Xen-3.0-unstable  x86_64  debug=n  Not tainted ]----
  (XEN) CPU:    8
  (XEN) RIP:    e010:[<ffff830000124cba>] put_page_type+0xca/0xf0
  (XEN) RFLAGS: 0000000000000246   CONTEXT: hypervisor
  (XEN) rax: 00000000e8000001   rbx: 00000000e8000001   rcx: ffff830000000000
  (XEN) rdx: 00000000002c6f90   rsi: 00000000e8000001   rdi: ffff8284011bb678
  (XEN) rbp: 00000000e8000000   rsp: ffff83000026bb68   r8:  0000000000000000
  (XEN) r9:  0000000000000000   r10: 0000000000000000   r11: 0000000000000000
  (XEN) r12: ffff8284011bb678   r13: ffff828408091430   r14: ffff8300002bf098
  (XEN) r15: 00000000002bf080   cr0: 0000000080050033   cr4: 00000000000026f0
  (XEN) cr3: 00000001c519e000   cr2: 0000000040ffcc08
  (XEN) ds: 0000   es: 0000   fs: 0063   gs: 0000   ss: e018   cs: e010
  (XEN) Xen stack trace from rsp=ffff83000026bb68:
  (XEN)    ffff8284011bb678 ffff830336d4f000 ffff8300002bf080 ffff83000012507f
  (XEN)    ffff828408091430 ffff830336d4e478 ffff830336d4f000 ffff830000124a74
  (XEN)    ffff828408091458 0000000020000001 0000000020000000 ffff828408091430
  (XEN)    ffff828407f73378 ffff830000124c64 ffff828408091430 ffff83032fae4000
  (XEN)    000000000032fae3 ffff830000124fb1 ffff83032fae33a8 ffff830000124acf
  (XEN)    ffff828407a495a0 0000000040000001 0000000040000000 ffff828407f73378
  (XEN)    ffff828404f3a918 ffff830000124c64 ffff828407f73378 ffff8301fb108000
  (XEN)    00000000001fb107 ffff830000124d11 ffff8301fb107020 ffff830000124b2f
  (XEN)    00000000002bf080 0000000060000001 0000000060000000 ffff828404f3a918
  (XEN)    ffff828400a03e58 ffff830000124c64 ffff828404f3a918 000000000004018f
  (XEN)    ffff83004018f000 ffff830000124e51 0000000000000110 ffff830000124b8a
  (XEN)    0000000000000003 0000000080000001 0000000080000000 ffff828400a03e58
  (XEN)    ffff8300002bf080 ffff830000124c64 0000000080000004 ffff828400a03e58
  (XEN)    ffff828400a03e58 ffff83000011d349 ffff830000282080 0000000000000000
  (XEN)    ffff828400a3e5d0 ffff8300002bf080 0000000000000007 0000000000000007
  (XEN)    ffff83000026bda8 ffff83000011d66b ffff8300002bf080 ffff8300002bf080
  (XEN)    0000000000305000 ffff830000107158 0000000000000000 ffff83000010642e
  (XEN)    00000000000001ff 00000000000001ff 0000000000040c78 0000000000000092
  (XEN)    0000000300000002 0000000000480004 0000003000000018 0000000040fff620
  (XEN)    0000000040fff560 00000000004bab29 00002b9bf82e4ea0 00000000005f8560
  (XEN) Xen call trace:
  (XEN)    [<ffff830000124cba>] put_page_type+0xca/0xf0
  (XEN)    [<ffff83000012507f>] put_page_from_l1e+0x8f/0x130
  (XEN)    [<ffff830000124a74>] free_page_type+0x314/0x490
  (XEN)    [<ffff830000124c64>] put_page_type+0x74/0xf0
  (XEN)    [<ffff830000124fb1>] put_page_from_l2e+0x31/0x70
  (XEN)    [<ffff830000124acf>] free_page_type+0x36f/0x490
  (XEN)    [<ffff830000124c64>] put_page_type+0x74/0xf0
  (XEN)    [<ffff830000124d11>] put_page_from_l3e+0x31/0x70
  (XEN)    [<ffff830000124b2f>] free_page_type+0x3cf/0x490
  (XEN)    [<ffff830000124c64>] put_page_type+0x74/0xf0
  (XEN)    [<ffff830000124e51>] put_page_from_l4e+0x31/0x70
  (XEN)    [<ffff830000124b8a>] free_page_type+0x42a/0x490
  (XEN)    [<ffff830000124c64>] put_page_type+0x74/0xf0
  (XEN)    [<ffff83000011d349>] relinquish_memory+0xc9/0x1e0
  (XEN)    [<ffff83000011d66b>] domain_relinquish_resources+0x20b/0x220
  (XEN)    [<ffff830000107158>] domain_kill+0x28/0x60
  (XEN)    [<ffff83000010642e>] do_domctl+0x5fe/0xd20
  (XEN)    [<ffff83000012033a>] event_check_interrupt+0x2a/0x30
  (XEN)    [<ffff830000161bdf>] toggle_guest_mode+0xff/0x110
  (XEN)    [<ffff830000161dbb>] do_iret+0x7b/0x140
  (XEN)    [<ffff830000160dc2>] syscall_enter+0x62/0x67
  (XEN)
  (XEN) ************************************
  (XEN) CPU8 FATAL TRAP 2 (nmi), ERROR_CODE 0000.
  (XEN) System shutting down -- need manual reset.
  (XEN) ************************************
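
The call trace above reads innermost frame first. Reversed, it shows the domain teardown descending through the page-table levels (l4e, l3e, l2e, l1e) from relinquish_memory. The following sketch makes that reading explicit by parsing the frames; the helper names frames and page_table_levels are invented for illustration, and the interpretation relies only on the function names printed in the trace.

```python
import re

# Call trace frames copied from the dump above, innermost first.
TRACE = """\
(XEN)    [<ffff830000124cba>] put_page_type+0xca/0xf0
(XEN)    [<ffff83000012507f>] put_page_from_l1e+0x8f/0x130
(XEN)    [<ffff830000124a74>] free_page_type+0x314/0x490
(XEN)    [<ffff830000124c64>] put_page_type+0x74/0xf0
(XEN)    [<ffff830000124fb1>] put_page_from_l2e+0x31/0x70
(XEN)    [<ffff830000124acf>] free_page_type+0x36f/0x490
(XEN)    [<ffff830000124c64>] put_page_type+0x74/0xf0
(XEN)    [<ffff830000124d11>] put_page_from_l3e+0x31/0x70
(XEN)    [<ffff830000124b2f>] free_page_type+0x3cf/0x490
(XEN)    [<ffff830000124c64>] put_page_type+0x74/0xf0
(XEN)    [<ffff830000124e51>] put_page_from_l4e+0x31/0x70
(XEN)    [<ffff830000124b8a>] free_page_type+0x42a/0x490
(XEN)    [<ffff830000124c64>] put_page_type+0x74/0xf0
(XEN)    [<ffff83000011d349>] relinquish_memory+0xc9/0x1e0
(XEN)    [<ffff83000011d66b>] domain_relinquish_resources+0x20b/0x220
(XEN)    [<ffff830000107158>] domain_kill+0x28/0x60
"""

FRAME = re.compile(r"\[<([0-9a-f]+)>\]\s+(\w+)\+0x([0-9a-f]+)/0x([0-9a-f]+)")
LEVEL = re.compile(r"put_page_from_l(\d)e")


def frames(text):
    """Return the symbol names of all frames, innermost first."""
    return [m.group(2) for m in FRAME.finditer(text)]


def page_table_levels(names):
    """Return the page-table levels visited, outermost call first."""
    levels = []
    for name in reversed(names):
        m = LEVEL.fullmatch(name)
        if m:
            levels.append(int(m.group(1)))
    return levels


names = frames(TRACE)
print(page_table_levels(names))       # [4, 3, 2, 1]
print(names.count("put_page_type"))   # 5
```

This only shows where the CPU was when the watchdog fired; it does not by itself explain why the teardown took long enough to trigger it.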

This crash happens only if the domU has 4096M of memory or more. The
configuration file of the domain is:


  kernel="/boot/vmlinuz-2.6.16.29-xen-sas-testing"
  # Initial memory allocation (in megabytes) for the new domain.
  memory=4096
  # A name for your domain. All domains must have different names.
  name="capablanca"
  disk=['phy:sdb5,sdb1,w','phy:sdb6,sdb2,w']
  vif=[' ']
  vcpus = 16
  root="/dev/sdb1 ro"
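
Because xm domain configuration files use Python syntax, the settings above can be loaded and inspected directly. The sketch below evaluates the assignments and converts the memory value (in megabytes, as the comment in the file says) into bytes and 4 KiB pages. The helper load_config is hypothetical, and the 4 KiB page size is an assumption. The arithmetic shows that 4096M is exactly 2^32 bytes; whether that boundary is related to the crash is not established by this report.

```python
# The domain configuration quoted above (comments omitted).
CONFIG = '''
kernel="/boot/vmlinuz-2.6.16.29-xen-sas-testing"
memory=4096
name="capablanca"
disk=['phy:sdb5,sdb1,w','phy:sdb6,sdb2,w']
vif=[' ']
vcpus = 16
root="/dev/sdb1 ro"
'''


def load_config(text):
    """Evaluate plain 'name = value' assignments and return them as a dict."""
    settings = {}
    # No builtins are exposed: the file is expected to hold assignments only.
    exec(text, {"__builtins__": {}}, settings)
    return settings


cfg = load_config(CONFIG)
mem_bytes = cfg["memory"] * 1024 * 1024  # 'memory' is given in megabytes
pages = mem_bytes // 4096                # assumption: 4 KiB pages

print(cfg["name"], cfg["memory"], "MB")
print(mem_bytes == 2 ** 32, pages)
```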


-- 
Configure bugmail: 
http://bugzilla.xensource.com/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

_______________________________________________
Xen-bugs mailing list
Xen-bugs@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-bugs
