WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
Xen 
 
Home Products Support Community News
 
   
 

xen-devel

[Xen-devel] Unable to bring up x86_64 SMP DomUs

To: "xen-devel" <xen-devel@xxxxxxxxxxxxxxxxxxx>
Subject: [Xen-devel] Unable to bring up x86_64 SMP DomUs
From: "Puthiyaparambil, Aravindh" <aravindh.puthiyaparambil@xxxxxxxxxx>
Date: Wed, 19 Oct 2005 15:57:41 -0400
Delivery-date: Wed, 19 Oct 2005 19:55:02 +0000
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
List-help: <mailto:xen-devel-request@lists.xensource.com?subject=help>
List-id: Xen developer discussion <xen-devel.lists.xensource.com>
List-post: <mailto:xen-devel@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=unsubscribe>
Sender: xen-devel-bounces@xxxxxxxxxxxxxxxxxxx
Thread-index: AcXU5100H4bo/UdgT/OnjMIomRkgrQ==
Thread-topic: Unable to bring up x86_64 SMP DomUs
I am on changeset 7425:7c951e3eb5ab. I am seeing strange things when I
try to bring up x86_64 DomUs. Sometimes the DomU dies and then keeps
rebooting. 

Here is the output

DomU OUTPUT
-----------

NET: Registered protocol family 16
xenbus_probe_init
Brought up 2 CPUs
Unable to handle kernel NULL pointer dereference at 0000000000000010
RIP: 
<ffffffff8012b00e>{scheduler_tick+115}
PGD 0 
Oops: 0002 [1] SMP 
CPU 0 
Modules linked in:
Pid: 1, comm: swapper Not tainted 2.6.12.5-xenU
RIP: e030:[<ffffffff8012b00e>] <ffffffff8012b00e>{scheduler_tick+115}
RSP: e02b:ffff8800014c3b38  EFLAGS: 00010202
RAX: 0000000000000000 RBX: ffff8800014c1470 RCX: 000000349d0260c7
RDX: ffffffff80340718 RSI: 0000000000000002 RDI: ffff880001387360
RBP: ffff8800014c3b58 R08: 0000000000000008 R09: ffffffff8037e908
R10: 0000000000000010 R11: 0000000000000202 R12: ffff880001388c20
R13: ffff880001387360 R14: 0000000000000000 R15: 0000000000000000
FS:  0000000000000000(0000) GS:ffffffff8037e500(0000)
knlGS:0000000000000000
CS:  e033 DS: 0000 ES: 0000
Process swapper (pid: 1, threadinfo 0000000000000000, task
ffff8800014c1470)
Stack: ffff8800014c1470 0000000000000000 ffff880001387360
0000000000000bda 
       0000000000000000 ffffffff801387f5 0000000000000000
0000000000000bda 
       ffffffff803b0420 ffffffff80118dc3 
Call Trace:<ffffffff801387f5>{update_process_times+241}
<ffffffff80118dc3>{timer_interrupt+512}
       <ffffffff8014fd26>{handle_IRQ_event+76}
<ffffffff8014fe3a>{__do_IRQ+201}
       <ffffffff80115670>{do_IRQ+52}
<ffffffff8010e915>{evtchn_do_upcall+181}
       <ffffffff80157b7c>{do_ccupdate_local+0}
<ffffffff80157b7c>{do_ccupdate_local+0}
       <ffffffff80112ae9>{do_hypervisor_callback+17}
<ffffffff80157b7c>{do_ccupdate_local+0}
       <ffffffff80157b7c>{do_ccupdate_local+0}
<ffffffff80115f6c>{xen_send_IPI_shortcut+241}
       <ffffffff80115cdc>{__smp_call_function+105}
<ffffffff8011ac20>{do_page_fault+101}
       <ffffffff80157b7c>{do_ccupdate_local+0}
<ffffffff801580bb>{smp_call_function_all_cpus+99}
       <ffffffff801590aa>{do_tune_cpucache+180}
<ffffffff80112a4b>{error_exit+0}
       <ffffffff8011ac35>{do_page_fault+122}
<ffffffff8011ac20>{do_page_fault+101}
       <ffffffff80112cfb>{child_rip+8} <ffffffff8010d03f>{init+0}
       <ffffffff80112cf3>{child_rip+0} 

Code: f0 0f ba 68 10 03 e9 01 03 00 00 4c 89 e7 e8 b3 40 17 00 8b 
RIP <ffffffff8012b00e>{scheduler_tick+115} RSP <ffff8800014c3b38>
CR2: 0000000000000010
 <0>Kernel panic - not syncing: Aiee, killing interrupt handler!

SERIAL OUTPUT
-------------
(XEN) (file=traps.c, line=968) Non-priv domain attempted
RDMSR(00000000c0000080,00010000,00010000).
(XEN) (file=traps.c, line=960) Non-priv domain attempted
WRMSR(00000000c0000100,00000000,00000000).
(XEN) (file=traps.c, line=960) Non-priv domain attempted
WRMSR(00000000c0000102,00000000,00000000).
(XEN) (file=traps.c, line=968) Non-priv domain attempted
RDMSR(00000000c0000080,00000000,00000000).
(XEN) (file=traps.c, line=968) Non-priv domain attempted
RDMSR(00000000c0000100,00000045,00000001).
(XEN) (file=traps.c, line=968) Non-priv domain attempted
RDMSR(00000000c0000101,00000000,00000000).
(XEN) (file=traps.c, line=968) Non-priv domain attempted
RDMSR(00000000c0000102,8037e500,ffffffff).
(XEN) Domain 1 (vcpu#1) crashed on cpu#0:
(XEN) CPU:    0
(XEN) RIP:    e030:[<ffffffff8011ac35>] ???
(XEN) RFLAGS: 0000000000010202   CONTEXT: hypervisor
(XEN) rax: 0000000000000000   rbx: 0000000000000000   rcx:
0000000000000000
(XEN) rdx: ffff880001383048   rsi: 000000000000000d   rdi:
ffffffff80340718
(XEN) rbp: 0000000000000000   rsp: ffff880001383028   r8:
0000000000000000
(XEN) r9:  0000000000000000   r10: 0000000000000000   r11:
0000000000000000
(XEN) r12: 0000000000000000   r13: 0000000000000034   r14:
0000000000000000
(XEN) r15: ffff880001383128   cr0: 000000008005003b   cr3:
000000007fe01000
(XEN) Xen stack trace from rsp=ffff880001383028:
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    ffff880001383128 ffffffff802b1c5e 0000000000000000
0000000b0000000e 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000034 0000000000000000 ffff8800013832d8
ffffffff80112a4b 
(XEN)    ffff8800013832d8 0000000000000000 0000000000000034
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    ffff8800013831f8 000000000000000d ffffffff80340718
ffffffffffffffff 
(XEN)    ffffffff8011ac35 000000010000e030 0000000000010202
ffff8800013831d8 
(XEN)    000000000000e02b ffffffff8011ac20 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 ffff8800013832d8
ffffffff802b1c5e 
(XEN)    0000000000000000 0000000b0000000e 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN) Xen call trace:
(XEN)    [<ffffffff8011ac35>] ???
(XEN)    
(XEN) DOM32767: (file=mm.c, line=460) Non-privileged attempt to map I/O
space 0000000e
(XEN) DOM32767: (file=mm.c, line=2944) ptwr: Could not re-validate l1
page
(XEN) (file=traps.c, line=968) Non-priv domain attempted
RDMSR(00000000c0000080,00010000,00010000).
(XEN) (file=traps.c, line=960) Non-priv domain attempted
WRMSR(00000000c0000100,00000000,00000000).
(XEN) (file=traps.c, line=960) Non-priv domain attempted
WRMSR(00000000c0000102,00000000,00000000).
(XEN) (file=traps.c, line=968) Non-priv domain attempted
RDMSR(00000000c0000080,00000000,00000000).
(XEN) Domain 2 (vcpu#1) crashed on cpu#0:
(XEN) CPU:    0
(XEN) RIP:    e030:[<ffffffff8011ac35>] ???
(XEN) RFLAGS: 0000000000010202   CONTEXT: hypervisor
(XEN) rax: 0000000000000000   rbx: 0000000000000000   rcx:
0000000000000000
(XEN) rdx: ffff880001383048   rsi: 000000000000000d   rdi:
ffffffff80340718
(XEN) rbp: 0000000000000000   rsp: ffff880001383028   r8:
0000000000000000
(XEN) r9:  0000000000000000   r10: 0000000000000000   r11:
0000000000000000
(XEN) r12: 0000000000000000   r13: 0000000000000034   r14:
0000000000000000
(XEN) r15: ffff880001383128   cr0: 000000008005003b   cr3:
0000000001c01000
(XEN) Xen stack trace from rsp=ffff880001383028:
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    ffff880001383128 ffffffff802b1c5e 0000000000000000
0000000b0000000e 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000034 0000000000000000 ffff8800013832d8
ffffffff80112a4b 
(XEN)    ffff8800013832d8 0000000000000000 0000000000000034
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    ffff8800013831f8 000000000000000d ffffffff80340718
ffffffffffffffff 
(XEN)    ffffffff8011ac35 000000010000e030 0000000000010202
ffff8800013831d8 
(XEN)    000000000000e02b ffffffff8011ac20 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 ffff8800013832d8
ffffffff802b1c5e 
(XEN)    0000000000000000 0000000b0000000e 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN) Xen call trace:
(XEN)    [<ffffffff8011ac35>] ???
(XEN)    
------------------------------------------------------------------------
----


Sometimes it causes the host system to die. Here is that output.

DomU OUTPUT
-----------

xenbus_probe_init
Brought up 2 CPUs
Unable to handle kernel NULL pointer dereference at 0000000000000000
RIP: 
<0000000000000000>{_stext+2146434956}
PGD 0 
Oops: 0010 [1] SMP 
CPU 0 
Modules linked in:
Pid: 5, comm: khelper Not tainted 2.6.12.5-xenU
RIP: e030:[<0000000000000000>] <0000000000000000>{_stext+2146434956}
RSP: e02b:ffff880000063df0  EFLAGS: 00010293
RAX: ffff880000043d10 RBX: 0000000000000000 RCX: 0000000000000000
RDX: 0000000000000000 RSI: 0000000000000003 RDI: ffff880000043cf8
RBP: ffff880000063e28 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000202 R12: 0000000000000000
R13: 0000000000000001 R14: ffff880000043e08 R15: 0000000000000000
FS:  0000000000000000(0000) GS:ffffffff8037e500(0000)
knlGS:0000000000000000
CS:  e033 DS: 0000 ES: 0000
Process khelper (pid: 5, threadinfo ffff880000062000, task
ffff880000054df0)
Stack: ffffffff80128dd1 0000000300000000 ffff880000043df8
ffff880000043e00 
       0000000000000000 0000000000000000 ffffffff8013f18c
ffff880000063e58 
       ffffffff8012a700 0000000000000200 
Call Trace:<ffffffff80128dd1>{__wake_up_common+64}
<ffffffff8013f18c>{__call_usermodehelper+0}
       <ffffffff8012a700>{complete+57}
<ffffffff8013f95f>{worker_thread+476}
       <ffffffff8012abb2>{default_wake_function+0}
<ffffffff80128dd1>{__wake_up_common+64}
       <ffffffff8012abb2>{default_wake_function+0}
<ffffffff8013f783>{worker_thread+0}
       <ffffffff80143bd2>{kthread+206} <ffffffff80112cfb>{child_rip+8}
       <ffffffff80143b04>{kthread+0} <ffffffff80112cf3>{child_rip+0}
       

SERIAL OUTPUT
-------------
Code:  Bad RIP value.
RIP <0000000000000000>{_stext+2146434956} RSP <ffff880000063df0>
CR2: 0000000000000000

(XEN) (file=traps.c, line=968) Non-priv domain attempted
RDMSR(00000000c0000080,00020000,00020000).
(XEN) (file=traps.c, line=960) Non-priv domain attempted
WRMSR(00000000c0000100,00000000,00000000).
(XEN) (file=traps.c, line=960) Non-priv domain attempted
WRMSR(00000000c0000102,00000000,00000000).
(XEN) (file=traps.c, line=968) Non-priv domain attempted
RDMSR(00000000c0000080,00000000,00000000).
(XEN) (file=traps.c, line=968) Non-priv domain attempted
RDMSR(00000000c0000100,00000045,00000001).
(XEN) (file=traps.c, line=968) Non-priv domain attempted
RDMSR(00000000c0000101,00000000,00000000).
(XEN) (file=traps.c, line=968) Non-priv domain attempted
RDMSR(00000000c0000102,8037e500,ffffffff).
(XEN) Domain 1 (vcpu#1) crashed on cpu#0:
(XEN) CPU:    0
(XEN) RIP:    e030:[<ffffffff80112e17>] ???
(XEN) RFLAGS: 0000000000010292   CONTEXT: hypervisor
(XEN) rax: ffffffff8011abbb   rbx: 0000000000000000   rcx:
0000000000000000
(XEN) rdx: 0000000000000034   rsi: 000000000000000d   rdi:
ffffffff80340718
(XEN) rbp: 0000000000000000   rsp: ffff87ffffffffc8   r8:
0000000000000000
(XEN) r9:  0000000000000000   r10: 0000000000000000   r11:
0000000000000000
(XEN) r12: 0000000000000000   r13: 0000000000000034   r14:
0000000000000000
(XEN) r15: ffff880000000178   cr0: 000000008005003b   cr3:
00000000d8d1c000
(XEN) Xen stack trace from rsp=ffff87ffffffffc8:
(XEN)    CPU:    0
(XEN) RIP:    e010:[<ffff830000150b49>] show_stack+0xc4/0x123
(XEN) RFLAGS: 0000000000010246   CONTEXT: hypervisor
(XEN) rax: ffff87ffffffffc8   rbx: ffff830000107f28   rcx:
0000000000003b38
(XEN) rdx: 0000000000000000   rsi: 0000000000003671   rdi:
ffff8300001ac564
(XEN) rbp: ffff830000107e58   rsp: ffff830000107e38   r8:
0000000000000010
(XEN) r9:  00000000ffffffff   r10: ffff8300001ccc9f   r11:
ffff8300001cc8d5
(XEN) r12: ffff830000268080   r13: 0000000000000034   r14:
0000000000000000
(XEN) r15: ffff880000000178   cr0: 000000008005003b   cr3:
00000000d8d1c000
(XEN) Xen stack trace from rsp=ffff830000107e38:
(XEN)    0000000000000342 0000000000000000 ffff87ffffffffc8
ffff830000107f28 
(XEN)    ffff830000107ee8 ffff83000016b536 ffff8300001a7e98
ffff830000129c81 
(XEN)    f51c8be000000342 000000016f0d268a 000000008005003b
00000000000791a8 
(XEN)    ffff830000fd2420 ffff830000107f28 ffff8300001a7e98
00000000d8d1c000 
(XEN)    000000008005003b 000000000000e030 0000000000010292
ffff87ffffffffc8 
(XEN)    ffffffff80112e17 ffff830000107f28 ffff830000107f08
ffff83000010d171 
(XEN)    ffff830000268080 0000000000000000 ffff830000107f18
ffff83000010d189 
(XEN)    00007cffffef80b7 ffff830000169f86 ffff880000000178
0000000000000000 
(XEN)    0000000000000034 0000000000000000 0000000000000000
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    ffffffff8011abbb 0000000000000000 0000000000000034
000000000000000d 
(XEN)    ffffffff80340718 0000000e00000006 ffffffff80112e17
000000000000e030 
(XEN)    0000000000010292 ffff87ffffffffc8 000000000000e02b
0000000000000000 
(XEN)    0000000000000000 0000000000000000 0000000000000000
0000000000000000 
(XEN)    ffff830000268080 
(XEN) Xen call trace:
(XEN)    [<ffff830000150b49>] show_stack+0xc4/0x123
(XEN)    [<ffff83000016b536>] show_registers+0x442/0x444
(XEN)    [<ffff83000010d171>] domain_crash+0x56/0x65
(XEN)    [<ffff83000010d189>] domain_crash_synchronous+0x9/0x10
(XEN)    [<ffff830000169f86>]
process_guest_exception_and_events+0x16/0x20
(XEN)    
(XEN) Pagetable walk from ffff87ffffffffc8:
(XEN)  L4 = 0000000000000000
(XEN) 
(XEN) ****************************************
(XEN) Panic on CPU 0:
(XEN) CPU0 FATAL PAGE FAULT
(XEN) [error_code=0000]
(XEN) Faulting linear address: ffff87ffffffffc8
(XEN) ****************************************
(XEN) 
(XEN) Reboot in five seconds...


I guess this could be connected to the SMP issue with Dom0. Anyone else
seeing this?

Thanks,
Aravindh


_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel

<Prev in Thread] Current Thread [Next in Thread>