|
|
|
|
|
|
|
|
|
|
xen-devel
[Xen-devel] Re: Out sw-iommu space problem
With swiotlb=128768 from kern.log:
Sep 15 14:27:45 heliMN02WV kernel: [ 3.595733] PCI-DMA: Using software
bounce buffering for IO (SWIOTLB)
Sep 15 14:27:45 heliMN02WV kernel: [ 3.595738] DMA: Placing 251MB
software IO TLB between ffff8800048c0000 - ffff880014440000
Sep 15 14:27:45 heliMN02WV kernel: [ 3.595740] DMA: software IO TLB at
phys 0x48c0000 – 0x14440000
It seems swiotlb was assigned 251 mb
Without swiotlb= from kern.log:
Sep 15 08:16:44 heliMN02WV kernel: [ 3.400528] PCI-DMA: Using software
bounce buffering for IO (SWIOTLB)
Sep 15 08:16:44 heliMN02WV kernel: [ 3.400533] DMA: Placing 64MB software
IO TLB between ffff8800048c0000 - ffff8800088c0000
Sep 15 08:16:44 heliMN02WV kernel: [ 3.400535] DMA: software IO TLB at
phys 0x48c0000 – 0x88c0000
I have set swiotlb=65762, i have do reboot of dom0 but after always problem
on save of service xendomains shutdown, not sm iommu out of memory but:
Sep 15 15:29:15 heliMN02WV kernel: [ 641.006286] tapdisk2[6891]: segfault
at 7fff58bb7fe8 ip 00000000004082ac sp 00007fff58bb7ff0 error 6 in
tapdisk2[400000+39000]
Sep 15 15:29:15 heliMN02WV kernel: [ 641.026060] BUG: unable to handle
kernel NULL pointer dereference at 0000000000000048
Sep 15 15:29:15 heliMN02WV kernel: [ 641.026200] IP: [<ffffffff810ce79e>]
apply_to_page_range+0x47/0x2f3
Sep 15 15:29:15 heliMN02WV kernel: [ 641.026291] PGD 0
Sep 15 15:29:15 heliMN02WV kernel: [ 641.026371] Oops: 0000 [#1] SMP
Sep 15 15:29:15 heliMN02WV kernel: [ 641.026483] last sysfs file:
/sys/devices/virtual/blktap2/blktap6/remove
Sep 15 15:29:15 heliMN02WV kernel: [ 641.026541] CPU 2
Sep 15 15:29:15 heliMN02WV kernel: [ 641.026619] Modules linked in:
xt_tcpudp tun xt_physdev iptable_filter ip_tables x_tables bridge stp
ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp
libiscsi_tcp libiscsi scsi_transport_iscsi ext2 sha256_generic aes_x86_64
aes_generic cbc blktap xen_evtchn xenfs loop dm_crypt snd_pcm snd_timer snd
soundcore snd_page_alloc evdev joydev dcdbas pcspkr power_meter button
processor acpi_processor ext4 mbcache jbd2 crc16 dm_mod sd_mod crc_t10dif sg
sr_mod cdrom usbhid hid ata_generic ehci_hcd ata_piix usbcore libata
nls_base bnx2 mpt2sas scsi_transport_sas scsi_mod thermal thermal_sys [last
unloaded: scsi_wait_scan]
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029000] Pid: 6891, comm: tapdisk2
Not tainted 2.6.32-5-xen-amd64 #1 PowerEdge T310
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029082] RIP:
e030:[<ffffffff810ce79e>] [<ffffffff810ce79e>]
apply_to_page_range+0x47/0x2f3
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029191] RSP: e02b:ffff88003dda3b58
EFLAGS: 00010202
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029246] RAX: 0000000000000880 RBX:
ffff88003ebc1000 RCX: ffff88003ebc2000
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029313] RDX: 0000000000000000 RSI:
ffff88003ebc1000 RDI: 0000000000000000
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029372] RBP: ffff88003e7383d0 R08:
0000000000000000 R09: ffff88003c4d4780
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029431] R10: 0000000000000002 R11:
0000000000000000 R12: 0000000000000000
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029491] R13: ffff88003e7383d0 R14:
ffff88003deb7800 R15: 0000000000000000
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029555] FS:
00007f185cd15740(0000) GS:ffff880003728000(0000) knlGS:0000000000000000
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029630] CS: e033 DS: 0000 ES:
0000 CR0: 000000008005003b
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029686] CR2: 0000000000000048 CR3:
0000000001001000 CR4: 0000000000002660
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029744] DR0: 0000000000000000 DR1:
0000000000000000 DR2: 0000000000000000
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029802] DR3: 0000000000000000 DR6:
00000000ffff0ff0 DR7: 0000000000000400
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029861] Process tapdisk2 (pid:
6891, threadinfo ffff88003dda2000, task ffff8800027ae9f0)
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029936] Stack:
Sep 15 15:29:15 heliMN02WV kernel: [ 641.029984] 0000000000000000
ffff880001d03b40 0000000000000000 0000000000000000
Sep 15 15:29:15 heliMN02WV kernel: [ 641.030134] <0> ffffffffa02dcee8
0000000000000000 ffffffff8100ece2 ffff8800023c9cc0
Sep 15 15:29:15 heliMN02WV kernel: [ 641.030359] <0> ffff88003ebc2000
0000000000000000 0000000000000000 ffff8800023c9cc0
Sep 15 15:29:15 heliMN02WV kernel: [ 641.030635] Call Trace:
Sep 15 15:29:15 heliMN02WV kernel: [ 641.030699] [<ffffffffa02dcee8>] ?
blktap_umap_uaddr_fn+0x0/0x59 [blktap]
Sep 15 15:29:15 heliMN02WV kernel: [ 641.030764] [<ffffffff8100ece2>] ?
check_events+0x12/0x20
Sep 15 15:29:15 heliMN02WV kernel: [ 641.030831] [<ffffffffa02de2a5>] ?
blktap_device_end_request+0xbd/0x145 [blktap]
Sep 15 15:29:15 heliMN02WV kernel: [ 641.030917] [<ffffffffa02dc743>] ?
blktap_ring_vm_close+0x60/0xd1 [blktap]
Sep 15 15:29:15 heliMN02WV kernel: [ 641.030977] [<ffffffff810d13f8>] ?
remove_vma+0x2c/0x72
Sep 15 15:29:15 heliMN02WV kernel: [ 641.031040] [<ffffffff810d1567>] ?
exit_mmap+0x129/0x148
Sep 15 15:29:15 heliMN02WV kernel: [ 641.031097] [<ffffffff8104cc5d>] ?
mmput+0x3c/0xdf
Sep 15 15:29:15 heliMN02WV kernel: [ 641.031153] [<ffffffff81050862>] ?
exit_mm+0x102/0x10d
Sep 15 15:29:15 heliMN02WV kernel: [ 641.031212] [<ffffffff8130d0d2>] ?
_spin_lock_irq+0x7/0x22
Sep 15 15:29:15 heliMN02WV kernel: [ 641.031274] [<ffffffff81052287>] ?
do_exit+0x1f8/0x6c6
Sep 15 15:29:15 heliMN02WV kernel: [ 641.031340] [<ffffffff8105d5a1>] ?
__dequeue_signal+0xfb/0x124
Sep 15 15:29:15 heliMN02WV kernel: [ 641.031399] [<ffffffff8100eccf>] ?
xen_restore_fl_direct_end+0x0/0x1
Sep 15 15:29:15 heliMN02WV kernel: [ 641.031458] [<ffffffff810e7f35>] ?
kmem_cache_free+0x72/0xa3
Sep 15 15:29:15 heliMN02WV kernel: [ 641.031514] [<ffffffff810527cb>] ?
do_group_exit+0x76/0x9d
Sep 15 15:29:15 heliMN02WV kernel: [ 641.031570] [<ffffffff8105f0b7>] ?
get_signal_to_deliver+0x310/0x339
Sep 15 15:29:15 heliMN02WV kernel: [ 641.031627] [<ffffffff8101104f>] ?
do_notify_resume+0x87/0x73f
Sep 15 15:29:15 heliMN02WV kernel: [ 641.031684] [<ffffffff810d15e1>] ?
expand_downwards+0x5b/0x169
Sep 15 15:29:15 heliMN02WV kernel: [ 641.031742] [<ffffffff8130f589>] ?
do_page_fault+0x1f3/0x2f2
Sep 15 15:29:15 heliMN02WV kernel: [ 641.031798] [<ffffffff810125dc>] ?
retint_signal+0x48/0x8c
Sep 15 15:29:15 heliMN02WV kernel: [ 641.031853] Code: 48 89 4c 24 20 4c 89
44 24 18 48 89 54 24 40 72 04 0f 0b eb fe 48 8b 54 24 28 48 89 f0 48 8b 4c
24 40 48 c1 e8 24 25 f8 0f 00 00 <48> 8b 52 48 48 ff c9 48 89 0c 24 48 01 d0
48 89 44 24 30 48 b8
Sep 15 15:29:15 heliMN02WV kernel: [ 641.033878] RIP [<ffffffff810ce79e>]
apply_to_page_range+0x47/0x2f3
Sep 15 15:29:15 heliMN02WV kernel: [ 641.033965] RSP <ffff88003dda3b58>
Sep 15 15:29:15 heliMN02WV kernel: [ 641.034015] CR2: 0000000000000048
Sep 15 15:29:15 heliMN02WV kernel: [ 641.034067] ---[ end trace
162efc545a37e94b ]---
Sep 15 15:29:15 heliMN02WV kernel: [ 641.034120] Fixing recursive fault but
reboot is needed!
I have do other reboot of dom0, shutdown of domu how have give error, reboot
of dom0 and other service xendomains stop but same error:
Sep 15 15:55:32 heliMN02WV kernel: [ 441.038016] tapdisk2[4787]: segfault
at 7ffff3d44ff8 ip 0000000000408296 sp 00007ffff3d45000 error 6 in
tapdisk2[400000+39000]
Sep 15 15:55:32 heliMN02WV kernel: [ 441.057462] BUG: unable to handle
kernel NULL pointer dereference at 0000000000000048
Sep 15 15:55:32 heliMN02WV kernel: [ 441.057601] IP: [<ffffffff810ce79e>]
apply_to_page_range+0x47/0x2f3
Sep 15 15:55:32 heliMN02WV kernel: [ 441.057691] PGD 0
Sep 15 15:55:32 heliMN02WV kernel: [ 441.057770] Oops: 0000 [#1] SMP
Sep 15 15:55:32 heliMN02WV kernel: [ 441.057880] last sysfs file:
/sys/devices/virtual/blktap2/blktap1/remove
Sep 15 15:55:32 heliMN02WV kernel: [ 441.057938] CPU 2
Sep 15 15:55:32 heliMN02WV kernel: [ 441.058015] Modules linked in:
xt_tcpudp xt_physdev iptable_filter ip_tables x_tables tun bridge stp
ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp
libiscsi_tcp libiscsi scsi_transport_iscsi ext2 sha256_generic aes_x86_64
aes_generic cbc blktap xen_evtchn xenfs loop dm_crypt snd_pcm snd_timer snd
soundcore snd_page_alloc joydev pcspkr evdev dcdbas button power_meter
processor acpi_processor ext4 mbcache jbd2 crc16 dm_mod sd_mod crc_t10dif sg
sr_mod cdrom usbhid hid ata_generic ata_piix ehci_hcd mpt2sas usbcore libata
nls_base bnx2 scsi_transport_sas scsi_mod thermal thermal_sys [last
unloaded: scsi_wait_scan]
Sep 15 15:55:32 heliMN02WV kernel: [ 441.060309] Pid: 4787, comm: tapdisk2
Not tainted 2.6.32-5-xen-amd64 #1 PowerEdge T310
Sep 15 15:55:32 heliMN02WV kernel: [ 441.060382] RIP:
e030:[<ffffffff810ce79e>] [<ffffffff810ce79e>]
apply_to_page_range+0x47/0x2f3
Sep 15 15:55:32 heliMN02WV kernel: [ 441.060487] RSP: e02b:ffff88003dd15b58
EFLAGS: 00010202
Sep 15 15:55:32 heliMN02WV kernel: [ 441.060542] RAX: 0000000000000880 RBX:
ffff880003337000 RCX: ffff880003338000
Sep 15 15:55:32 heliMN02WV kernel: [ 441.060599] RDX: 0000000000000000 RSI:
ffff880003337000 RDI: 0000000000000000
Sep 15 15:55:32 heliMN02WV kernel: [ 441.060657] RBP: ffff880001ef50f0 R08:
0000000000000000 R09: ffff88003d161900
Sep 15 15:55:32 heliMN02WV kernel: [ 441.060714] R10: 0000000000000002 R11:
0000000000000000 R12: 0000000000000000
Sep 15 15:55:32 heliMN02WV kernel: [ 441.060771] R13: ffff880001ef50f0 R14:
ffff88003981e000 R15: 0000000000000000
Sep 15 15:55:32 heliMN02WV kernel: [ 441.060831] FS:
00007f7e8720a740(0000) GS:ffff880003728000(0000) knlGS:0000000000000000
Sep 15 15:55:32 heliMN02WV kernel: [ 441.060905] CS: e033 DS: 0000 ES:
0000 CR0: 000000008005003b
Sep 15 15:55:32 heliMN02WV kernel: [ 441.060960] CR2: 0000000000000048 CR3:
0000000001001000 CR4: 0000000000002660
Sep 15 15:55:32 heliMN02WV kernel: [ 441.061018] DR0: 0000000000000000 DR1:
0000000000000000 DR2: 0000000000000000
Sep 15 15:55:32 heliMN02WV kernel: [ 441.061076] DR3: 0000000000000000 DR6:
00000000ffff0ff0 DR7: 0000000000000400
Sep 15 15:55:32 heliMN02WV kernel: [ 441.061133] Process tapdisk2 (pid:
4787, threadinfo ffff88003dd14000, task ffff880035dbf810)
Sep 15 15:55:32 heliMN02WV kernel: [ 441.061207] Stack:
Sep 15 15:55:32 heliMN02WV kernel: [ 441.061255] 0000000000000000
ffff880035dc59c0 0000000000000000 0000000000000000
Sep 15 15:55:32 heliMN02WV kernel: [ 441.061403] <0> ffffffffa02dcee8
0000000000000000 ffffffff8100ece2 ffff88003d499540
Sep 15 15:55:32 heliMN02WV kernel: [ 441.061623] <0> ffff880003338000
0000000000000000 0000000000000000 ffff88003d499540
Sep 15 15:55:32 heliMN02WV kernel: [ 441.061886] Call Trace:
Sep 15 15:55:32 heliMN02WV kernel: [ 441.061939] [<ffffffffa02dcee8>] ?
blktap_umap_uaddr_fn+0x0/0x59 [blktap]
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062000] [<ffffffff8100ece2>] ?
check_events+0x12/0x20
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062056] [<ffffffffa02de2a5>] ?
blktap_device_end_request+0xbd/0x145 [blktap]
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062130] [<ffffffffa02dc743>] ?
blktap_ring_vm_close+0x60/0xd1 [blktap]
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062189] [<ffffffff810d13f8>] ?
remove_vma+0x2c/0x72
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062244] [<ffffffff810d1567>] ?
exit_mmap+0x129/0x148
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062300] [<ffffffff8104cc5d>] ?
mmput+0x3c/0xdf
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062355] [<ffffffff81050862>] ?
exit_mm+0x102/0x10d
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062412] [<ffffffff8130d0d2>] ?
_spin_lock_irq+0x7/0x22
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062468] [<ffffffff81052287>] ?
do_exit+0x1f8/0x6c6
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062529] [<ffffffff8105d5a1>] ?
__dequeue_signal+0xfb/0x124
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062590] [<ffffffff8100eccf>] ?
xen_restore_fl_direct_end+0x0/0x1
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062647] [<ffffffff810e7f35>] ?
kmem_cache_free+0x72/0xa3
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062703] [<ffffffff810527cb>] ?
do_group_exit+0x76/0x9d
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062759] [<ffffffff8105f0b7>] ?
get_signal_to_deliver+0x310/0x339
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062816] [<ffffffff8101104f>] ?
do_notify_resume+0x87/0x73f
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062874] [<ffffffff810d15e1>] ?
expand_downwards+0x5b/0x169
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062930] [<ffffffff8130f589>] ?
do_page_fault+0x1f3/0x2f2
Sep 15 15:55:32 heliMN02WV kernel: [ 441.062986] [<ffffffff810125dc>] ?
retint_signal+0x48/0x8c
Sep 15 15:55:32 heliMN02WV kernel: [ 441.063041] Code: 48 89 4c 24 20 4c 89
44 24 18 48 89 54 24 40 72 04 0f 0b eb fe 48 8b 54 24 28 48 89 f0 48 8b 4c
24 40 48 c1 e8 24 25 f8 0f 00 00 <48> 8b 52 48 48 ff c9 48 89 0c 24 48 01 d0
48 89 44 24 30 48 b8
Sep 15 15:55:32 heliMN02WV kernel: [ 441.065075] RIP [<ffffffff810ce79e>]
apply_to_page_range+0x47/0x2f3
Sep 15 15:55:32 heliMN02WV kernel: [ 441.065164] RSP <ffff88003dd15b58>
Sep 15 15:55:32 heliMN02WV kernel: [ 441.065216] CR2: 0000000000000048
Sep 15 15:55:32 heliMN02WV kernel: [ 441.065270] ---[ end trace
f19d313c82859143 ]---
Sep 15 15:55:32 heliMN02WV kernel: [ 441.065324] Fixing recursive fault but
reboot is needed!
Can you help me to solve this problem please?
I try search on xen-unstable.hg "tapdisk2" revs and i found
http://xenbits.xen.org/hg/staging/xen-unstable.hg/rev/a5a0817d9210 with some
strange characters "Â", there isn't on 4.0 testing and isn't this problem
but can be a bug on unstable.
--
View this message in context:
http://xen.1045712.n5.nabble.com/Out-sw-iommu-space-problem-tp4803078p4807062.html
Sent from the Xen - Dev mailing list archive at Nabble.com.
_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel
|
|
|
|
|