WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
Xen 
 
Home Products Support Community News
 
   
 

xen-users

Re: [Xen-users] XCP : Failed load VDI information on NFS shared repo

To: admin@xxxxxxxxxxx
Subject: Re: [Xen-users] XCP : Failed load VDI information on NFS shared repo
From: Ciaran Kendellen <ciaran@xxxxxxxxxxxxxxx>
Date: Mon, 19 Sep 2011 16:59:38 +0100
Cc: Grant McWilliams <grantmasterflash@xxxxxxxxx>, xen-users@xxxxxxxxxxxxxxxxxxx
Delivery-date: Mon, 19 Sep 2011 09:04:10 -0700
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to: <CANXrN=0JnkKjYD4jNn3ZtHZ0uOgH7heuqD_xeFgw9kE8ekbUjg@xxxxxxxxxxxxxx>
List-help: <mailto:xen-users-request@lists.xensource.com?subject=help>
List-id: Xen user discussion <xen-users.lists.xensource.com>
List-post: <mailto:xen-users@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=unsubscribe>
References: <4E64F317.2070300@xxxxxxxxxxxxxxx> <CAGnmK4y45arSQeTaVD8ndXMsN=JNRycWJBTPoP8TL+yyouMNEw@xxxxxxxxxxxxxx> <CANXrN=0JnkKjYD4jNn3ZtHZ0uOgH7heuqD_xeFgw9kE8ekbUjg@xxxxxxxxxxxxxx>
Sender: xen-users-bounces@xxxxxxxxxxxxxxxxxxx
User-agent: Mozilla/5.0 (X11; U; Linux x86_64; en-GB; rv:1.9.2.17) Gecko/20110414 SUSE/3.1.10 Thunderbird/3.1.10
An update to my last mail, having restarted the file system servers and all hosts in the resource pool, I now get the following when attempting to copy the VDI for the vm in question :

There was an SR backend failure.
status: non-zero exit
stdout:
stderr: Traceback (most recent call last):
  File "/opt/xensource/sm/NFSSR", line 253, in ?
    SRCommand.run(NFSSR, DRIVER_INFO)
  File "/opt/xensource/sm/SRCommand.py", line 246, in run
    ret = cmd.run(sr)
  File "/opt/xensource/sm/SRCommand.py", line 94, in run
    return self._run_locked(sr)
  File "/opt/xensource/sm/SRCommand.py", line 131, in _run_locked
    return self._run(sr, target)
  File "/opt/xensource/sm/SRCommand.py", line 183, in _run
    caching_params)
  File "/opt/xensource/sm/blktap2.py", line 1280, in activate
    if self._activate_locked(sr_uuid, vdi_uuid, caching_params):
  File "/opt/xensource/sm/blktap2.py", line 29, in wrapper
    ret = op(self, *args)
  File "/opt/xensource/sm/blktap2.py", line 1312, in _activate_locked
    dev_path = self._activate(sr_uuid, vdi_uuid, caching_params)
  File "/opt/xensource/sm/blktap2.py", line 1332, in _activate
    dev_path = self._tap_activate(phy_path, vdi_type, sr_uuid)
  File "/opt/xensource/sm/blktap2.py", line 1135, in _tap_activate
    VDI._tap_type(vdi_type))
  File "/opt/xensource/sm/blktap2.py", line 711, in launch_on_tap
    raise TapdiskFailed(cls.Arg(_type, path), ctl)
blktap2.TapdiskFailed: Tapdisk(vhd:/var/run/sr-mount/0554f47c-1a12-9bd1-ea9b-d2b68984f0ed/b1dda4b8-574d-458f-83de-586e7e5e2921.vhd): ['/usr/sbin/tap-ctl', 'open', '-p', '14429', '-m', '1', '-a', 'vhd:/var/run/sr-mount/0554f47c-1a12-9bd1-ea9b-d2b68984f0ed/b1dda4b8-574d-458f-83de-586e7e5e2921.vhd'] failed: status=22, pid=14431, errmsg=

I'm pretty sure the vdi IS corrupt, but I'm not sure what utils are available to help me repair/diagnose/query it??

I've ran the td-utils query commands direct from the shell, and I get similar results to above error code 22 with no further info.

I've tried looking for a list of tap-disk error codes to help me diagnose, to no avail.

Any body fancy having a stab at this?

Kind regards,

Ciaran.

On 09/09/11 13:47, David Markey wrote:
I bet your VHD is corrupt.

Can you give a bit more SMlog context, i.e. 10 lines above and below?

Can you give dmesg output also.


On 9 September 2011 06:32, Grant McWilliams <grantmasterflash@xxxxxxxxx> wrote:


On Mon, Sep 5, 2011 at 9:04 AM, Ciaran Kendellen <ciaran@xxxxxxxxxxxxxxx> wrote:
Hello there.

I had cause to force shutdown a vm recently and on attempting to restart it I get :

Error code: SR_BACKEND_FAILURE_65
Error parameters: , Failed to load VDI [opterr=Failed load VDI information /var/run/sr-mount/0554f47c-1a12-9bd1-ea9b-d2b68984f0ed/e82a1032-5a0c-4ff7-96cf-1f62f32938db.vhd]

The VDI is on a shared NFS repository, other vm's came back up no problem.

Any ideas what to try anyone?

Thanks in advance,

Ciaran.

___
NFS is sideways. Start at the low layers on the host with the NFS share and make sure everything is good there first. For instance if you have more than one NFS mount or you change the mount, or it doesn't mount up etc. you will get these errors. I got this when I had an SR connected and I shut down NFS, renamed the export and started it back up. The SR sat there twitching until I realised what I did.

Grant McWilliams
http://grantmcwilliams.com/



_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users


_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users