<html>

  <head>

    <meta content="text/html; charset=windows-1252"

      http-equiv="Content-Type">

  </head>

  <body text="#000000" bgcolor="#FFFFFF">

    sorry for serial posting but, i got new logs it might help..<br>

    <br>

    the message appear during the migration;<br>

    <br>

    /var/log/glusterfs/nfs.log<br>

    <br>

    <br>

    [2016-03-14 09:45:04.573765] I [MSGID: 109036]

    [dht-common.c:8043:dht_log_new_layout_for_dir_selfheal] 0-testv-dht:

    Setting layout of /New Virtual Machine_1 with [Subvol_name:

    testv-stripe-0, Err: -1 , Start: 0 , Stop: 4294967295 , Hash: 1 ], <br>

    [2016-03-14 09:45:04.957499] E

    [shard.c:369:shard_modify_size_and_block_count]

    (--&gt;/usr/lib64/glusterfs/3.7.8/xlator/cluster/distribute.so(dht_file_setattr_cbk+0x14f)

    [0x7f27a13c067f]

    --&gt;/usr/lib64/glusterfs/3.7.8/xlator/features/shard.so(shard_common_setattr_cbk+0xcc)

    [0x7f27a116681c]

    --&gt;/usr/lib64/glusterfs/3.7.8/xlator/features/shard.so(shard_modify_size_and_block_count+0xdd)

    [0x7f27a116584d] ) 0-testv-shard: Failed to get

    trusted.glusterfs.shard.file-size for

    c3e88cc1-7e0a-4d46-9685-2d12131a5e1c<br>

    [2016-03-14 09:45:04.957577] W [MSGID: 112199]

    [nfs3-helpers.c:3418:nfs3_log_common_res] 0-nfs-nfsv3: /New Virtual

    Machine_1/New Virtual Machine-flat.vmdk =&gt; (XID: 3fec5a26,

    SETATTR: NFS: 22(Invalid argument for operation), POSIX: 22(Invalid

    argument)) [Invalid argument]<br>

    [2016-03-14 09:45:05.079657] E [MSGID: 112069]

    [nfs3.c:3649:nfs3_rmdir_resume] 0-nfs-nfsv3: No such file or

    directory: (192.168.221.52:826) testv :

    00000000-0000-0000-0000-000000000001<br>

    <br>

    <br>

    <br>

    <div class="moz-signature"><font color="#3366ff"><font

          color="#000000">Respectfully<b><br>

          </b><b>Mahdi A. Mahd<br>

            <br>

          </b></font></font><font color="#3366ff"> </font></div>

    <div class="moz-cite-prefix">On 03/14/2016 11:14 AM, Mahdi Adnan

      wrote:<br>

    </div>

    <blockquote cite="mid:56E672E7.6060606@earthlinktele.com"

      type="cite">

      <meta content="text/html; charset=windows-1252"

        http-equiv="Content-Type">

      So i have deployed a new server "Cisco UCS C220M4" and created a

      new volume;<br>

      <br>

      Volume Name: testv<br>

      Type: Stripe<br>

      Volume ID: 55cdac79-fe87-4f1f-90c0-15c9100fe00b<br>

      Status: Started<br>

      Number of Bricks: 1 x 2 = 2<br>

      Transport-type: tcp<br>

      Bricks:<br>

      Brick1: 10.70.0.250:/mnt/b1/v<br>

      Brick2: 10.70.0.250:/mnt/b2/v<br>

      Options Reconfigured:<br>

      nfs.disable: off<br>

      features.shard-block-size: 64MB<br>

      features.shard: enable<br>

      cluster.server-quorum-type: server<br>

      cluster.quorum-type: auto<br>

      network.remote-dio: enable<br>

      cluster.eager-lock: enable<br>

      performance.stat-prefetch: off<br>

      performance.io-cache: off<br>

      performance.read-ahead: off<br>

      performance.quick-read: off<br>

      performance.readdir-ahead: off<br>

      <br>

      same error ..<br>

      <br>

      can anyone share with me the info of a working striped volume ? <br>

      <div class="moz-signature"><br>

      </div>

      <div class="moz-cite-prefix">On 03/14/2016 09:02 AM, Mahdi Adnan

        wrote:<br>

      </div>

      <blockquote cite="mid:56E65400.5080905@earthlinktele.com"

        type="cite">

        <meta content="text/html; charset=windows-1252"

          http-equiv="Content-Type">

        I have a pool of two bricks in the same server;<br>

        <br>

        Volume Name: k<br>

        Type: Stripe<br>

        Volume ID: 1e9281ce-2a8b-44e8-a0c6-e3ebf7416b2b<br>

        Status: Started<br>

        Number of Bricks: 1 x 2 = 2<br>

        Transport-type: tcp<br>

        Bricks:<br>

        Brick1: gfs001:/bricks/t1/k<br>

        Brick2: gfs001:/bricks/t2/k<br>

        Options Reconfigured:<br>

        features.shard-block-size: 64MB<br>

        features.shard: on<br>

        cluster.server-quorum-type: server<br>

        cluster.quorum-type: auto<br>

        network.remote-dio: enable<br>

        cluster.eager-lock: enable<br>

        performance.stat-prefetch: off<br>

        performance.io-cache: off<br>

        performance.read-ahead: off<br>

        performance.quick-read: off<br>

        performance.readdir-ahead: off<br>

        <br>

        same issue ...<br>

        glusterfs 3.7.8 built on Mar 10 2016 20:20:45.<br>

        <br>

        <br>

        <div class="moz-signature"><font color="#3366ff"><font

              color="#000000">Respectfully<b><br>

              </b><b>Mahdi A. Mahdi</b></font></font><font

            color="#3366ff"><br>

            <br>

          </font><font color="#3366ff"><font color="#000000">Systems

              Administrator<br>

              IT. Department<br>

              <a moz-do-not-send="true"

                href="https://www.facebook.com/earthlinktele">Earthlink

                Telecommunications</a></font><br>

            <br>

            <font color="#000000">Cell: 07903316180<br>

              Work: 3352<br>

              Skype: </font><a href="mailto:mahdi.adnan@outlook.com"

              moz-do-not-send="true"><font color="#000000">mahdi.adnan@outlook.com</font></a></font></div>

        <div class="moz-cite-prefix">On 03/14/2016 08:11 AM, Niels de

          Vos wrote:<br>

        </div>

        <blockquote

          cite="mid:20160314051137.GC7821@ndevos-x240.usersys.redhat.com"

          type="cite">

          <pre wrap="">On Mon, Mar 14, 2016 at 08:12:27AM +0530, Krutika Dhananjay wrote:

</pre>

          <blockquote type="cite">

            <pre wrap="">It would be better to use sharding over stripe for your vm use case. It

offers better distribution and utilisation of bricks and better heal

performance.

And it is well tested.

</pre>

          </blockquote>

          <pre wrap="">Basically the "striping" feature is deprecated, "sharding" is its

improved replacement. I expect to see "striping" completely dropped in

the next major release.

Niels

</pre>

          <blockquote type="cite">

            <pre wrap="">Couple of things to note before you do that:

1. Most of the bug fixes in sharding have gone into 3.7.8. So it is advised

that you use 3.7.8 or above.

2. When you enable sharding on a volume, already existing files in the

volume do not get sharded. Only the files that are newly created from the

time sharding is enabled will.

    If you do want to shard the existing files, then you would need to cp

them to a temp name within the volume, and then rename them back to the

original file name.

HTH,

Krutika

On Sun, Mar 13, 2016 at 11:49 PM, Mahdi Adnan &lt;<a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:mahdi.adnan@earthlinktele.com">mahdi.adnan@earthlinktele.com</a>

</pre>

            <blockquote type="cite">

              <pre wrap="">wrote:

</pre>

            </blockquote>

            <blockquote type="cite">

              <pre wrap="">I couldn't find anything related to cache in the HBAs.

what logs are useful in my case ? i see only bricks logs which contains

nothing during the failure.

###

[2016-03-13 18:05:19.728614] E [MSGID: 113022] [posix.c:1232:posix_mknod]

0-vmware-posix: mknod on

/bricks/b003/vmware/.shard/17d75e20-16f1-405e-9fa5-99ee7b1bd7f1.511 failed

[File exists]

[2016-03-13 18:07:23.337086] E [MSGID: 113022] [posix.c:1232:posix_mknod]

0-vmware-posix: mknod on

/bricks/b003/vmware/.shard/eef2d538-8eee-4e58-bc88-fbf7dc03b263.4095 failed

[File exists]

[2016-03-13 18:07:55.027600] W [trash.c:1922:trash_rmdir] 0-vmware-trash:

rmdir issued on /.trashcan/, which is not permitted

[2016-03-13 18:07:55.027635] I [MSGID: 115056]

[server-rpc-fops.c:459:server_rmdir_cbk] 0-vmware-server: 41987: RMDIR

/.trashcan/internal_op (00000000-0000-0000-0000-000000000005/internal_op)

==&gt; (Operation not permitted) [Operation not permitted]

[2016-03-13 18:11:34.353441] I [login.c:81:gf_auth] 0-auth/login: allowed

user names: c0c72c37-477a-49a5-a305-3372c1c2f2b4

[2016-03-13 18:11:34.353463] I [MSGID: 115029]

[server-handshake.c:612:server_setvolume] 0-vmware-server: accepted client

from gfs002-2727-2016/03/13-20:17:43:613597-vmware-client-4-0-0 (version:

3.7.8)

[2016-03-13 18:11:34.591139] I [login.c:81:gf_auth] 0-auth/login: allowed

user names: c0c72c37-477a-49a5-a305-3372c1c2f2b4

[2016-03-13 18:11:34.591173] I [MSGID: 115029]

[server-handshake.c:612:server_setvolume] 0-vmware-server: accepted client

from gfs002-2719-2016/03/13-20:17:42:609388-vmware-client-4-0-0 (version:

3.7.8)

###

ESXi just keeps telling me "Cannot clone T: The virtual disk is either

corrupted or not a supported format.

error

3/13/2016 9:06:20 PM

Clone virtual machine

T

VCENTER.LOCAL\Administrator

"

My setup is 2 servers with a floating ip controlled by CTDB and my ESXi

server mount the NFS via the floating ip.

On 03/13/2016 08:40 PM, pkoelle wrote:

</pre>

              <blockquote type="cite">

                <pre wrap="">Am 13.03.2016 um 18:22 schrieb David Gossage:

</pre>

                <blockquote type="cite">

                  <pre wrap="">On Sun, Mar 13, 2016 at 11:07 AM, Mahdi Adnan &lt;

<a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:mahdi.adnan@earthlinktele.com">mahdi.adnan@earthlinktele.com</a>

</pre>

                  <blockquote type="cite">

                    <pre wrap="">wrote:

</pre>

                  </blockquote>

                  <pre wrap="">My HBAs are LSISAS1068E, and the filesystem is XFS.

</pre>

                  <blockquote type="cite">

                    <pre wrap="">I tried EXT4 and it did not help.

I have created a stripted volume in one server with two bricks, same

issue.

and i tried a replicated volume with just "sharding enabled" same issue,

as soon as i disable the sharding it works just fine, niether sharding

nor

striping works for me.

i did follow up with some of threads in the mailing list and tried some

of

the fixes that worked with the others, none worked for me. :(

</pre>

                  </blockquote>

                  <pre wrap="">Is it possible the LSI has write-cache enabled?

</pre>

                </blockquote>

                <pre wrap="">Why is that relevant? Even the backing filesystem has no idea if there is

a RAID or write cache or whatever. There are blocks and sync(), end of

story.

If you lose power and screw up your recovery OR do funky stuff with SAS

multipathing that might be an issue with a controller cache. AFAIK thats

not what we are talking about.

I'm afraid but unless the OP has some logs from the server, a

reproducible testcase or a backtrace from client or server this isn't

getting us anywhere.

cheers

Paul

</pre>

                <blockquote type="cite">

                  <pre wrap="">

On 03/13/2016 06:54 PM, David Gossage wrote:

</pre>

                  <blockquote type="cite">

                    <pre wrap="">

On Sun, Mar 13, 2016 at 8:16 AM, Mahdi Adnan &lt;

<a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:mahdi.adnan@earthlinktele.com">mahdi.adnan@earthlinktele.com</a>&gt; wrote:

Okay so i have enabled shard in my test volume and it did not help,

</pre>

                    <blockquote type="cite">

                      <pre wrap="">stupidly enough, i have enabled it in a production volume

"Distributed-Replicate" and it currpted  half of my VMs.

I have updated Gluster to the latest and nothing seems to be changed in

my situation.

below the info of my volume;

</pre>

                    </blockquote>

                    <pre wrap="">I was pointing at the settings in that email as an example for

corruption

fixing. I wouldn't recommend enabling sharding if you haven't gotten the

base working yet on that cluster. What HBA's are you using and what is

layout of filesystem for bricks?

Number of Bricks: 3 x 2 = 6

</pre>

                    <blockquote type="cite">

                      <pre wrap="">Transport-type: tcp

Bricks:

Brick1: gfs001:/bricks/b001/vmware

Brick2: gfs002:/bricks/b004/vmware

Brick3: gfs001:/bricks/b002/vmware

Brick4: gfs002:/bricks/b005/vmware

Brick5: gfs001:/bricks/b003/vmware

Brick6: gfs002:/bricks/b006/vmware

Options Reconfigured:

performance.strict-write-ordering: on

cluster.server-quorum-type: server

cluster.quorum-type: auto

network.remote-dio: enable

performance.stat-prefetch: disable

performance.io-cache: off

performance.read-ahead: off

performance.quick-read: off

cluster.eager-lock: enable

features.shard-block-size: 16MB

features.shard: on

performance.readdir-ahead: off

On 03/12/2016 08:11 PM, David Gossage wrote:

On Sat, Mar 12, 2016 at 10:21 AM, Mahdi Adnan &lt;

<a moz-do-not-send="true" class="moz-txt-link-rfc2396E" href="mailto:mahdi.adnan@earthlinktele.com">&lt;mahdi.adnan@earthlinktele.com&gt;</a><a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:mahdi.adnan@earthlinktele.com">mahdi.adnan@earthlinktele.com</a>&gt; wrote:

Both servers have HBA no RAIDs and i can setup a replicated or

</pre>

                      <blockquote type="cite">

                        <pre wrap="">dispensers without any issues.

Logs are clean and when i tried to migrate a vm and got the error,

nothing showed up in the logs.

i tried mounting the volume into my laptop and it mounted fine but,

if i

use dd to create a data file it just hang and i cant cancel it, and i

cant

unmount it or anything, i just have to reboot.

The same servers have another volume on other bricks in a distributed

replicas, works fine.

I have even tried the same setup in a virtual environment (created two

vms and install gluster and created a replicated striped) and again

same

thing, data corruption.

</pre>

                      </blockquote>

                      <pre wrap="">I'd look through mail archives for a topic "Shard in Production" I

think

it's called.  The shard portion may not be relevant but it does discuss

certain settings that had to be applied with regards to avoiding

corruption

with VM's.  You may want to try and disable the

performance.readdir-ahead

also.

</pre>

                      <blockquote type="cite">

                        <pre wrap="">On 03/12/2016 07:02 PM, David Gossage wrote:

On Sat, Mar 12, 2016 at 9:51 AM, Mahdi Adnan &lt;

<a moz-do-not-send="true" class="moz-txt-link-rfc2396E" href="mailto:mahdi.adnan@earthlinktele.com">&lt;mahdi.adnan@earthlinktele.com&gt;</a><a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:mahdi.adnan@earthlinktele.com">mahdi.adnan@earthlinktele.com</a>&gt; wrote:

Thanks David,

</pre>

                        <blockquote type="cite">

                          <pre wrap="">My settings are all defaults, i have just created the pool and

started

it.

I have set the settings as your recommendation and it seems to be the

same issue;

Type: Striped-Replicate

Volume ID: 44adfd8c-2ed1-4aa5-b256-d12b64f7fc14

Status: Started

Number of Bricks: 1 x 2 x 2 = 4

Transport-type: tcp

Bricks:

Brick1: gfs001:/bricks/t1/s

Brick2: gfs002:/bricks/t1/s

Brick3: gfs001:/bricks/t2/s

Brick4: gfs002:/bricks/t2/s

Options Reconfigured:

performance.stat-prefetch: off

network.remote-dio: on

cluster.eager-lock: enable

performance.io-cache: off

performance.read-ahead: off

performance.quick-read: off

performance.readdir-ahead: on

</pre>

                        </blockquote>

                        <pre wrap="">Is their a raid controller perhaps doing any caching?

In the gluster logs any errors being reported during migration

process?

Since they aren't in use yet have you tested making just mirrored

bricks

using different pairings of servers two at a time to see if problem

follows

certain machine or network ports?

</pre>

                        <blockquote type="cite">

                          <pre wrap="">

On 03/12/2016 03:25 PM, David Gossage wrote:

On Sat, Mar 12, 2016 at 1:55 AM, Mahdi Adnan &lt;

<a moz-do-not-send="true" class="moz-txt-link-rfc2396E" href="mailto:mahdi.adnan@earthlinktele.com">&lt;mahdi.adnan@earthlinktele.com&gt;</a><a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:mahdi.adnan@earthlinktele.com">mahdi.adnan@earthlinktele.com</a>&gt; wrote:

Dears,

</pre>

                          <blockquote type="cite">

                            <pre wrap="">I have created a replicated striped volume with two bricks and two

servers but I can't use it because when I mount it in ESXi and try

to

migrate a VM to it, the data get corrupted.

Is any one have any idea why is this happening ?

Dell 2950 x2

Seagate 15k 600GB

CentOS 7.2

Gluster 3.7.8

Appreciate your help.

</pre>

                          </blockquote>

                          <pre wrap="">Most reports of this I have seen end up being settings related.  Post

gluster volume info. Below is what I have seen as most common

recommended

settings.

I'd hazard a guess you may have some the read ahead cache or prefetch

on.

quick-read=off

read-ahead=off

io-cache=off

stat-prefetch=off

eager-lock=enable

remote-dio=on

</pre>

                          <blockquote type="cite">

                            <pre wrap="">Mahdi Adnan

System Admin

_______________________________________________

Gluster-users mailing list

<a moz-do-not-send="true" class="moz-txt-link-rfc2396E" href="mailto:Gluster-users@gluster.org">&lt;Gluster-users@gluster.org&gt;</a><a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:Gluster-users@gluster.org">Gluster-users@gluster.org</a>

<a moz-do-not-send="true" class="moz-txt-link-rfc2396E" href="http://www.gluster.org/mailman/listinfo/gluster-users">&lt;http://www.gluster.org/mailman/listinfo/gluster-users&gt;</a>

<a moz-do-not-send="true" class="moz-txt-link-freetext" href="http://www.gluster.org/mailman/listinfo/gluster-users">http://www.gluster.org/mailman/listinfo/gluster-users</a>

</pre>

                          </blockquote>

                          <pre wrap="">

</pre>

                        </blockquote>

                      </blockquote>

                    </blockquote>

                  </blockquote>

                  <pre wrap="">_______________________________________________

Gluster-users mailing list

<a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:Gluster-users@gluster.org">Gluster-users@gluster.org</a>

<a moz-do-not-send="true" class="moz-txt-link-freetext" href="http://www.gluster.org/mailman/listinfo/gluster-users">http://www.gluster.org/mailman/listinfo/gluster-users</a>

</pre>

                </blockquote>

                <pre wrap="">_______________________________________________

Gluster-users mailing list

<a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:Gluster-users@gluster.org">Gluster-users@gluster.org</a>

<a moz-do-not-send="true" class="moz-txt-link-freetext" href="http://www.gluster.org/mailman/listinfo/gluster-users">http://www.gluster.org/mailman/listinfo/gluster-users</a>

</pre>

              </blockquote>

              <pre wrap="">_______________________________________________

Gluster-users mailing list

<a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:Gluster-users@gluster.org">Gluster-users@gluster.org</a>

<a moz-do-not-send="true" class="moz-txt-link-freetext" href="http://www.gluster.org/mailman/listinfo/gluster-users">http://www.gluster.org/mailman/listinfo/gluster-users</a>

</pre>

            </blockquote>

          </blockquote>

          <blockquote type="cite">

            <pre wrap="">_______________________________________________

Gluster-users mailing list

<a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:Gluster-users@gluster.org">Gluster-users@gluster.org</a>

<a moz-do-not-send="true" class="moz-txt-link-freetext" href="http://www.gluster.org/mailman/listinfo/gluster-users">http://www.gluster.org/mailman/listinfo/gluster-users</a>

</pre>

          </blockquote>

        </blockquote>

        <br>

        <br>

        <fieldset class="mimeAttachmentHeader"></fieldset>

        <br>

        <pre wrap="">_______________________________________________

Gluster-users mailing list

<a moz-do-not-send="true" class="moz-txt-link-abbreviated" href="mailto:Gluster-users@gluster.org">Gluster-users@gluster.org</a>

<a moz-do-not-send="true" class="moz-txt-link-freetext" href="http://www.gluster.org/mailman/listinfo/gluster-users">http://www.gluster.org/mailman/listinfo/gluster-users</a></pre>

      </blockquote>

      <br>

      <br>

      <fieldset class="mimeAttachmentHeader"></fieldset>

      <br>

      <pre wrap="">_______________________________________________

Gluster-users mailing list

<a class="moz-txt-link-abbreviated" href="mailto:Gluster-users@gluster.org">Gluster-users@gluster.org</a>

<a class="moz-txt-link-freetext" href="http://www.gluster.org/mailman/listinfo/gluster-users">http://www.gluster.org/mailman/listinfo/gluster-users</a></pre>

    </blockquote>

    <br>

  </body>

</html>