<div dir="ltr"><div><div><div><div><div><div>Hi Ravi,<br><br></div>As we discussed earlier, I investigated this issue and found that healing is not triggered because the &quot;gluster volume heal c_glusterfs info split-brain&quot; command does not list any entries, even though the file is in split-brain.<br><br></div>So I manually deleted the gfid entry of that file from the .glusterfs directory and followed the instructions at the following link to trigger the heal:<br><br><a href="https://github.com/gluster/glusterfs/blob/master/doc/debugging/split-brain.md">https://github.com/gluster/glusterfs/blob/master/doc/debugging/split-brain.md</a><br><br></div>This worked fine for me.<br><br></div>But my question is: why does the split-brain command not show any files in its output?<br><br></div><div>I am attaching all the logs I collected from the node, along with the output of the commands from both boards.<br><br></div><div>The tar file contains two directories:<br><br></div><div>000300 - logs for the board that was running continuously<br></div><div>002500 - logs for the board that was rebooted<br><br></div><div>I am waiting for your reply; please help me with this issue.<br><br></div><div>Thanks in advance.<br></div><div><br></div>Regards,<br></div>Abhishek<br></div><div class="gmail_extra"><br><div class="gmail_quote">On Fri, Feb 26, 2016 at 1:21 PM, ABHISHEK PALIWAL <span dir="ltr">&lt;<a href="mailto:abhishpaliwal@gmail.com" target="_blank">abhishpaliwal@gmail.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote"><span class="">On Fri, Feb 26, 2016 at 10:28 AM, Ravishankar N <span dir="ltr">&lt;<a href="mailto:ravishankar@redhat.com" target="_blank">ravishankar@redhat.com</a>&gt;</span> wrote:<br></span><blockquote class="gmail_quote" style="margin:0px 0px 0px 
0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">

  
  <div text="#000000" bgcolor="#FFFFFF"><span class="">

    <div>On 02/26/2016 10:10 AM, ABHISHEK

      PALIWAL wrote:<br>

    </div>

    <blockquote type="cite">

      <p dir="ltr">Yes correct</p>

    </blockquote>

    <br>

    Okay, so when you say the files are not in sync until some time, are

    you getting stale data when accessing from the mount?<br>

    I&#39;m not able to figure out why heal info shows zero when the files

    are not in sync, despite all IO happening from the mounts. Could you

    provide the output of getfattr -d -m . -e hex /brick/file-name from

    both bricks when you hit this issue?</span><div><div><br>
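<div>When reading that getfattr output, the values to look at are the trusted.afr.* changelog xattrs. As described in the GlusterFS split-brain guide, each value is 12 bytes holding three 32-bit big-endian counters: pending data, metadata, and entry operations. The following is a minimal Python sketch for decoding one such value; the function name decode_afr_changelog and the sample hex string are illustrative assumptions, not output from this setup.</div>
<pre>
```python
import struct


def decode_afr_changelog(hex_value):
    """Split one trusted.afr.* xattr value into its three pending counters.

    hex_value is the string printed by getfattr with -e hex,
    with or without the leading 0x.
    """
    text = hex_value.strip().lower()
    if text.startswith("0x"):
        text = text[2:]
    raw = bytes.fromhex(text)
    if len(raw) != 12:
        raise ValueError("expected 12 bytes, got %d" % len(raw))
    # Three unsigned 32-bit integers in network (big-endian) byte order:
    # pending data, metadata, and entry operations.
    data, metadata, entry = struct.unpack("!III", raw)
    return {"data": data, "metadata": metadata, "entry": entry}


if __name__ == "__main__":
    # Hypothetical value, shaped like what getfattr -e hex prints.
    print(decode_afr_changelog("0x000000020000000100000000"))
```
</pre>
<div>If each brick holds a nonzero data counter that blames the other brick, the guide treats the file as being in data split-brain; all-zero counters on both bricks would be consistent with heal info reporting nothing.</div>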

    </div><div>I&#39;ll provide the logs once I have them. Here, the delay means we power on the second board after 10 minutes.<br></div><div><div class="h5"><div>

    <br>

    <br>

    <blockquote type="cite">

      <div class="gmail_quote">On Feb 26, 2016 9:57 AM, &quot;Ravishankar N&quot;

        &lt;<a href="mailto:ravishankar@redhat.com" target="_blank">ravishankar@redhat.com</a>&gt;

        wrote:<br type="attribution">

        <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

          <div text="#000000" bgcolor="#FFFFFF">

            <div>Hello,<br>

              <br>

              On 02/26/2016 08:29 AM, ABHISHEK PALIWAL wrote:<br>

            </div>

            <blockquote type="cite">

              <div dir="ltr">

                <div>

                  <div>

                    <div>

                      <div>

                        <div>

                          <div>

                            <div>Hi Ravi,<br>

                              <br>

                            </div>

                            Thanks for the response.<br>

                            <br>

                          </div>

                          We are using GlusterFS 3.7.8.<br>

                          <br>

                          Here is the use case:<br>

                          <br>

                          <span style="color:rgb(0,0,0)">We have a
                            log file that records events for every board
                            of a node, and these files are kept in sync
                            using GlusterFS. The system runs in replica 2
                            mode, which means: <span>When one
                              brick in a replicated volume goes offline,
                              the glusterd daemons on the other nodes
                              keep track of all the files that are not
                              replicated to the offline brick. When the
                              offline brick becomes available again, the
                              cluster initiates a healing process,
                              replicating the updated files to that
                              brick. </span>But in our case, we see
                            that the log file of one board is out of
                            sync and its format is corrupted, which means
                            the files are not in sync.</span><br>

                        </div>

                      </div>

                    </div>

                  </div>

                </div>

              </div>

            </blockquote>

            <br>

            Just to understand you correctly: you have mounted the
            2-node replica-2 volume on both of these nodes and are writing
            to a log file from the mounts, right? <br>

            <br>

            <blockquote type="cite">

              <div dir="ltr">

                <div>

                  <div>

                    <div>

                      <div>

                        <div><br>

                        </div>

                        Even the output of <span><span>#gluster volume
                            heal c_glusterfs info shows that there are no
                            pending heals.<br>
                            <br>
                          </span></span><span><span>Also, the log
                            file being updated has a fixed size, so
                            new entries wrap around and overwrite
                            the old entries.<br>
                            <br>
                            As a result, after a few
                            restarts the contents of the same file on
                            the two bricks differ, but volume
                            heal info shows zero entries.<br>

                            <br>

                          </span></span></div>
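<div>One way to confirm such a mismatch independently of heal info is to checksum the file directly on each brick and compare the digests. The following is a minimal Python sketch under that assumption; file_digest and copies_match are hypothetical helper names, and the paths passed in would be the brick-side paths of the log file.</div>
<pre>
```python
import hashlib


def file_digest(path, chunk_size=65536):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        while True:
            chunk = handle.read(chunk_size)
            if not chunk:
                break
            digest.update(chunk)
    return digest.hexdigest()


def copies_match(path_a, path_b):
    """True when both files have identical contents (by SHA-256)."""
    return file_digest(path_a) == file_digest(path_b)
```
</pre>
<div>Since the two bricks live on different boards, file_digest would normally be run once per board and the two hex strings compared by hand; copies_match only applies when both copies are readable from one host.</div>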

                      <span><span>Solution:<br>

                          <br>

                        </span></span></div>

                    <span><span>But when we add a delay of </span></span><span><span><span><span>
                            &gt; 5 min</span></span> before the healing,
                        everything works fine.<br>

                        <br>

                      </span></span></div>

                  <span><span>Regards,<br>

                    </span></span></div>

                <span><span>Abhishek<br>

                  </span></span> </div>

              <div class="gmail_extra"><br>

                <div class="gmail_quote">On Fri, Feb 26, 2016 at 6:35

                  AM, Ravishankar N <span dir="ltr">&lt;<a href="mailto:ravishankar@redhat.com" target="_blank">ravishankar@redhat.com</a>&gt;</span>

                  wrote:<br>

                  <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

                    <div text="#000000" bgcolor="#FFFFFF"><span>

                        <div>On 02/25/2016 06:01 PM, ABHISHEK PALIWAL

                          wrote:<br>

                        </div>

                        <blockquote type="cite">

                          <div dir="ltr">

                            <div>

                              <div>

                                <div>

                                  <div>Hi,<br>

                                    <br>

                                  </div>

                                  Here, I have one query regarding the

                                  time taken by the healing process.<br>

                                </div>

                                In our current two-node setup, when we
                                reboot one node, the self-healing
                                process starts within 5 minutes
                                on the board, which results in
                                corruption of some files' data.<br>

                              </div>

                            </div>

                          </div>

                        </blockquote>

                        <br>

                      </span> Heal should start immediately after the

                      brick process comes up. What version of gluster

                      are you using? What do you mean by corruption of

                      data? Also, how did you observe that the heal

                      started after 5 minutes?<br>

                      -Ravi<br>

                      <blockquote type="cite"><span>

                          <div dir="ltr">

                            <div>

                              <div><br>

                              </div>

                              To resolve it, I searched on Google
                              and found the following link:<br>

                              <a href="https://support.rackspace.com/how-to/glusterfs-troubleshooting/" target="_blank">https://support.rackspace.com/how-to/glusterfs-troubleshooting/</a><br>

                              <br>

                            </div>

                            <div>It mentions that the healing process can
                              take up to 10 minutes to
                              start.<br>

                              <br>

                            </div>

                            <div>Here is the statement from the link:<br>

                              <br>

                              &quot;Healing replicated volumes <br>

                              <br>

                              When any brick in a replicated volume goes

                              offline, the glusterd daemons on the

                              remaining nodes keep track of all the

                              files that are not replicated to the

                              offline brick. When the offline brick

                              becomes available again, the cluster

                              initiates a healing process, replicating

                              the updated files to that brick. <b>The

                                start of this process can take up to 10

                                minutes, based on observation.</b>&quot; <br>

                            </div>

                            <div><br>

                            </div>

                            <div>After allowing more than 5
                              minutes, the file corruption problem was
                              resolved.<br>

                            </div>

                            <div><br>

                            </div>

                            <div>So my question is: is there any way
                              to reduce the time
                              the healing process takes to start?<br>

                              <br>

                            </div>

                            <br>

                            Regards,<br>

                            Abhishek Paliwal<br clear="all">

                            <br>

                            <br>

                          </div>

                          <br>

                          <fieldset></fieldset>

                          <br>

                        </span>

                        <pre>_______________________________________________

Gluster-devel mailing list

<a href="mailto:Gluster-devel@gluster.org" target="_blank">Gluster-devel@gluster.org</a>

<a href="http://www.gluster.org/mailman/listinfo/gluster-devel" target="_blank">http://www.gluster.org/mailman/listinfo/gluster-devel</a></pre>

                      </blockquote>

                      <br>

                      <br>

                    </div>

                  </blockquote>

                </div>

                <br>

                <br clear="all">

                <br>

                -- <br>

                <div>

                  <div dir="ltr"><br>

                    <br>

                    <br>

                    <br>

                    Regards<br>

                    Abhishek Paliwal<br>

                  </div>

                </div>

              </div>

            </blockquote>

            <br>

            <br>

          </div>

        </blockquote>

      </div>

    </blockquote>

    <br>

    <br>

  </div></div></div></div></div><span class="HOEnZb"><font color="#888888">


</font></span></blockquote></div><span class="HOEnZb"><font color="#888888"><br><br clear="all"><br>-- <br><div><div dir="ltr"><br><br><br><br>Regards<br>

Abhishek Paliwal<br>

</div></div>

</font></span></div></div>

</blockquote></div><br><br clear="all"><br>-- <br><div class="gmail_signature"><div dir="ltr"><br><br><br><br>Regards<br>

Abhishek Paliwal<br>

</div></div>

</div>