<div dir="ltr"><div><div><div><div><div><div>Hi Ravi,<br><br></div>As I discussed earlier this issue, I investigated this issue and find that healing is not triggered because the &quot;gluster volume heal c_glusterfs info split-brain&quot; command not showing any entries as a outcome of this command even though the file in split brain case.<br><br></div>So, what I have done I manually deleted the gfid entry of that file from .glusterfs directory and follow the instruction mentioned in the following link to do heal<br><br><a href="https://github.com/gluster/glusterfs/blob/master/doc/debugging/split-brain.md">https://github.com/gluster/glusterfs/blob/master/doc/debugging/split-brain.md</a><br><br></div>and this works fine for me.<br><br></div>But my question is why the split-brain command not showing any file in output.<br><br></div><div>Here I am attaching all the log which I get from the node for you and also the output of commands from both of the boards<br><br></div><div>In this tar file two directories are present <br><br></div><div>000300 - log for the board which is running continuously<br></div><div>002500-  log for the board which is rebooted <br><br></div><div>I am waiting for your reply please help me out on this issue.<br><br></div><div>Thanks in advanced.<br></div><div><br></div>Regards,<br></div>Abhishek<br></div><div class="gmail_extra"><br><div class="gmail_quote">On Fri, Feb 26, 2016 at 1:21 PM, ABHISHEK PALIWAL <span dir="ltr">&lt;<a href="mailto:abhishpaliwal@gmail.com" target="_blank">abhishpaliwal@gmail.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote"><span class="">On Fri, Feb 26, 2016 at 10:28 AM, Ravishankar N <span dir="ltr">&lt;<a href="mailto:ravishankar@redhat.com" target="_blank">ravishankar@redhat.com</a>&gt;</span> wrote:<br></span><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
  
    
  
  <div text="#000000" bgcolor="#FFFFFF"><span class="">
    <div>On 02/26/2016 10:10 AM, ABHISHEK
      PALIWAL wrote:<br>
    </div>
    <blockquote type="cite">
      <p dir="ltr">Yes correct</p>
    </blockquote>
    <br>
    Okay, so when you say the files are not in sync until some time, are
    you getting stale data when accessing from the mount?<br>
    I&#39;m not able to figure out why heal info shows zero when the files
    are not in sync, despite all IO happening from the mounts. Could you
    provide the output of getfattr -d -m . -e hex /brick/file-name from
    both bricks when you hit this issue?</span><div><div><br>
    </div><div>I&#39;ll provide the logs once I get. here delay means we are powering on the second board after the 10 minutes.<br></div><div><div class="h5"><div>
    <br>
    <br>
    <blockquote type="cite">
      <div class="gmail_quote">On Feb 26, 2016 9:57 AM, &quot;Ravishankar N&quot;
        &lt;<a href="mailto:ravishankar@redhat.com" target="_blank">ravishankar@redhat.com</a>&gt;
        wrote:<br type="attribution">
        <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
          <div text="#000000" bgcolor="#FFFFFF">
            <div>Hello,<br>
              <br>
              On 02/26/2016 08:29 AM, ABHISHEK PALIWAL wrote:<br>
            </div>
            <blockquote type="cite">
              <div dir="ltr">
                <div>
                  <div>
                    <div>
                      <div>
                        <div>
                          <div>
                            <div>Hi Ravi,<br>
                              <br>
                            </div>
                            Thanks for the response.<br>
                            <br>
                          </div>
                          We are using Glugsterfs-3.7.8<br>
                          <br>
                          Here is the use case:<br>
                          <br>
                          <span style="color:rgb(0,0,0)">We have a
                            logging file which saves logs of the events
                            for every board of a node and these files
                            are in sync using glusterfs. System in
                            replica 2 mode it means <span>When one
                              brick in a replicated volume goes offline,
                              the glusterd daemons on the other nodes
                              keep track of all the files that are not
                              replicated to the offline brick. When the
                              offline brick becomes available again, the
                              cluster initiates a healing process,
                              replicating the updated files to that
                              brick. </span>But in our casse, we see
                            that log file of one board is not in the
                            sync and its format is corrupted means files
                            are not in sync.</span><br>
                        </div>
                      </div>
                    </div>
                  </div>
                </div>
              </div>
            </blockquote>
            <br>
            Just to understand you correctly, you have mounted the 2
            node replica-2 volume on both these nodes and writing to a
            logging file from the mounts right? <br>
            <br>
            <blockquote type="cite">
              <div dir="ltr">
                <div>
                  <div>
                    <div>
                      <div>
                        <div><br>
                        </div>
                        Even the outcome of <span><span>#gluster volume
                            heal c_glusterfs info shows that there is no
                            pending heals.<br>
                            <br>
                          </span></span><span><span>Also , The logging
                            file which is updated is of fixed size and
                            the new entries will be wrapped ,overwriting
                            the old entries.<br>
                            <br>
                            This way we have seen that after few
                            restarts , the contents of the same file on
                            two bricks are different , but the volume
                            heal info shows zero entries<br>
                            <br>
                          </span></span></div>
                      <span><span>Solution:<br>
                          <br>
                        </span></span></div>
                    <span><span>But when we tried to put delay </span></span><span><span><span><span>
                            &gt; 5 min</span></span> before the healing
                        everything is working fine.<br>
                        <br>
                      </span></span></div>
                  <span><span>Regards,<br>
                    </span></span></div>
                <span><span>Abhishek<br>
                  </span></span> </div>
              <div class="gmail_extra"><br>
                <div class="gmail_quote">On Fri, Feb 26, 2016 at 6:35
                  AM, Ravishankar N <span dir="ltr">&lt;<a href="mailto:ravishankar@redhat.com" target="_blank"></a><a href="mailto:ravishankar@redhat.com" target="_blank">ravishankar@redhat.com</a>&gt;</span>
                  wrote:<br>
                  <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
                    <div text="#000000" bgcolor="#FFFFFF"><span>
                        <div>On 02/25/2016 06:01 PM, ABHISHEK PALIWAL
                          wrote:<br>
                        </div>
                        <blockquote type="cite">
                          <div dir="ltr">
                            <div>
                              <div>
                                <div>
                                  <div>Hi,<br>
                                    <br>
                                  </div>
                                  Here, I have one query regarding the
                                  time taken by the healing process.<br>
                                </div>
                                In current two node setup when we
                                rebooted one node then the self-healing
                                process starts less than 5min interval
                                on the board which resulting the
                                corruption of the some files data.<br>
                              </div>
                            </div>
                          </div>
                        </blockquote>
                        <br>
                      </span> Heal should start immediately after the
                      brick process comes up. What version of gluster
                      are you using? What do you mean by corruption of
                      data? Also, how did you observe that the heal
                      started after 5 minutes?<br>
                      -Ravi<br>
                      <blockquote type="cite"><span>
                          <div dir="ltr">
                            <div>
                              <div><br>
                              </div>
                              And to resolve it I have search on google
                              and found the following link:<br>
                              <a href="https://support.rackspace.com/how-to/glusterfs-troubleshooting/" target="_blank">https://support.rackspace.com/how-to/glusterfs-troubleshooting/</a><br>
                              <br>
                            </div>
                            <div>Mentioning that the healing process can
                              takes upto 10min of time to start this
                              process.<br>
                              <br>
                            </div>
                            <div>Here is the statement from the link:<br>
                              <br>
                              &quot;Healing replicated volumes <br>
                              <br>
                              When any brick in a replicated volume goes
                              offline, the glusterd daemons on the
                              remaining nodes keep track of all the
                              files that are not replicated to the
                              offline brick. When the offline brick
                              becomes available again, the cluster
                              initiates a healing process, replicating
                              the updated files to that brick. <b>The
                                start of this process can take up to 10
                                minutes, based on observation.</b>&quot; <br>
                            </div>
                            <div><br>
                            </div>
                            <div>After giving the time of more than 5
                              min file corruption problem has been
                              resolved.<br>
                            </div>
                            <div><br>
                            </div>
                            <div>So, Here my question is there any way
                              through which we can reduce the time taken
                              by the healing process to start?<br>
                              <br>
                            </div>
                            <br>
                            Regards,<br>
                            Abhishek Paliwal<br clear="all">
                            <br>
                            <br>
                          </div>
                          <br>
                          <fieldset></fieldset>
                          <br>
                        </span>
                        <pre>_______________________________________________
Gluster-devel mailing list
<a href="mailto:Gluster-devel@gluster.org" target="_blank">Gluster-devel@gluster.org</a>
<a href="http://www.gluster.org/mailman/listinfo/gluster-devel" target="_blank">http://www.gluster.org/mailman/listinfo/gluster-devel</a></pre>
                      </blockquote>
                      <br>
                      <br>
                    </div>
                  </blockquote>
                </div>
                <br>
                <br clear="all">
                <br>
                -- <br>
                <div>
                  <div dir="ltr"><br>
                    <br>
                    <br>
                    <br>
                    Regards<br>
                    Abhishek Paliwal<br>
                  </div>
                </div>
              </div>
            </blockquote>
            <br>
            <br>
          </div>
        </blockquote>
      </div>
    </blockquote>
    <br>
    <br>
  </div></div></div></div></div><span class="HOEnZb"><font color="#888888">

</font></span></blockquote></div><span class="HOEnZb"><font color="#888888"><br><br clear="all"><br>-- <br><div><div dir="ltr"><br><br><br><br>Regards<br>
Abhishek Paliwal<br>
</div></div>
</font></span></div></div>
</blockquote></div><br><br clear="all"><br>-- <br><div class="gmail_signature"><div dir="ltr"><br><br><br><br>Regards<br>
Abhishek Paliwal<br>
</div></div>
</div>