<div dir="ltr"><br><div class="gmail_extra"><div class="gmail_quote">On 15 June 2016 at 08:09, Atin Mukherjee <span dir="ltr">&lt;<a href="mailto:amukherj@redhat.com" target="_blank">amukherj@redhat.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class=""><br>

<br>

On 06/15/2016 12:14 PM, Arif Ali wrote:<br>

&gt;<br>

&gt; On 15 June 2016 at 06:48, Atin Mukherjee &lt;<a href="mailto:amukherj@redhat.com">amukherj@redhat.com</a><br>

</span><span class="">&gt; &lt;mailto:<a href="mailto:amukherj@redhat.com">amukherj@redhat.com</a>&gt;&gt; wrote:<br>

&gt;<br>

&gt;<br>

&gt;<br>

&gt;     On 06/15/2016 11:06 AM, Gandalf Corvotempesta wrote:<br>

&gt;     &gt; Il 15 giu 2016 07:09, &quot;Atin Mukherjee&quot; &lt;<a href="mailto:amukherj@redhat.com">amukherj@redhat.com</a> &lt;mailto:<a href="mailto:amukherj@redhat.com">amukherj@redhat.com</a>&gt;<br>

</span>&gt;     &gt; &lt;mailto:<a href="mailto:amukherj@redhat.com">amukherj@redhat.com</a> &lt;mailto:<a href="mailto:amukherj@redhat.com">amukherj@redhat.com</a>&gt;&gt;&gt; ha scritto:<br>

<span class="">&gt;     &gt;&gt; To get rid of this situation you&#39;d need to stop all the running glusterd<br>

&gt;     &gt;&gt; instances and go into /var/lib/glusterd/peers folder on all the nodes<br>

&gt;     &gt;&gt; and manually correct the UUID file names and their content if required.<br>

&gt;     &gt;<br>

&gt;     &gt; If i understood properly the only way to fix this is by bringing the<br>

&gt;     &gt; whole cluster down? &quot;you&#39;d need to stop all the running glusterd instances&quot;<br>

&gt;     &gt;<br>

&gt;     &gt; I hope you are referring to all instances on the failed node...<br>

&gt;<br>

&gt;     No, since the configuration are synced across all the nodes, any<br>

&gt;     incorrect data gets replicated through out. So in this case to be on the<br>

&gt;     safer side and validate the correctness all glusterd instances on *all*<br>

&gt;     the nodes should be brought down. Having said that, this doesn&#39;t impact<br>

&gt;     I/O as the management path is different than I/O.<br>

&gt;<br>

&gt;<br>

&gt; As a sanity, one of the things I did last night, was to reboot the whole<br>

&gt; gluster system, when I had downtime arranged. I thought this is<br>

&gt; something would be asked, as I had seen similar requests on the mailing<br>

&gt; list previously<br>

&gt;<br>

&gt; Unfortunately though, it didn&#39;t fix the problem.<br>

<br>

</span>Only reboot is not going to solve the problem. You&#39;d need to correct the<br>

configuration as I explained earlier in this thread. If it doesn&#39;t<br>

please send the me the content of /var/lib/glusterd/peers/ &amp;<br>

/var/lib/glusterd/<a href="http://glusterd.info" rel="noreferrer" target="_blank">glusterd.info</a> file from all the nodes where glusterd<br>

instances are running. I&#39;ll take a look and correct them and send it<br>

back to you.<br></blockquote><div><br></div><div>Thanks Atin,</div><div><br></div><div>Apologies, I missed your mail, as I was travelling</div><div><br></div><div>I have checked the relevant files you have mentioned, and they seem to look correct to me, but I have attached it for sanity, maybe you can spot something, that I have not seen</div></div></div></div>