<html><body><div style="font-family: garamond,new york,times,serif; font-size: 12pt; color: #000000"><div><br></div><div><br></div><hr id="zwchr"><blockquote style="border-left:2px solid #1010FF;margin-left:5px;padding-left:5px;color:#000;font-weight:normal;font-style:normal;text-decoration:none;font-family:Helvetica,Arial,sans-serif;font-size:12pt;" data-mce-style="border-left: 2px solid #1010FF; margin-left: 5px; padding-left: 5px; color: #000; font-weight: normal; font-style: normal; text-decoration: none; font-family: Helvetica,Arial,sans-serif; font-size: 12pt;"><b>From: </b>"Lindsay Mathieson" &lt;lindsay.mathieson@gmail.com&gt;<br><b>To: </b>gluster-users@gluster.org<br><b>Sent: </b>Tuesday, December 8, 2015 7:29:00 AM<br><b>Subject: </b>Re: [Gluster-users] Strange file corruption<br><div><br></div><div class="moz-cite-prefix">Hi Udo, thanks for posting your volume info settings. Please note for the following, I am not one of the devs, just a user, so unfortunately I have no authoritative answers :(<br><br> I am running a very similar setup - Proxmox 4.0, three nodes, but using ceph for our production storage. Am heavily testing gluster 3.7 on the side. We find the performance of ceph slow on these small setups and management of it a PITA.<br><br><br> Some more questions<br><br> - how are your VM images being accessed by Proxmox? gfapi? (Proxmox Gluster storage type) or by using the fuse mount?<br><br> - whats your underlying filesystem (ext4, zfs etc)<br><br> - Are you using the HA/Watchdog system in Proxmox?<br><br><br><br> On 07/12/15 21:03, Udo Giacomozzi wrote:<br></div><blockquote cite="mid:56656779.5000707@indunet.it">esterday I had a strange situation where Gluster healing corrupted <b class="moz-txt-star"><span class="moz-txt-tag">*</span>all<span class="moz-txt-tag">*</span></b> my VM images. <br><br><br> In detail: <br> I had about 15 VMs running (in Proxmox 4.0) totaling about 600 GB of qcow2 images. Gluster is used as storage for those images in replicate 3 setup (ie. 3 physical servers replicating all data). <br> All VMs were running on machine #1 - the two other machines (#2 and #3) were <b class="moz-txt-star"><span class="moz-txt-tag">*</span>idle<span class="moz-txt-tag">*</span></b>. <br> Gluster was fully operating (no healing) when I rebooted machine #2. <br> For other reasons I had to reboot machines #2 and #3 a few times, but since all VMs were running on machine #1 and nothing on the other machines was accessing Gluster files, I was confident that this wouldn't disturb Gluster. <br> But anyway this means that I rebootet Gluster nodes during a healing process. <br><br> After a few minutes, Gluster files began showing corruption - up to the point that the qcow2 files became unreadable and all VMs stopped working.</blockquote><br><br> :( sounds painful - my sympathies. <br><br> You're running 3.5.2 - thats getting rather old. I use the gluster debian repos:<br><br> &nbsp; 3.6.7 : <a class="moz-txt-link-freetext" href="http://download.gluster.org/pub/gluster/glusterfs/3.6/LATEST/Debian/" target="_blank" data-mce-href="http://download.gluster.org/pub/gluster/glusterfs/3.6/LATEST/Debian/">http://download.gluster.org/pub/gluster/glusterfs/3.6/LATEST/Debian/</a><br> &nbsp; 3.7.6 : <a class="moz-txt-link-freetext" href="http://download.gluster.org/pub/gluster/glusterfs/LATEST/Debian/jessie/" target="_blank" data-mce-href="http://download.gluster.org/pub/gluster/glusterfs/LATEST/Debian/jessie/">http://download.gluster.org/pub/gluster/glusterfs/LATEST/Debian/jessie/</a><br><br> 3.6.x is the latest stable, 3.7 is close to stable(?) 3.7 has some nice new features such as sharding, which is very useful for VM hosting - it enables much faster heal times.<br><br> Regards what happened with your VM's, I'm not sure. Having two servers down should have disabled the entire store making it not readable or writable. I note that you are missing some settings that need to be set for VM stores - there will be corruption problems if you live migrate without them.<br><br><pre>quick-read=off

read-ahead=off

io-cache=off

stat-prefetch=off

eager-lock=enable

remote-dio=enable

quorum-type=auto

server-quorum-type=server

</pre></blockquote><div><br></div><div>Perfectly put. I am one of the devs who work on replicate module. You can alternatively enable this configuration in one shot using the following command for VM workloads:<br></div><div># gluster volume set &lt;VOLNAME&gt; group virt<br></div><div><br></div><div>Like Lindsay mentioned, it is not clear why the brick on the first node did not go down when the other two nodes were rebooted, given that server quorum is enabled on your volume.<br></div><div>Client-quorum option seems to be missing in your volume. Once you enable it, it is necessary for at least two bricks per replica set to be online for the writes to happen on the volume.<br></div><div>If two nodes are down, the volume becomes read-only, causing the VMs to possibly go into a paused state.<br></div><div><br></div><div>-Krutika<br></div><div><br></div><div><br></div><blockquote style="border-left:2px solid #1010FF;margin-left:5px;padding-left:5px;color:#000;font-weight:normal;font-style:normal;text-decoration:none;font-family:Helvetica,Arial,sans-serif;font-size:12pt;" data-mce-style="border-left: 2px solid #1010FF; margin-left: 5px; padding-left: 5px; color: #000; font-weight: normal; font-style: normal; text-decoration: none; font-family: Helvetica,Arial,sans-serif; font-size: 12pt;"><pre>

"stat-prefetch=off" is particularly important.

</pre><br><br><br><br><br><br>_______________________________________________<br>Gluster-users mailing list<br>Gluster-users@gluster.org<br>http://www.gluster.org/mailman/listinfo/gluster-users</blockquote><div><br></div></div></body></html>