<div dir="ltr"><div><div><div><div><div><div><div><div><div><div><div>Hi again,<br><br></div>As I mentioned earlier, I had to recreate my slave volume et restart the geo-replication again.<br><br></div>and as usual, the geo-replication went well at the beginning, but after restoring another container on MASTERS, we start getting these errors :<br><br></div>On Master:<br>[2015-05-26 11:56:04.858262] I [monitor(monitor):222:monitor] Monitor: starting gsyncd worker<br>[2015-05-26 11:56:04.966274] I [gsyncd(/mnt/brick2/brick):649:main_i] <top>: syncing: gluster://localhost:data2 -> ssh://root@gserver3:gluster://localhost:slavedata2<br>[2015-05-26 11:56:04.967361] I [changelogagent(agent):75:__init__] ChangelogAgent: Agent listining...<br>[2015-05-26 11:56:07.473591] I [master(/mnt/brick2/brick):83:gmaster_builder] <top>: setting up xsync change detection mode<br>[2015-05-26 11:56:07.474025] I [master(/mnt/brick2/brick):404:__init__] _GMaster: using 'rsync' as the sync engine<br>[2015-05-26 11:56:07.475222] I [master(/mnt/brick2/brick):83:gmaster_builder] <top>: setting up changelog change detection mode<br>[2015-05-26 11:56:07.475511] I [master(/mnt/brick2/brick):404:__init__] _GMaster: using 'rsync' as the sync engine<br>[2015-05-26 11:56:07.476761] I [master(/mnt/brick2/brick):83:gmaster_builder] <top>: setting up changeloghistory change detection mode<br>[2015-05-26 11:56:07.477065] I [master(/mnt/brick2/brick):404:__init__] _GMaster: using 'rsync' as the sync engine<br>[2015-05-26 11:56:09.528716] I [master(/mnt/brick2/brick):1197:register] _GMaster: xsync temp directory: /var/lib/misc/glusterfsd/data2/ssh%3A%2F%2Froot%4010.10.10.10%3Agluster%3A%2F%2F127.0.0.1%3Aslavedata2/e55761a256af4acfe9b4a419be62462a/xsync<br>[2015-05-26 11:56:09.529055] I [resource(/mnt/brick2/brick):1434:service_loop] GLUSTER: Register time: 1432637769<br>[2015-05-26 11:56:09.545244] I [master(/mnt/brick2/brick):519:crawlwrap] _GMaster: primary master with volume id 107c9baa-f734-4926-8e7e-c60e3107284f ...<br>[2015-05-26 11:56:09.567487] I [master(/mnt/brick2/brick):528:crawlwrap] _GMaster: crawl interval: 1 seconds<br>[2015-05-26 11:56:09.585380] I [master(/mnt/brick2/brick):1112:crawl] _GMaster: starting history crawl... turns: 1, stime: (1432580690, 0)<br>[2015-05-26 11:56:10.591133] I [master(/mnt/brick2/brick):1141:crawl] _GMaster: slave's time: (1432580690, 0)<br>[2015-05-26 11:56:16.564407] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/9f0887da-2243-470d-be92-49a6d85acf5d', 'stat': {'atime': 1432589079.955492, 'gid': 0, 'mtime': 1362693065.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.565541] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/1076aea5-6875-494f-a276-6268e443d86e', 'stat': {'atime': 1432589080.1354961, 'gid': 0, 'mtime': 1372762987.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.566585] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/2b449e9b-e9a7-4371-9e1b-de5d9e2407a0', 'stat': {'atime': 1432589080.0714946, 'gid': 0, 'mtime': 1372762987.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.567661] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/5c10f0cd-0ffa-41b6-b056-89d5f2ea7c9b', 'stat': {'atime': 1432589080.001493, 'gid': 0, 'mtime': 1372762987.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.568644] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/22b9e1b0-8f8e-4a17-a02f-e9f4a31e65b8', 'stat': {'atime': 1432589080.0674946, 'gid': 0, 'mtime': 1362693065.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.569616] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/0600d002-78dd-49e9-ab26-ee1f3ec81293', 'stat': {'atime': 1432589079.9294913, 'gid': 0, 'mtime': 1372762987.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.570667] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/8dd195ec-3698-45f6-82e4-2679a1731019', 'stat': {'atime': 1432589079.9764924, 'gid': 0, 'mtime': 1372762987.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.571583] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/13f2c030-7483-4924-bc0e-c12d97c65ed6', 'stat': {'atime': 1432589079.9794924, 'gid': 0, 'mtime': 1372762987.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.572529] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/6e23fedf-6b83-4f49-94f2-49d150dba857', 'stat': {'atime': 1432589080.0784948, 'gid': 0, 'mtime': 1362693065.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.573537] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/1b1695d7-0958-4db6-8dd8-917950fadd27', 'stat': {'atime': 1432589079.9414916, 'gid': 0, 'mtime': 1378284454.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.574553] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/c3795ae6-6e73-4b46-8aa2-fe296b927a42', 'stat': {'atime': 1432589080.0514941, 'gid': 0, 'mtime': 1362693065.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.575500] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/5544e740-fc67-42cd-9672-9d9fe2ad119f', 'stat': {'atime': 1432589080.0394938, 'gid': 0, 'mtime': 1372762987.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.576426] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/54d85a75-1d57-4a4c-b144-1aa70f52f88c', 'stat': {'atime': 1432589080.0164933, 'gid': 0, 'mtime': 1362693065.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.577302] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/46435d6d-02d1-40a4-8018-84d60f15c793', 'stat': {'atime': 1432589079.964492, 'gid': 0, 'mtime': 1372762987.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.578196] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/1b16ad0b-0107-48e7-adac-2ee450c11181', 'stat': {'atime': 1432589079.9734924, 'gid': 0, 'mtime': 1403054465.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.579090] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/15b8f710-1467-47f4-891c-911fe4a6f66e', 'stat': {'atime': 1432589080.1074955, 'gid': 0, 'mtime': 1362693065.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.579996] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/97f115e6-8403-491b-9ec6-bf8e645f69ec', 'stat': {'atime': 1432589079.9704924, 'gid': 0, 'mtime': 1372762987.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.580945] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/894d48f8-1977-4d44-9e3f-31711ddf2432', 'stat': {'atime': 1432589079.9274912, 'gid': 0, 'mtime': 1372762987.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.581921] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/6c6190db-d2ed-48d9-8904-4e555b6650ab', 'stat': {'atime': 1432589080.0134933, 'gid': 0, 'mtime': 1372762987.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.582889] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/d1597e70-cf34-4516-92f8-8fd5f05f59b5', 'stat': {'atime': 1432589080.1234958, 'gid': 0, 'mtime': 1372762987.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:16.583786] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/b107565a-f6a5-4eee-89a6-acf6715b1d18', 'stat': {'atime': 1432589079.9514918, 'gid': 0, 'mtime': 1372762987.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.42256] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/b51de310-36a9-4ad6-8595-f2a7e08610fb', 'stat': {'atime': 1432589161.3073761, 'gid': 0, 'mtime': 1372763052.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.42618] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/3adc75a4-d293-4311-8d6d-00113797bb91', 'stat': {'atime': 1432589161.2773755, 'gid': 0, 'mtime': 1372763050.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.42836] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/881cfdd0-7a68-4678-ab86-9b301425ba1f', 'stat': {'atime': 1432589161.217374, 'gid': 0, 'mtime': 1372763054.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.43070] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/33004685-604c-4049-8fbf-7a4226a0ff68', 'stat': {'atime': 1432589161.215374, 'gid': 0, 'mtime': 1368045650.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.43327] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/b4b08a51-48c7-47ea-b980-8da5b96599d2', 'stat': {'atime': 1432589161.1853733, 'gid': 0, 'mtime': 1368045650.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.43549] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/6126413e-1526-4e33-b5be-2556c8c6a8cf', 'stat': {'atime': 1432589161.2253742, 'gid': 0, 'mtime': 1372763054.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.43762] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/cccb2dda-b88c-4d5a-9d73-09a113a1d6e8', 'stat': {'atime': 1432589161.2923758, 'gid': 0, 'mtime': 1372763054.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.44001] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/854ddd73-a95c-4207-a40c-30b8df301940', 'stat': {'atime': 1432589161.2643752, 'gid': 0, 'mtime': 1403054465.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.44230] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/e111377d-4b65-42af-b10c-0db0a93077ca', 'stat': {'atime': 1432589161.261375, 'gid': 0, 'mtime': 1371576397.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.44464] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/9898a879-30aa-450c-ba50-b046a706e8b8', 'stat': {'atime': 1432589161.3673775, 'gid': 0, 'mtime': 1372763054.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.44673] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/8918c4c9-83de-4a57-b26a-3cca1ccc9ad2', 'stat': {'atime': 1432589161.3623774, 'gid': 0, 'mtime': 1372763051.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.44924] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/5b852034-822f-493d-b524-a08c1e93d095', 'stat': {'atime': 1432589161.2533748, 'gid': 0, 'mtime': 1371576397.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.45156] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/382f67fc-b737-40c9-bd5d-8a8d52e3dd13', 'stat': {'atime': 1432589161.299376, 'gid': 0, 'mtime': 1372763053.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.45367] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/b650307f-9500-4ed8-9e6c-016317fdf203', 'stat': {'atime': 1432589161.3713777, 'gid': 0, 'mtime': 1372763051.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.45598] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/7fef67e7-d558-44c0-9e33-16609fae88bc', 'stat': {'atime': 1432589161.1833732, 'gid': 0, 'mtime': 1372763051.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.45835] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/6d80a054-acc5-420b-a5e0-6b6c2166ac08', 'stat': {'atime': 1432589161.3303766, 'gid': 0, 'mtime': 1397764212.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.46082] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/dfda8d4e-dbf2-4c0a-ad3f-14a1923187fb', 'stat': {'atime': 1432589161.3653774, 'gid': 0, 'mtime': 1368045650.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.46308] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/213965ed-7e02-41aa-a827-0dad01b34a78', 'stat': {'atime': 1432589161.395378, 'gid': 0, 'mtime': 1371576397.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.46533] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/335a3c61-8792-44f3-bb84-a9e63bd50fe3', 'stat': {'atime': 1432589161.3103762, 'gid': 0, 'mtime': 1368045650.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.46752] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/5c40a3c9-687b-4651-ae5e-c8289531bf13', 'stat': {'atime': 1432589161.393378, 'gid': 0, 'mtime': 1379638431.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.46999] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/15711707-e31d-4c64-adb5-08504ae59a2b', 'stat': {'atime': 1432589161.172373, 'gid': 0, 'mtime': 1372763051.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.47262] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/c7c7514f-5fb8-4dc3-aff1-0c30d4815819', 'stat': {'atime': 1432589161.345377, 'gid': 0, 'mtime': 1372763051.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.47473] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/084f9c2d-40ca-4fe3-9e78-1bff2ecc7716', 'stat': {'atime': 1432589161.3593774, 'gid': 0, 'mtime': 1372763049.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.47693] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/9651d035-0260-41e2-97a2-9fb2b51ef0c9', 'stat': {'atime': 1432589161.3213766, 'gid': 0, 'mtime': 1368045650.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.47950] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/c6c49cdf-0b2b-4fa4-b2b9-398ebf3c589c', 'stat': {'atime': 1432589161.347377, 'gid': 0, 'mtime': 1372763053.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.48182] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/f6367f6f-3e19-4fa1-bbea-9a96c11d8bbc', 'stat': {'atime': 1432589161.1883733, 'gid': 0, 'mtime': 1372763053.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:19.48405] W [master(/mnt/brick2/brick):792:log_failures] _GMaster: META FAILED: ({'go': '.gfid/eeacb5af-8c99-4bfc-8495-9c84b119f9c7', 'stat': {'atime': 1432589161.2343745, 'gid': 0, 'mtime': 1412981693.0, 'mode': 41471, 'uid': 0}, 'op': 'META'}, 2)<br>[2015-05-26 11:56:20.410108] E [repce(/mnt/brick2/brick):207:__call__] RepceClient: call 8099:140141675022144:1432637780.1 (meta_ops) failed on peer with OSError<br>[2015-05-26 11:56:20.410460] E [syncdutils(/mnt/brick2/brick):276:log_raise_exception] <top>: FAIL:<br>Traceback (most recent call last):<br> File "/usr/libexec/glusterfs/python/syncdaemon/gsyncd.py", line 165, in main<br> main_i()<br> File "/usr/libexec/glusterfs/python/syncdaemon/gsyncd.py", line 659, in main_i<br> local.service_loop(*[r for r in [remote] if r])<br> File "/usr/libexec/glusterfs/python/syncdaemon/resource.py", line 1440, in service_loop<br> g3.crawlwrap(oneshot=True)<br> File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 580, in crawlwrap<br> self.crawl()<br> File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 1150, in crawl<br> self.changelogs_batch_process(changes)<br> File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 1059, in changelogs_batch_process<br> self.process(batch)<br> File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 946, in process<br> self.process_change(change, done, retry)<br> File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 920, in process_change<br> failures = self.slave.server.meta_ops(meta_entries)<br> File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 226, in __call__<br> return self.ins(self.meth, *a)<br> File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 208, in __call__<br> raise res<br>OSError: [Errno 95] Operation not supported: '.gfid/d7f761f2-1dc5-4aef-bf3f-29d5de823fb0'<br>[2015-05-26 11:56:20.412513] I [syncdutils(/mnt/brick2/brick):220:finalize] <top>: exiting.<br>[2015-05-26 11:56:20.419653] I [repce(agent):92:service_loop] RepceServer: terminating on reaching EOF.<br>[2015-05-26 11:56:20.420038] I [syncdutils(agent):220:finalize] <top>: exiting.<br>[2015-05-26 11:56:20.487646] I [monitor(monitor):282:monitor] Monitor: worker(/mnt/brick2/brick) died in startup phase<br><br><br><br></div>On slave:<br>[2015-05-26 11:56:05.336785] I [gsyncd(slave):649:main_i] <top>: syncing: gluster://localhost:slavedata2<br>[2015-05-26 11:56:06.371880] I [resource(slave):842:service_loop] GLUSTER: slave listening<br>[2015-05-26 11:56:20.386070] E [repce(slave):117:worker] <top>: call failed:<br>Traceback (most recent call last):<br> File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 113, in worker<br> res = getattr(self.obj, rmeth)(*in_data[2:])<br> File "/usr/libexec/glusterfs/python/syncdaemon/resource.py", line 745, in meta_ops<br> [ENOENT], [ESTALE, EINVAL])<br> File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 475, in errno_wrap<br> return call(*arg)<br>OSError: [Errno 95] Operation not supported: '.gfid/d7f761f2-1dc5-4aef-bf3f-29d5de823fb0'<br>[2015-05-26 11:56:20.397442] I [repce(slave):92:service_loop] RepceServer: terminating on reaching EOF.<br>[2015-05-26 11:56:20.397603] I [syncdutils(slave):220:finalize] <top>: exiting.<br>[2015-05-26 11:56:30.827872] I [repce(slave):92:service_loop] RepceServer: terminating on reaching EOF.<br>[2015-05-26 11:56:31.25315] I [syncdutils(slave):220:finalize] <top>: exiting.<br><br><br></div>the state of the replication is Active<br></div>I searched about synchronization incomplete and I found this <a href="http://www.gluster.org/community/documentation/index.php/Gluster_3.2:_Troubleshooting_Geo-replication">http://www.gluster.org/community/documentation/index.php/Gluster_3.2:_Troubleshooting_Geo-replication</a><br><br>Synchronization is not complete<br><br>Description: GlusterFS Geo-replication did not synchronize the data completely but still the geo-replication status display OK.<br><br>Solution: You can enforce a full sync of the data by erasing the index and restarting GlusterFS Geo-replication. After restarting, GlusterFS Geo-replication begins synchronizing all the data, that is, all files will be compared with by means of being checksummed, which can be a lengthy /resource high utilization operation, mainly on large data sets (however, actual data loss will not occur). If the error situation persists, contact Gluster Support.<br><br>For more information about erasing index, see Tuning Volume Options. <br><br></div>But there no mention about how to erase the index, the only option I found is : geo-replication.indexing<br></div>is that it?<br><br></div>if yes, after disabling it, will the geo-replication verify all files on slave?<br></div>when do I have to re-enable it again?<br><br></div>thanks<br></div><div class="gmail_extra"><br><div class="gmail_quote">2015-05-25 13:25 GMT+01:00 wodel youchi <span dir="ltr"><<a href="mailto:wodel.youchi@gmail.com" target="_blank">wodel.youchi@gmail.com</a>></span>:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div><div><div><div><div><div>Hi, and thanks for your replies.<br><br></div>For Kotresh : No, I am not using tar ssh for my geo-replication.<br><br></div>For Aravinda: I had to recreate my slave volume all over et restart the geo-replication.<br><br></div>If I have thousands of files with this problem, do I have to execute the fix for all of them? is there an easy way?<br></div>Can checkpoints help me in this situation?<br></div><div>and more important, what can cause this problem?<br><br></div><div>I am syncing containers, they contain lot of files small files, using tar ssh, would it be more suitable?<br></div><div><br></div><br>PS: I tried to execute this command on the Master<br></div><pre><span style="font-family:arial,helvetica,sans-serif">bash generate-gfid-file.sh localhost:data2 $PWD/get-gfid.sh /tmp/master_gfid_file.txt<br><br></span></pre><pre><span style="font-family:arial,helvetica,sans-serif">but I got errors with files that have blank (space) in their names, for example: Admin Guide.pdf<br></span></pre><pre><span style="font-family:arial,helvetica,sans-serif">the script sees two files Admin and Guide.pdf, then the get-gfid.sh returns errors "no such file or directory"<br><br></span></pre><pre><span style="font-family:arial,helvetica,sans-serif">thanks.</span><br></pre></div><div class="HOEnZb"><div class="h5"><div class="gmail_extra"><br><div class="gmail_quote">2015-05-25 7:00 GMT+01:00 Aravinda <span dir="ltr"><<a href="mailto:avishwan@redhat.com" target="_blank">avishwan@redhat.com</a>></span>:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Looks like this is GFID conflict issue not the tarssh issue.<span><br>
<br>
_GMaster: ENTRY FAILED: ({'uid': 0, 'gfid':<br>
'e529a399-756d-4cb1-9779-0af2822a0d94', 'gid': 0, 'mode': 33152, 'entry':<br>
'.gfid/874799ef-df75-437b-bc8f-3fcd58b54789/main.mdb', 'op': 'CREATE'}, 2)<br>
<br></span>
Data: {'uid': 0,<span><br>
'gfid': 'e529a399-756d-4cb1-9779-0af2822a0d94',<br>
'gid': 0,<br>
'mode': 33152,<br>
'entry': '.gfid/874799ef-df75-437b-bc8f-3fcd58b54789/main.mdb',<br>
'op': 'CREATE'}<br>
<br></span>
and Error: 2<br>
<br>
During creation of "main.mdb" RPC failed with error number 2, ie, ENOENT. This error comes when parent directory not exists or exists with different GFID.<br>
In this case Parent GFID "874799ef-df75-437b-bc8f-3fcd58b54789" does not exists on slave.<br>
<br>
<br>
To fix the issue,<br>
-----------------<br>
Find the parent directory of "main.mdb",<br>
Get the GFID of that directory, using getfattr<br>
Check the GFID of the same directory in Slave(To confirm GFIDs are different)<br>
To fix the issue, Delete that directory in Slave.<br>
Set virtual xattr for that directory and all the files inside that directory.<br>
setfattr -n glusterfs.geo-rep.trigger-sync -v "1" <DIR><br>
setfattr -n glusterfs.geo-rep.trigger-sync -v "1" <file-path><br>
<br>
<br>
Geo-rep will recreate the directory with Proper GFID and starts sync.<br>
<br>
Let us know if you need any help.<br>
<br>
--<br>
regards<span><font color="#888888"><br>
Aravinda</font></span><div><div><br>
<br>
<br>
<br>
On 05/25/2015 10:54 AM, Kotresh Hiremath Ravishankar wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Hi Wodel,<br>
<br>
Is the sync mode, tar over ssh (i.e., config use_tarssh is true) ?<br>
If yes, there is known issue with it and patch is already up in master.<br>
<br>
But it can be resolved in either of the two ways.<br>
<br>
1. If sync mode required is tar over ssh, just disable sync_xattrs which is true<br>
by default.<br>
<br>
gluster vol geo-rep <master-vol> <slave-host>::<slave-vol> config sync_xattrs false<br>
<br>
2. If sync mode is ok to be changed to rsync. Please do.<br>
gluster vol geo-rep <master-vol> <slave-host>::<slave-vol> use_tarssh false<br>
<br>
NOTE: rsync supports syncing of acls and xattrs where as tar over ssh does not.<br>
In 3.7.0-2, tar over ssh should be used with sync_xattrs to false<br>
<br>
Hope this helps.<br>
<br>
Thanks and Regards,<br>
Kotresh H R<br>
<br>
----- Original Message -----<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
From: "wodel youchi" <<a href="mailto:wodel.youchi@gmail.com" target="_blank">wodel.youchi@gmail.com</a>><br>
To: "gluster-users" <<a href="mailto:gluster-users@gluster.org" target="_blank">gluster-users@gluster.org</a>><br>
Sent: Sunday, May 24, 2015 3:31:38 AM<br>
Subject: [Gluster-users] [Centos7x64] Geo-replication problem glusterfs 3.7.0-2<br>
<br>
Hi,<br>
<br>
I have two gluster servers in replicated mode as MASTERS<br>
and one server for replicated geo-replication.<br>
<br>
I've updated my glusterfs installation to 3.7.0-2, all three servers<br>
<br>
I've recreated my slave volumes<br>
I've started the geo-replication, it worked for a while and now I have some<br>
problmes<br>
<br>
1- Files/directories are not deleted on slave<br>
2- New files/rectories are not synced to the slave.<br>
<br>
I have these lines on the active master<br>
<br>
[2015-05-23 06:21:17.156939] W [master(/mnt/brick2/brick):792:log_failures]<br>
_GMaster: ENTRY FAILED: ({'uid': 0, 'gfid':<br>
'e529a399-756d-4cb1-9779-0af2822a0d94', 'gid': 0, 'mode': 33152, 'entry':<br>
'.gfid/874799ef-df75-437b-bc8f-3fcd58b54789/main.mdb', 'op': 'CREATE'}, 2)<br>
[2015-05-23 06:21:17.158066] W [master(/mnt/brick2/brick):792:log_failures]<br>
_GMaster: ENTRY FAILED: ({'uid': 0, 'gfid':<br>
'b4bffa4c-2e88-4b60-9f6a-c665c4d9f7ed', 'gid': 0, 'mode': 33152, 'entry':<br>
'.gfid/874799ef-df75-437b-bc8f-3fcd58b54789/main.hdb', 'op': 'CREATE'}, 2)<br>
[2015-05-23 06:21:17.159154] W [master(/mnt/brick2/brick):792:log_failures]<br>
_GMaster: ENTRY FAILED: ({'uid': 0, 'gfid':<br>
'9920cdee-6b87-4408-834b-4389f5d451fe', 'gid': 0, 'mode': 33152, 'entry':<br>
'.gfid/874799ef-df75-437b-bc8f-3fcd58b54789/main.db', 'op': 'CREATE'}, 2)<br>
[2015-05-23 06:21:17.160242] W [master(/mnt/brick2/brick):792:log_failures]<br>
_GMaster: ENTRY FAILED: ({'uid': 0, 'gfid':<br>
'307756d2-d924-456f-b090-10d3ff9caccb', 'gid': 0, 'mode': 33152, 'entry':<br>
'.gfid/874799ef-df75-437b-bc8f-3fcd58b54789/main.ndb', 'op': 'CREATE'}, 2)<br>
[2015-05-23 06:21:17.161283] W [master(/mnt/brick2/brick):792:log_failures]<br>
_GMaster: ENTRY FAILED: ({'uid': 0, 'gfid':<br>
'69ebb4cb-1157-434b-a6e9-386bea81fc1d', 'gid': 0, 'mode': 33152, 'entry':<br>
'.gfid/874799ef-df75-437b-bc8f-3fcd58b54789/COPYING', 'op': 'CREATE'}, 2)<br>
[2015-05-23 06:21:17.162368] W [master(/mnt/brick2/brick):792:log_failures]<br>
_GMaster: ENTRY FAILED: ({'uid': 0, 'gfid':<br>
'7d132fda-fc82-4ad8-8b6c-66009999650c', 'gid': 0, 'mode': 33152, 'entry':<br>
'.gfid/f6f2582e-0c5c-4cba-943a-6d5f64baf340/daily.cld', 'op': 'CREATE'}, 2)<br>
[2015-05-23 06:21:17.163718] W [master(/mnt/brick2/brick):792:log_failures]<br>
_GMaster: ENTRY FAILED: ({'uid': 0, 'gfid':<br>
'd8a0303e-ba45-4e45-a8fd-17994c34687b', 'gid': 0, 'mode': 16832, 'entry':<br>
'.gfid/f6f2582e-0c5c-4cba-943a-6d5f64baf340/clamav-54acc14b44e696e1cfb4a75ecc395fe0',<br>
'op': 'MKDIR'}, 2)<br>
[2015-05-23 06:21:17.165102] W [master(/mnt/brick2/brick):792:log_failures]<br>
_GMaster: ENTRY FAILED: ({'uid': 0, 'gfid':<br>
'49d42bf6-3146-42bd-bc29-e704927d6133', 'gid': 0, 'mode': 16832, 'entry':<br>
'.gfid/f6f2582e-0c5c-4cba-943a-6d5f64baf340/clamav-debec3aa6afe64bffaee8d099e76f3d4',<br>
'op': 'MKDIR'}, 2)<br>
[2015-05-23 06:21:17.166147] W [master(/mnt/brick2/brick):792:log_failures]<br>
_GMaster: ENTRY FAILED: ({'uid': 0, 'gfid':<br>
'1ddb93ae-3717-4347-910f-607afa67cdb0', 'gid': 0, 'mode': 33152, 'entry':<br>
'.gfid/49d42bf6-3146-42bd-bc29-e704927d6133/clamav-704a1e9a3e2c97ccac127632d7c6b8e4',<br>
'op': 'CREATE'}, 2)<br>
<br>
<br>
in the slave lot of lines like this<br>
<br>
[2015-05-22 07:53:57.071999] W [fuse-bridge.c:1970:fuse_create_cbk]<br>
0-glusterfs-fuse: 25833: /.gfid/03a5a40b-c521-47ac-a4e3-916a6df42689 => -1<br>
(Operation not permitted)<br>
<br>
<br>
in the active master I have 3.7 GB of XSYNC-CHANGELOG.xxxxxxx files in<br>
/var/lib/misc/glusterfsd/data2/ssh%3A%2F%2Froot%4010.10.10.10%3Agluster%3A%2F%2F127.0.0.1%3Aslavedata2/e55761a256af4acfe9b4a419be62462a/xsync<br>
<br>
I don't know if this is normal.<br>
<br>
any idea?<br>
<br>
<br>
_______________________________________________<br>
Gluster-users mailing list<br>
<a href="mailto:Gluster-users@gluster.org" target="_blank">Gluster-users@gluster.org</a><br>
<a href="http://www.gluster.org/mailman/listinfo/gluster-users" target="_blank">http://www.gluster.org/mailman/listinfo/gluster-users</a><br>
</blockquote>
_______________________________________________<br>
Gluster-users mailing list<br>
<a href="mailto:Gluster-users@gluster.org" target="_blank">Gluster-users@gluster.org</a><br>
<a href="http://www.gluster.org/mailman/listinfo/gluster-users" target="_blank">http://www.gluster.org/mailman/listinfo/gluster-users</a><br>
</blockquote>
<br>
</div></div></blockquote></div><br></div>
</div></div></blockquote></div><br></div>