[Bug 12498] New: --fuzzy --fuzzy hugely impacts performance even if its' not needed

classic Classic list List threaded Threaded
2 messages Options
Reply | Threaded
Open this post in threaded view
|

[Bug 12498] New: --fuzzy --fuzzy hugely impacts performance even if its' not needed

samba-bugs
https://bugzilla.samba.org/show_bug.cgi?id=12498

            Bug ID: 12498
           Summary: --fuzzy --fuzzy hugely impacts performance even if
                    its' not needed
           Product: rsync
           Version: 3.1.2
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: P5
         Component: core
          Assignee: [hidden email]
          Reporter: [hidden email]
        QA Contact: [hidden email]

Hello,

I make backups to a Rsync daemon.
Each backup is made to an empty directory, using 20 --link-dest parameters, so
that already uploaded files are hard-linked.
Perfect.

I made some tests adding --fuzzy --fuzzy so that non-existing files can use a
base-file, when possible.

It works perfectly, really good thing to find a base-file within link-dest
directories.

However, it really slows down the backup process, even when it's not needed.
For example, it terribly impacts performance even for a backup with no new
files at all, for which all files will be hard-linked.

Here are some backup durations for a backup with no new files :
- without -yy                 :  9 seconds
- with -y                     :  9 seconds
- with --y and  1 --link-dest : 11 seconds
- with --y and  8 --link-dest : 22 seconds
- with --y and 20 --link-dest : 37 seconds

My question is then, could it be possible for --fuzzy --fuzzy (-yy) to only
work for files which really need to be uploaded ?
So to avoid working for files which will simply be hard-linked.

Would such a modification only impact daemon side ? (would be great)

Thank you very much !

Best regards,

Ben

--
You are receiving this mail because:
You are the QA Contact for the bug.

--
Please use reply-all for most replies to avoid omitting the mailing list.
To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync
Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html
Reply | Threaded
Open this post in threaded view
|

[Bug 12498] --fuzzy --fuzzy hugely impacts performance even if its' not needed

Samba - rsync mailing list
https://bugzilla.samba.org/show_bug.cgi?id=12498

--- Comment #1 from Ben RUBSON <[hidden email]> ---
Created attachment 13748
  --> https://bugzilla.samba.org/attachment.cgi?id=13748&action=edit
Do fuzzy only when needed

Here's a patch for this issue. Thx !

--
You are receiving this mail because:
You are the QA Contact for the bug.

--
Please use reply-all for most replies to avoid omitting the mailing list.
To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync
Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html