]> arthur.barton.de Git - bup.git/commit
hashsplit.py is now much, much faster than before.
authorAvery Pennarun <apenwarr@gmail.com>
Wed, 30 Dec 2009 00:20:35 +0000 (19:20 -0500)
committerAvery Pennarun <apenwarr@gmail.com>
Wed, 30 Dec 2009 00:27:30 +0000 (19:27 -0500)
commit523df95d6fb025875a62923a058ba7d381ae7506
tree845da55ebb09ca164a7d4c4991fae611b28e49d6
parent63ee6272e456ced6a077319c46c8c4cef1d0a572
hashsplit.py is now much, much faster than before.

4.8 secs vs. 0.8 secs for testfile1.

Still vastly slower than the C version (0.17 secs including time to fork
git for each blob) but still a significant improvement.

The remaining slowness seems to be entirely from:

- running git hash-object (which we can avoid by hashing the object
  ourselves)

- running the rolling checksum algorithm (which we can speed up using a C
  module)

So it's looking good.
Makefile
hashsplit.c
hashsplit.py