arthur.barton.de Git - bup.git/log

]> arthur.barton.de Git - bup.git/log

projects / bup.git / log

commit | commitdiff | tree

Zoran Zaric [Tue, 10 May 2011 00:59:54 +0000 (02:59 +0200)]

Add documentation for compression levels

Signed-off-by: Zoran Zaric <zz@zoranzaric.de>

commit | commitdiff | tree

Zoran Zaric [Tue, 10 May 2011 00:59:53 +0000 (02:59 +0200)]

Add test case for compression level

A save run with 0 and 9 are made and the size of the resuling repos is
compared.

Signed-off-by: Zoran Zaric <zz@zoranzaric.de>

commit | commitdiff | tree

Zoran Zaric [Tue, 10 May 2011 00:59:52 +0000 (02:59 +0200)]

Add compression level options to bup save and bup split

For now 0 and 9 should suffice. 1 still is the default.

Signed-off-by: Zoran Zaric <zz@zoranzaric.de>

commit | commitdiff | tree

Zoran Zaric [Tue, 10 May 2011 00:59:51 +0000 (02:59 +0200)]

Make zlib compression level a parameter for Client

Signed-off-by: Zoran Zaric <zz@zoranzaric.de>

commit | commitdiff | tree

Zoran Zaric [Tue, 10 May 2011 00:59:50 +0000 (02:59 +0200)]

Make zlib compression level a parameter of git.PackWriter

Signed-off-by: Zoran Zaric <zz@zoranzaric.de>

commit | commitdiff | tree

Aneurin Price [Fri, 13 May 2011 13:19:43 +0000 (14:19 +0100)]

Use is_superuser() rather than checking euid directly

Replacing calls to geteuid() to use is_superuser() fixes metadata test
failures on Cygwin when running as an Administrator.

Signed-off-by: Aneurin Price <aneurin.price@gmail.com>

commit | commitdiff | tree

Aneurin Price [Fri, 13 May 2011 13:19:42 +0000 (14:19 +0100)]

Add is_superuser() helper function

This checks for an effective UID of 0 on most platforms, or the result
of the IsUserAnAdmin shell32 function on Cygwin.

Signed-off-by: Aneurin Price <aneurin.price@gmail.com>

commit | commitdiff | tree

mik [Sun, 15 May 2011 19:23:31 +0000 (15:23 -0400)]

Makefile: add a PREFIX variable for locations other than /usr.

make install PREFIX=/usr/local

Signed-off-by: Michael Barabanov <michael.barabanov@gmail.com>

commit | commitdiff | tree

Shane Geiger [Mon, 9 May 2011 01:30:55 +0000 (21:30 -0400)]

fixed cp problem in bup/t/test-meta.sh

commit | commitdiff | tree

Zoran Zaric [Sun, 8 May 2011 22:53:47 +0000 (00:53 +0200)]

Remove a file created in a index test

Signed-off-by: Zoran Zaric <zz@zoranzaric.de>

commit | commitdiff | tree

Avery Pennarun [Sun, 8 May 2011 19:09:04 +0000 (19:09 +0000)]

Earlier "negative timestamp" patch had a 64-bit timestamp in the test.

The date in the comment is correct - for -0x80000000.  Sadly, the *code*
actually said -0x90000000.  That works on 64-bit systems (and filesystems)
not not 32-bit ones, where python gives an encoding error.

In any case, based on the comment (June 10, 1893) it seems tat -0x80000000
must have been the intended value anyway.  Now 'make test' passes on 32-bit
Linux again.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Zoran Zaric [Sun, 8 May 2011 13:41:40 +0000 (15:41 +0200)]

Prefix paths in grafting- and stripping-tests

On systems with symlinks in the used paths the tests failed. This adds
a prefix to the used paths, that is not likely to exist.

Signed-off-by: Zoran Zaric <zz@zoranzaric.de>

commit | commitdiff | tree

Avery Pennarun [Sun, 8 May 2011 07:13:48 +0000 (03:13 -0400)]

Merge branch 'master' of git://github.com/thatch/bup

* 'master' of git://github.com/thatch/bup:
  Missing space in optspec
  Fix a bug where marginally old files couldn't be stored in the index
  Show better errors with out-of-range Entry values

commit | commitdiff | tree

Gabriel Filion [Mon, 2 May 2011 23:12:34 +0000 (19:12 -0400)]

Doc: add some precisions for --remote and dumb mode

The -r/--remote argument to some of bup's commands currently doesn't
give enough information about how to customize options to SSH. Let's add
information about this so that users know how to customize options for
SSH connections.

Also, in bup-server's documentation, point out which mode is the default
one for more clarity.

Signed-off-by: Gabriel Filion <lelutin@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 8 May 2011 05:31:59 +0000 (01:31 -0400)]

DESIGN: mention bloom filters.

Jeff Anderson-Lee discovered the missing information and posted to the
mailing list. Gabriel Filion reminded me to actually update the docs :)

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Neil E. Hodges [Sat, 30 Apr 2011 20:15:36 +0000 (13:15 -0700)]

Fix compile error using GCC 4.6.

commit | commitdiff | tree

Tim Hatch [Sat, 16 Apr 2011 00:36:09 +0000 (17:36 -0700)]

Missing space in optspec

Signed-off-by: Tim Hatch <tim@timhatch.com>

commit | commitdiff | tree

Tim Hatch [Sat, 16 Apr 2011 00:18:51 +0000 (17:18 -0700)]

Fix a bug where marginally old files couldn't be stored in the index

Due to the struct having unsigned timestamps, files with dates between Dec 13,
1901 and Jan 1, 1970 were not representable. This change extends the struct to
be able to pack signed timestamps, which was the spirit of code in _fixup, and
extends the useful range back to 1901. Timestamps prior to 1901 are still
adjusted to zero, as they were before.

There should be no compatibility problems loading packed structures created
before this change, since positive values were truncated at 0x7fffffff.

Signed-off-by: Tim Hatch <tim@timhatch.com>

commit | commitdiff | tree

Tim Hatch [Sat, 16 Apr 2011 00:14:53 +0000 (17:14 -0700)]

Show better errors with out-of-range Entry values

When values can't be encoded in a struct, it's useful to actually give the values.

Signed-off-by: Tim Hatch <tim@timhatch.com>

commit | commitdiff | tree

Dickon Reed [Thu, 24 Mar 2011 09:24:01 +0000 (09:24 +0000)]

Don't compile bup_get_Linux_file_attr on systems without FS_IOC_GETFLAGS.

commit | commitdiff | tree

Rob Browning [Mon, 21 Mar 2011 01:35:53 +0000 (20:35 -0500)]

tgit.py: provoke ENOTDIR rather than EACCES in test_check_repo_or_die().

Replace the objects/pack directory with an empty file to provoke an
ENOTDIR error from stat('objects/pack/.').

Previously the code changed the permissions of the test directory to
0000 in order to provoke an error other than ENOENT (i.e. EACCES), but
that doesn't work when the tests are run as root or fakeroot.

(As Gabriel Filion pointed out, the chmod of the testdir is no
longer necessary, so I removed it and squashed that into this patch.
-- apenwarr)

Signed-off-by: Rob Browning <rlb@defaultvalue.org>
Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Rob Browning [Mon, 21 Mar 2011 00:41:49 +0000 (19:41 -0500)]

Add initial .dir-locals.el to set emacs sh/python/c mode defaults.

Signed-off-by: Rob Browning <rlb@defaultvalue.org>

commit | commitdiff | tree

Rob Browning [Mon, 21 Mar 2011 03:04:13 +0000 (22:04 -0500)]

test-meta.sh: use cp -a instead of make install to create the test tree.

Signed-off-by: Rob Browning <rlb@defaultvalue.org>

commit | commitdiff | tree

Rob Browning [Mon, 21 Mar 2011 00:41:47 +0000 (19:41 -0500)]

test-meta.sh: change indentation to 4 spaces and fix if/for formatting.

Signed-off-by: Rob Browning <rlb@defaultvalue.org>

commit | commitdiff | tree

Rob Browning [Mon, 21 Mar 2011 00:41:46 +0000 (19:41 -0500)]

test-meta.sh: use $foo rather than ${foo} where possible.

Signed-off-by: Rob Browning <rlb@defaultvalue.org>

commit | commitdiff | tree

Rob Browning [Mon, 21 Mar 2011 00:41:45 +0000 (19:41 -0500)]

tmetadata.py: call helpers.unlink() rather than unlink().

Signed-off-by: Rob Browning <rlb@defaultvalue.org>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Mar 2011 10:53:42 +0000 (03:53 -0700)]

cmd/xstat: don't report mtime/atime for symlinks if we don't have_ns_timestamps.

We can't set the atime/mtime on a symlink anyway if we don't
have_ns_timestamps, which means the values are meaningless. Report them as
0 in order to avoid triggering a unit test failure.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Mar 2011 10:46:32 +0000 (03:46 -0700)]

test-meta.sh: remove a bashism, and don't delete dirs on exit.

It's really annoying to have it wiping out directories that you want to
examine after a failed test. And "set -o pipefail" is not available in the
version of bash on MacOS 10.4.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Mar 2011 10:45:32 +0000 (03:45 -0700)]

metadata.py: be careful with the umask() when restoring symlinks.

On MacOS, the umask affects symlink permissions, although not in any sort of
useful way that I can see. Still, getting the permissions wrong breaks the
unit tests, so let's be careful about it.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Mar 2011 09:57:31 +0000 (02:57 -0700)]

Get rid of some python syntax not compatible with python 2.4.

Mostly this is 'x if b else y' crap, which I hate anyway.

This makes 'make test' *almost* pass on my MacOS 10.4 system (with upgrade
to python 2.4 from fink).

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Mar 2011 09:36:28 +0000 (02:36 -0700)]

xstat: FSTime.from_stat_time() was completely wrong.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Mar 2011 09:19:08 +0000 (02:19 -0700)]

tmetadata: the "non existent group name" test didn't make any sense.

There's certainly no reason to expect the file's uid/gid would have changed
after a call that's supposed to fail.  It was passing by pure luck on Linux,
which doesn't have a sticky gid bit causing the newly created file to have
a gid != os.getgid().  But on MacOS, the file was originally created with a
gid != os.getgid(), and so restoring its numeric id restored that, and then
the test failed.

The test is still kind of pointless; it doesn't actually test anything
useful, like (for example) automatic fallback to restoring by numeric gid if
the groupname can't be found.  In fact, looking at the code, it doesn't seem
like it *would* fall back, which is a bug.  But I'm not going to fix that
right now.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Mar 2011 09:18:38 +0000 (02:18 -0700)]

metadata.py: don't mknod regular files.

This fails on MacOS. Just use open() instead.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Mar 2011 09:07:03 +0000 (02:07 -0700)]

A bunch of IOError->OSError conversions.

Some of our replacement functions were throwing IOError when the function
they replaced would throw OSError. This was particularly noticeable with
utime() on MacOS, since it caused a unit test failure.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Mar 2011 08:45:44 +0000 (01:45 -0700)]

tmetadata.py: don't call subprocesses for trivial operations.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Mar 2011 08:38:07 +0000 (01:38 -0700)]

metadata: don't die if Linux attr (not xattr) support is missing.

We don't need an import warning for this one, since linuxattr support is
always available on linux, and never available elsewhere, since it's in
_helpers.c and there are no special python modules to install.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Mar 2011 08:30:20 +0000 (01:30 -0700)]

metadata: ignore old incompatible python-xattr module; we need python-pyxattr.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Mar 2011 08:25:13 +0000 (01:25 -0700)]

metadata: recover politely if xattr/acl support is missing.

...previously we'd just crash, which is definitely not polite.

metadata.py now prints warning on import if these features are missing.
That's probably overly obnoxious, especially on systems that don't support
those types of metadata at all. Is there a way to determine whether a
kernel *should* support that type of metadata, so we can warn only if so?

(Obviously if the kernel doesn't support, say, xattrs, there's no point
warning that bup doesn't support them, because no files will be using them
anyway. Hmm...)

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Mar 2011 08:23:43 +0000 (01:23 -0700)]

metadata/xstat: fix a bunch of CodingStyle issues.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Mar 2011 07:55:01 +0000 (00:55 -0700)]

Merge branch 'maint'

* maint:
Makefile: Fix 'clean' rule

Conflicts:
Makefile

commit | commitdiff | tree

Rob Browning [Mon, 14 Mar 2011 01:37:56 +0000 (20:37 -0500)]

Don't accidentally pass atime/ctime/mtime through from_stat_time() twice.

Don't accidentally pass atime/ctime/mtime through
FSTime.from_stat_time() twice in the xstat stat_result.from_stat_rep()
static method when _have_ns_fs_timestamps is false.

Signed-off-by: Rob Browning <rlb@defaultvalue.org>

commit | commitdiff | tree

Gabriel Filion [Fri, 11 Mar 2011 01:01:30 +0000 (20:01 -0500)]

Makefile: Fix 'clean' rule

In commit 1df0bdd1ad937, I introduced a problem in the make file: the
chmod operation that gives back some permissions on
lib/bup/t/pybuptest.tmp dies if this directory does not exist.

pybuptest.tmp is only created when running the tests.

when the chmod dies, the clean rule stops, thus not completing the
cleanup, so we must make sure this operation is not fatal if the
directory doesn't exist.

Signed-off-by: Gabriel Filion <lelutin@gmail.com>

commit | commitdiff | tree

Dickon Reed [Fri, 18 Mar 2011 19:25:38 +0000 (12:25 -0700)]

Allow chown to uid:0 to succeed.

The test case assumed that it was not possible to set uid:0 on a file,
which is to say that the current user is not a member of group
0. That's an environmental assumption which is not universal (I am a
counterexample).

Modified lib/bup/t/tmetadata.py

commit | commitdiff | tree

Avery Pennarun [Fri, 11 Mar 2011 04:03:06 +0000 (20:03 -0800)]

Fix more gcc type punning errors introduced by the metadata changes.

Whatever, let's use 0 and 1 instead of Py_True and Py_False.

_helpers.c: In function ‘bup_set_linux_file_attr’:
_helpers.c:683: warning: dereferencing type-punned pointer will break
strict-aliasing rules
_helpers.c:683: warning: dereferencing type-punned pointer will break
strict-aliasing rules
_helpers.c: In function ‘bup_utimensat’:
_helpers.c:758: warning: dereferencing type-punned pointer will break
strict-aliasing rules
_helpers.c:758: warning: dereferencing type-punned pointer will break
strict-aliasing rules
_helpers.c: In function ‘init_helpers’:
_helpers.c:940: warning: dereferencing type-punned pointer will break
strict-aliasing rules
_helpers.c:941: warning: dereferencing type-punned pointer will break
strict-aliasing rules
error: command 'gcc' failed with exit status 1

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Fri, 11 Mar 2011 04:00:48 +0000 (20:00 -0800)]

Merge branch 'maint'

* maint:
_helpers.c: fix a "type punning" warning from gcc.
Add a test for previous octal/string filemode fix.

commit | commitdiff | tree

Avery Pennarun [Fri, 11 Mar 2011 03:58:58 +0000 (19:58 -0800)]

_helpers.c: fix a "type punning" warning from gcc.

_helpers.c: In function ‘bloom_contains’:
_helpers.c:260: warning: dereferencing type-punned pointer will break strict-aliasing rules

Whatever, let's use 1 instead of the apparently problematic Py_True.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Thu, 10 Mar 2011 22:51:49 +0000 (14:51 -0800)]

Add a test for previous octal/string filemode fix.

Apparently nothing was testing symlink behaviour; add a basic test for
symlink save/restore.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Thu, 10 Mar 2011 20:57:41 +0000 (12:57 -0800)]

Merge branch 'maint'

* maint:
  Use debug1() when reporting paths skipped
  Save was using a quoted instead of octal gitmode.
  Verify permissions in check_repo_or_die()

Conflicts:
Makefile
lib/bup/git.py

commit | commitdiff | tree

Aneurin Price [Wed, 9 Mar 2011 15:36:48 +0000 (15:36 +0000)]

Use debug1() when reporting paths skipped

Skipping paths during indexing is a normal event not indicative of any
problems, so need not be reported in normal operation.

Signed-off-by: Aneurin Price <aneurin.price@gmail.com>

commit | commitdiff | tree

Brandon Low [Mon, 7 Mar 2011 19:17:40 +0000 (11:17 -0800)]

Save was using a quoted instead of octal gitmode.

This bugged in an assert on python 2.7 for me, and I believe was
incorrect but functional behavior.

Signed-off-by: Brandon Low <lostlogic@lostlogicx.com>
Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Gabriel Filion [Thu, 10 Mar 2011 20:41:54 +0000 (12:41 -0800)]

Verify permissions in check_repo_or_die()

Currently, if one doesn't have read or access permission up to
repo('objects/pack'), bup exits with the following error:

error: repo() is not a bup/git repository

(with repo() replaced with the actual path).

This is misleading, since there is possibly really a repository there
but the user can't access it.

Make git.check_repo_or_die() verify that the current user has the
permission to access repo('objects/pack'), and if not, output a
meaningful error message.

As a bonus, we get an error if the bup_dir path is not a directory.

Signed-off-by: Gabriel Filion <lelutin@gmail.com>

commit | commitdiff | tree

Eduardo Bacchi Kienetz [Thu, 10 Mar 2011 18:35:54 +0000 (10:35 -0800)]

t/tmetadata.py: fix a typo st_uid -> st_gid.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Zoran Zaric [Wed, 2 Mar 2011 16:39:33 +0000 (17:39 +0100)]

Add python-pyxattr as a dependency to the README

Signed-off-by: Zoran Zaric <zz@zoranzaric.de>
Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Aneurin Price [Wed, 9 Mar 2011 15:36:48 +0000 (15:36 +0000)]

commit | commitdiff | tree

Brandon Low [Mon, 7 Mar 2011 19:17:40 +0000 (11:17 -0800)]

commit | commitdiff | tree

Gabriel Filion [Mon, 21 Feb 2011 16:14:38 +0000 (11:14 -0500)]

commit | commitdiff | tree

Rob Browning [Tue, 1 Mar 2011 03:00:44 +0000 (21:00 -0600)]

Add "import math" to xstat.py.

commit | commitdiff | tree

Avery Pennarun [Mon, 28 Feb 2011 09:47:15 +0000 (01:47 -0800)]

Merge branch 'rlb/meta'

* rlb/meta:
  t/test-meta.sh: replace 'diff -u5' with 'diff -U5'.
  Don't touch correct target xattrs; remove inappropriate target xattrs.
  Rename test-fs.img to testfs.img and add it to the clean target.
  t/test-meta.sh: detect and handle fakeroot.
  Add atime tests and fix atime capture in metadata.from_path().
  Improve test-meta.sh status messages.
  Handle missing files more gracefully in "bup xstat".
  Add initial (trivial) root-only ACL metadata tests and fix exposed bugs.
  Add initial (trivial) root-only metadata tests for attr and xattr.
  Don't specify 0700 permissions when creating initial directories.
  Fix "meta extract -v" directory output.
  Fix _apply_common_rec() symlink chown/chmod guard.
  Change os.geteuid to os.geteuid() in tmetadata.py.
  Remove redundant call to get_linux_file_attr() in _add_linux_attr().
  In _add_linux_attr(), catch IOError, not EnvironmentError; handle ENOTTY.
  Improve some metadata error messages.
  Only print secs for bup xstat times when ns == 0.
  Use oct() rather than hex() when printing mode from bup xstat.
  Remove bup: prefix from metadata error messages.
  Don't "chmod 000" paths during restore.
  Remove MetadataError and make apply error handling finer-grained.
  Remove MetadataAcquireError and make error handling finer-grained
  Accommodate missing owner or group name during metadata save/restore.
  Preserve existing stack trace when throwing MetadataErrors.
  Add (private for now) "bup xstat" command and use it in the metadata tests.
  Also check defined(_ATFILE_SOURCE) in utimensat() guard.
  Rename bup-meta.1.md to bup-meta.md.
  Simplify FSTime() - always use an integer ns internal representation.
  Rename metadata exceptions and add a parent MetadataError class.
  Don't use str(e) when instantiating Metadata exceptions.
  Fix typos in Metadata._encode_linux_xattr().
  Fix handling of conditional definitions in xstat.
  Always define _have_ns_fs_timestamps (True or False).
  Change "bup meta" to use recursive_dirlist() to add support for --xdev.
  Fix minor bug in "bup meta -t" argument handling (if -> elif).
  Modify drecurse.py and index.py to use xstat functions.
  Move stat-related work to bup.xstat; add xstat.stat.
  Add helpers.fstat and _helpers._have_ns_fs_timestamps.
  Add a helpers.FSTime class to handle filesystem timestamps and use it.
  Attempt to unlink or rmdir existing paths as appropriate during restore.
  Conditionalize build/use of get_linux_file_attr and set_linux_file_attr.
  Check stat() after attempted restore of nonexistent owner/group in tests.
  Don't try to restore owner unless root; handle nonexistent owner/group.
  Add metadata test_restore_restricted_user_group().
  Add helpers.detect_fakeroot() and use it in relevant metadata tests.
  Defer metadata aquisition and application errors during create/extract.
  Rename py_* functions to bup_* in lib/bup/_helpers.c.
  Don't allow negative ns in metadata timestamps; normalize on read/write.
  Add (sec, ns) timestamps and extended stat, lstat, utime, and lutime.
  Add vint tests and signed vint support via write_vint and read_vint.
  Change user to the more accurate owner in metadata.py.
  Correctly respect restore_numeric_ids in Metadata _apply_common_rec().
  Send bup meta --list output to stdout, not stderr.
  Fix bup-meta.1 start-extract/finish-extract example.
  Use Py_RETURN_TRUE in py_lutimes() and py_set_linux_file_attr().
  t/test.sh: fix whitespace problems with the 'Inode:' line from 'stat'.
  t/test.sh: fix occasional atime-related failure in metadata tests.
  t/test.sh: refactoring to reduce duplicated code.
  Add initial support for metadata archives.

commit | commitdiff | tree

Avery Pennarun [Mon, 28 Feb 2011 09:33:09 +0000 (01:33 -0800)]

Merge branch 'master' into meta

* master:
  midx/bloom: use progress() and debug1() for non-critical messages
  helpers: separately determine if stdout and stderr are ttys.
  cmd/newliner: restrict progress lines to the screen width.
  hashsplit: use shorter offset-filenames inside trees.
  Replace 040000 and 0100644 constants with GIT_MODE_{TREE,FILE}
  git.py: rename treeparse to tree_decode() and add tree_encode().
  hashsplit.py: remove PackWriter-specific knowledge.
  cmd/split: fixup progress message, and print -b output incrementally.
  hashsplit.py: convert from 'bits' to 'level' earlier in the sequence.
  hashsplit.py: okay, *really* fix BLOB_MAX.
  hashsplit.py: simplify code and fix BLOB_MAX handling.
  options.py: o.fatal(): print error after, not before, usage message.
  options.py: make --usage just print the usage message.

Conflicts:
lib/bup/_helpers.c

commit | commitdiff | tree

Avery Pennarun [Mon, 28 Feb 2011 09:29:40 +0000 (01:29 -0800)]

t/test-meta.sh: replace 'diff -u5' with 'diff -U5'.

The old-style option is obsolete.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Rob Browning [Sun, 27 Feb 2011 22:15:24 +0000 (16:15 -0600)]

Don't touch correct target xattrs; remove inappropriate target xattrs.

When applying xattr metadata, make sure to remove any target xattrs
that aren't in the metadata record, but don't touch target xattrs that
already match the metadata record. Add corresponding tests.

Throw an ApplyError() if xattr set() or remove() return EPERM from
within _apply_linux_xattr_rec().

Remove lib/bup/t/testfs and lib/bup/t/testfs.img in the clean target.

commit | commitdiff | tree

Rob Browning [Sun, 27 Feb 2011 22:01:18 +0000 (16:01 -0600)]

Rename test-fs.img to testfs.img and add it to the clean target.

commit | commitdiff | tree

Avery Pennarun [Sat, 26 Feb 2011 09:00:51 +0000 (01:00 -0800)]

Merge commit '6f02181'

* commit '6f02181':
  helpers: separately determine if stdout and stderr are ttys.
  cmd/newliner: restrict progress lines to the screen width.
  hashsplit: use shorter offset-filenames inside trees.
  Replace 040000 and 0100644 constants with GIT_MODE_{TREE,FILE}
  git.py: rename treeparse to tree_decode() and add tree_encode().
  hashsplit.py: remove PackWriter-specific knowledge.
  cmd/split: fixup progress message, and print -b output incrementally.
  hashsplit.py: convert from 'bits' to 'level' earlier in the sequence.
  hashsplit.py: okay, *really* fix BLOB_MAX.
  hashsplit.py: simplify code and fix BLOB_MAX handling.
  options.py: o.fatal(): print error after, not before, usage message.
  options.py: make --usage just print the usage message.

commit | commitdiff | tree

Gabriel Filion [Fri, 25 Feb 2011 16:16:05 +0000 (11:16 -0500)]

midx/bloom: use progress() and debug1() for non-critical messages

Some messages in these two commands indicate progress but are not
filtered out when the command is not run under a tty. This makes bup
return some unwanted messages when run under cron.

Using progress() and debug1() instead should fix that.

(Changed a few from progress() to debug1() by apenwarr.)

Signed-off-by: Gabriel Filion <lelutin@gmail.com>
Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Feb 2011 05:21:45 +0000 (21:21 -0800)]

helpers: separately determine if stdout and stderr are ttys.

Previously we only cared if stderr was a tty (since we use that to determine
if we should print progress() or not). But we might want to check stdout as
well, for the same reason that gzip does: we should be refusing to write
binary data to a terminal.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Feb 2011 04:48:15 +0000 (20:48 -0800)]

cmd/newliner: restrict progress lines to the screen width.

Otherwise \r won't work as expected.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Feb 2011 02:48:06 +0000 (18:48 -0800)]

hashsplit: use shorter offset-filenames inside trees.

We previously zero-padded all the filenames (which are hexified versions of
the file offsets) to 16 characters, which corresponds to a maximum file size
that fits into a 64-bit integer. I realized that there's no reason to
use a fixed padding length; just pad all the entries in a particular tree to
the length of the longest entry (to ensure that sorting
alphabetically is still equivalent to sorting numerically).

This saves a small amount of space in each tree, which is probably
irrelevant given that gzip compression can quite easily compress extra
zeroes. But it also makes browsing the tree in git look a little prettier.

This is backwards compatible with old versions of vfs.py, since vfs.py has
always just treated the numbers as an ordered set of numbers, and doesn't
care how much zero padding they have.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Feb 2011 02:02:12 +0000 (18:02 -0800)]

Replace 040000 and 0100644 constants with GIT_MODE_{TREE,FILE}

Those constants were scattered in *way* too many places. While we're there,
fix the inconsistent usage of strings vs. ints when specifying the file
mode; there's no good reason to be passing strings around (except that I
foolishly did that in the original code in version 0.01).

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Feb 2011 01:57:48 +0000 (17:57 -0800)]

git.py: rename treeparse to tree_decode() and add tree_encode().

tree_encode() gets most of its functionality from PackWriter.new_tree(),
which is not just a one liner that calls tree_encode(). We will soon want
to be able to calculate tree hashes without actually writing a tree to a
packfile, so let's split out that functionality.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Thu, 17 Feb 2011 12:22:50 +0000 (04:22 -0800)]

hashsplit.py: remove PackWriter-specific knowledge.

Let's use callback functions explicitly instead of passing around special
objects; that makes the dependencies a bit more clear and hopefully opens
the way to some more refactoring for clarity.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Thu, 17 Feb 2011 11:10:23 +0000 (03:10 -0800)]

cmd/split: fixup progress message, and print -b output incrementally.

As a side effect, you can no longer combine -b with -t, -c, or -n. But that
was kind of a pointless thing to do anyway, because it silently enforced
--fanout=0, which is almost certainly not what you wanted, precisely if you
were using -t, -c, or -n.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Thu, 17 Feb 2011 10:30:47 +0000 (02:30 -0800)]

hashsplit.py: convert from 'bits' to 'level' earlier in the sequence.

The hierarchy level is a more directly useful measurement than the bit count,
although right now neither is used very heavily.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Feb 2011 04:33:36 +0000 (20:33 -0800)]

hashsplit.py: okay, *really* fix BLOB_MAX.

In some conditions, we were still splitting into blobs larger than BLOB_MAX.
Fix that too.

Unfortunately adding an assertion about it in the 'bup split' main loop
slows things down by a measurable amount, so I can't easily add that to
prevent this from happening by accidenta again in the future.

After implementing this, it looks like 8192 (typical blob size) times two
isn't big enough to prevent this from kicking in in "normal" cases; let's
use 4x instead. In my test file, we exceed this maximum much less. (Every
time we exceed BLOB_MAX, it means the bupsplit algorithm isn't working, so
we won't be deduplicating as effectively. So we want that to be rare.)

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Thu, 17 Feb 2011 09:56:31 +0000 (01:56 -0800)]

hashsplit.py: simplify code and fix BLOB_MAX handling.

This reduces the number of lines without removing functionality.  I renamed
a few constants to make more sense.

The only functional change is that BLOB_MAX is now an actual maximum instead
of a variable number depending on buf.used().  Previously, it might have
been as large as BLOB_READ_SIZE = 1MB, which is much larger than BLOB_MAX =
16k.  Now BLOB_MAX is actually the max.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Feb 2011 05:37:16 +0000 (21:37 -0800)]

options.py: o.fatal(): print error after, not before, usage message.

git prints the error *before* the usage message, but the more I play with
it, the more I'm annoyed by that behaviour. The usage message can be pretty
long, and the error gots lost way above the usage message. The most
important thing *is* the error, so let's print it last.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sun, 20 Feb 2011 05:34:51 +0000 (21:34 -0800)]

options.py: make --usage just print the usage message.

This is a relatively common option in other programs, so let's make it work
in case someone tries to use it.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Sat, 19 Feb 2011 23:34:45 +0000 (15:34 -0800)]

Merge remote branch 'origin/master' into meta

* origin/master:
  doc/import-rsnapshot: small corrections and clarification
  cmd/midx, git.py: all else being equal, delete older midxes first.
  t/test.sh: a test for the recently-uncovered midx4 problem.
  _helpers.c: midx4 didn't handle multiple index with the same object.
  cmd/midx: add a --check option.
  Add git.shorten_hash(), printing only the first few bytes of a sha1.
  tclient.py: add some additional tests that objcache.refresh() is called.
  cmd/server: add a debug message saying which object caused a suggestion.
  cmd/list-idx: a quick tool for searching the contents of idx/midx files.
  Add tests around the bloom ruin and check options
  Add a bloom --ruin for testing failure cases
  One more constant for header lengths
  Split PackMidx from git.py into a new midx.py.
  bloom.py: move bloom.ShaBloom.create to just bloom.create.
  Move bloom-related stuff from git.py to a new bloom.py.
  cmd/bloom: add a --force option to forget regenerating the bloom.
  Use the new qprogress() function in more places.
  Bail out immediately instead of redownloading .idx
  Add a --check behavior to verify bloom
  Defines/preprocessor lengths > magic numbers
  cmd/{bloom,midx}: clean up progress messages.
  cmd/bloom: by default generate bloom filters in *all* index-cache dirs.
  cmd/newliner: avoid printing a blank line if the final output ended in \r.
  cmd/index: make the default mode '-u'.
  _helpers.c: don't cygwin doesn't set any win32 defines.
  _helpers.c: don'g unpythonize_argv() on win32.
  Remove .c and .o rules, apply flags to csetup.py
  Fix a valid warning that some compilers warned
  Move .idx file writing to C
  main.py: fix whitespace in the usage string.
  cmd/daemon: FD_CLOEXEC the listener socket and don't leak fd for the connection.
  cmd/daemon: close file descriptors correctly in parent process.
  cmd/daemon: use SO_REUSEADDR.
  cmd/daemon: pass extra options along to 'bup server'.
  cmd/daemon: correctly report socket binding/listening errors.
  main.py: use execvp() instead of subprocess.Popen() when possible.
  _helpers.c: Remove ugly 'python' junk from argv[0] so 'ps' is prettier.
  cmd/bloom: fix a message pluralization.
  cmd/join: add a new -o (output filename) option.
  cmd/ls: fix a typo causing 'bup ls foo/latest' to not work.
  cmd/server: add a new 'help' command.
  midx4: Fix the other side of my previous nasty bug

Conflicts:
lib/bup/_helpers.c
lib/bup/helpers.py

commit | commitdiff | tree

Gabriel Filion [Fri, 18 Feb 2011 19:15:41 +0000 (14:15 -0500)]

doc/import-rsnapshot: small corrections and clarification

There's a typo in the --dry-run option explanation.

The form "[...] or only imports all [...]" is confusing. Turn it around
a little bit so that the quantifiers are associated more easily to the
right portions of the sentence.

Also, add an example for using the backuptarget argument.

Signed-off-by: Gabriel Filion <lelutin@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Fri, 18 Feb 2011 09:15:47 +0000 (01:15 -0800)]

cmd/midx, git.py: all else being equal, delete older midxes first.

Previous runs of 'bup midx -f' might have created invalid midx files with
exactly the same length as a newer run. bup's "prune redundant midx" logic
would quasi-randomly choose one or the other to delete (based on
alphabetical order of filenames, basically) and sometimes that would be the
new one, not the old one, so the 'bup midx -f' results never actually kicked
in.

Now if the file sizes are equal we'll use the mtime as a tie breaker; newer
is better.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Fri, 18 Feb 2011 08:17:17 +0000 (00:17 -0800)]

t/test.sh: a test for the recently-uncovered midx4 problem.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Fri, 18 Feb 2011 08:44:32 +0000 (00:44 -0800)]

_helpers.c: midx4 didn't handle multiple index with the same object.

It *tried* to handle it, but would end up with a bunch of zero entries at
the end, which prevents .exists() from working correctly in some cases.

In midx2, it made sense to never include the same entry twice, because the
only informatin we had about a particular entry was that it existed. In
midx4 this is no longer true; we might sometimes want to know *all* the idx
files that contain a particular object (for example, when we implement
expiry later). So the easiest fix for this bug is to just include multiple
entries when we have them.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Fri, 18 Feb 2011 08:11:59 +0000 (00:11 -0800)]

cmd/midx: add a --check option.

Running this on my system does reveal that some objects return
exists()==False on my midx even though they show up during iteration.

Now to actually find and fix it...

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Fri, 18 Feb 2011 07:36:49 +0000 (23:36 -0800)]

Add git.shorten_hash(), printing only the first few bytes of a sha1.

The full name is rarely needed and clutters the output. Let's try this
instead in a few places.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Fri, 18 Feb 2011 05:35:39 +0000 (21:35 -0800)]

tclient.py: add some additional tests that objcache.refresh() is called.

...which it is, so no bugs were fixed here. Aneurin is sitll exposing a bug
somehow though.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Thu, 17 Feb 2011 12:54:13 +0000 (04:54 -0800)]

cmd/server: add a debug message saying which object caused a suggestion.

Let's use this to try to debug Aneurin's problem (and potentially others).

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Thu, 17 Feb 2011 12:50:04 +0000 (04:50 -0800)]

cmd/list-idx: a quick tool for searching the contents of idx/midx files.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Brandon Low [Thu, 17 Feb 2011 04:09:47 +0000 (20:09 -0800)]

Add tests around the bloom ruin and check options

This generally improves our test coverage of bloom filter behavior and
more specifically makes sure that check and ruin do something.

Signed-off-by: Brandon Low <lostlogic@lostlogicx.com>

commit | commitdiff | tree

Brandon Low [Thu, 17 Feb 2011 06:41:33 +0000 (22:41 -0800)]

Add a bloom --ruin for testing failure cases

This command option ruins a bloom filter by setting all of its bits to
zero.

Signed-off-by: Brandon Low <lostlogic@lostlogicx.com>

commit | commitdiff | tree

Brandon Low [Thu, 17 Feb 2011 04:09:45 +0000 (20:09 -0800)]

One more constant for header lengths

I missed bloom header length in the last pass.

Signed-off-by: Brandon Low <lostlogic@lostlogicx.com>

commit | commitdiff | tree

Avery Pennarun [Thu, 17 Feb 2011 02:55:41 +0000 (18:55 -0800)]

Split PackMidx from git.py into a new midx.py.

git.py is definitely too big. It still is, but this helps a bit.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Thu, 17 Feb 2011 02:39:35 +0000 (18:39 -0800)]

bloom.py: move bloom.ShaBloom.create to just bloom.create.

I don't really like class-level functions. Ideally we'd just move all the
creation stuff into cmd/bloom, but tbloom.py is testing them, so it's not
really worth it.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Thu, 17 Feb 2011 02:34:36 +0000 (18:34 -0800)]

Merge branch 'bl/bloomcheck' into ap/cleanups

* bl/bloomcheck:
  Bail out immediately instead of redownloading .idx
  Add a --check behavior to verify bloom
  Defines/preprocessor lengths > magic numbers

Conflicts:
cmd/bloom-cmd.py

commit | commitdiff | tree

Avery Pennarun [Thu, 17 Feb 2011 02:05:18 +0000 (18:05 -0800)]

Move bloom-related stuff from git.py to a new bloom.py.

No other functionality changes other than that cmd/memtest now reports the
number of bloom steps separately from the midx/idx steps. (This is mostly
so they don't have to share the same global variables, but it's also
interesting information to break out.)

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Thu, 17 Feb 2011 01:32:51 +0000 (17:32 -0800)]

cmd/bloom: add a --force option to forget regenerating the bloom.

This corresponds to midx's --force option.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Thu, 17 Feb 2011 00:11:26 +0000 (16:11 -0800)]

Use the new qprogress() function in more places.

qprogress() was introduced in the last commit and has smarter default
behaviour that automatically reduces progress output so we don't print too
many messages per second. Various commands/etc were doing this in various
different ad-hoc ways, but let's centralize it all in one place.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Brandon Low [Thu, 17 Feb 2011 01:46:01 +0000 (17:46 -0800)]

Bail out immediately instead of redownloading .idx

This should make diagnosing / fixing corrupted bloom filters and midx
files easier, and is generally more sane behavior.

Signed-off-by: Brandon Low <lostlogic@lostlogicx.com>

commit | commitdiff | tree

Brandon Low [Thu, 17 Feb 2011 01:46:02 +0000 (17:46 -0800)]

Add a --check behavior to verify bloom

This new behavior is useful when diagnosing weird behavior, lets a bloom
filter claiming to contain a particular idx be verified against that idx
file.

Signed-off-by: Brandon Low <lostlogic@lostlogicx.com>

commit | commitdiff | tree

Brandon Low [Thu, 17 Feb 2011 01:46:00 +0000 (17:46 -0800)]

Defines/preprocessor lengths > magic numbers

This just changes some instances of "8", "12" and "20" to use the
equivalent sizeof or #defined constants to make the code more readable.

Signed-off-by: Brandon Low <lostlogic@lostlogicx.com>

commit | commitdiff | tree

Avery Pennarun [Thu, 17 Feb 2011 00:01:30 +0000 (16:01 -0800)]

cmd/{bloom,midx}: clean up progress messages.

bloom was printing messages more often than necessary on fast computers,
which could overwhelm the stderr output a bit. Also change to a percentage
+ number of objects, like midx and save do, rather than just printing the
current file number.

And don't print so many lines of output by default: now if bloom isn't
end up doing anything, it doesn't print any output. And if it does do
something, it prints only one output line per file.

bloom and midx now both print the name of the directory where they're
creating their output files; if you have multiple directories in
.bup/index-cache, it was a little confusing to see them doing
multiple runs for no apparent reason.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

commit | commitdiff | tree

Avery Pennarun [Wed, 16 Feb 2011 23:58:14 +0000 (15:58 -0800)]

cmd/bloom: by default generate bloom filters in *all* index-cache dirs.

This matches with 'bup midx -a' and 'bup midx -f' behaviour. People might
have been thinking they were regenerating bloom filters without actually
doing them all.

Moved some shared code to do this from cmd/midx to git.py.

Signed-off-by: Avery Pennarun <apenwarr@gmail.com>

Alex' bup ("It backs things up") development repository

RSS Atom