Commit
color-words: change algorithm to allow for 0-character word boundaries
Up until now, the color-words code assumed that word boundaries are identical to white space characters.

Therefore, it could get away with a very simple scheme: it copied the hunks, substituted newlines for each white space character, called libxdiff with the processed text, and then identified the text to output by the offsets (which agreed since the original text had the same length).

This code was ugly, for a number of reasons:

- it was impossible to introduce 0-character word boundaries,
- we had to print everything word by word, and
- the code needed extra special handling of newlines in the removed part.

Fix all of these issues by processing the text such that

- we build word lists, separated by newlines,
- we remember the original offsets for every word, and
- after calling libxdiff on the wordlists, we parse the hunk headers, and find the corresponding offsets, and then
- we print the removed/added parts in one go.

The pre and post samples in the test were provided by Santi Béjar.

Note that there is some strange special handling of hunk headers where one line range is 0 due to POSIX: in this case, the start is one too low. In other words a hunk header '@@ -1,0 +2 @@' actually means that the line must be added after the _second_ line of the pre text, _not_ the first.

Signed-off-by: Johannes Schindelin <johannes.schindelin@gmx.de>
Signed-off-by: Junio C Hamano <gitster@pobox.com>
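The steps described in the commit message can be tried outside of git. What follows is a minimal sketch in POSIX shell and awk, not the C code this commit touches. The helper names `word_offsets`, `parse_hunk_header` and `effective_start` are hypothetical, whitespace is assumed to be the only word boundary, and offsets are 0-based character positions with an exclusive end.

```sh
#!/bin/sh
# Sketch only: hypothetical helpers, not the implementation from this commit.

# word_offsets: read text on stdin and print one word per line as
# "BEGIN END WORD". BEGIN and END are 0-based offsets into the original
# text (END exclusive), so the original bytes can be recovered later.
word_offsets () {
	awk '
	{
		line = $0
		pos = 0
		while (match(line, /[^ \t]+/)) {
			begin = base + pos + RSTART - 1
			printf "%d %d %s\n", begin, begin + RLENGTH, substr(line, RSTART, RLENGTH)
			pos += RSTART - 1 + RLENGTH
			line = substr(line, RSTART + RLENGTH)
		}
		# advance past this line and its trailing newline
		base += length($0) + 1
	}'
}

# parse_hunk_header "@@ -A[,B] +C[,D] @@": print "A B C D".
# A length that is omitted in the header means 1.
parse_hunk_header () {
	set -- $1
	minus=${2#-}
	plus=${3#+}
	case $minus in *,*) ;; *) minus=$minus,1 ;; esac
	case $plus in *,*) ;; *) plus=$plus,1 ;; esac
	echo "${minus%,*} ${minus#*,} ${plus%,*} ${plus#*,}"
}

# effective_start FIRST LEN: for an empty range the reported start is
# one too low (see the note in the commit message), so add one.
effective_start () {
	if [ "$2" -eq 0 ]; then
		echo $(($1 + 1))
	else
		echo "$1"
	fi
}

printf 'a = b\nh(4)\n' | word_offsets
# prints:
# 0 1 a
# 2 3 =
# 4 5 b
# 6 10 h(4)

parse_hunk_header '@@ -1,0 +2 @@'
# prints: 1 0 2 1
```

Keeping the begin and end offset next to each word is what allows a whole removed or added run to be printed in one go: the run is the original text from the first word's begin offset to the last word's end offset, including whatever whitespace lies in between.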
Johannes Schindelin authored and Junio C Hamano committed Jan 17, 2009
1 parent 23c1575, commit 2e5d200
Showing 2 changed files with 157 additions and 66 deletions.
@@ -0,0 +1,66 @@
#!/bin/sh

test_description='word diff colors'

. ./test-lib.sh

test_expect_success setup '
	git config diff.color.old red
	git config diff.color.new green
'

decrypt_color () {
	sed \
		-e 's/.\[1m/<WHITE>/g' \
		-e 's/.\[31m/<RED>/g' \
		-e 's/.\[32m/<GREEN>/g' \
		-e 's/.\[36m/<BROWN>/g' \
		-e 's/.\[m/<RESET>/g'
}

word_diff () {
	test_must_fail git diff --no-index "$@" pre post > output &&
	decrypt_color < output > output.decrypted &&
	test_cmp expect output.decrypted
}

cat > pre <<\EOF
h(4)
a = b + c
EOF

cat > post <<\EOF
h(4),hh[44]
a = b + c
aa = a
aeff = aeff * ( aaa )
EOF

cat > expect <<\EOF
<WHITE>diff --git a/pre b/post<RESET>
<WHITE>index 330b04f..5ed8eff 100644<RESET>
<WHITE>--- a/pre<RESET>
<WHITE>+++ b/post<RESET>
<BROWN>@@ -1,3 +1,7 @@<RESET>
<RED>h(4)<RESET><GREEN>h(4),hh[44]<RESET>
<RESET>
a = b + c<RESET>
<GREEN>aa = a<RESET>
<GREEN>aeff = aeff * ( aaa )<RESET>
EOF

test_expect_success 'word diff with runs of whitespace' '
	word_diff --color-words
'

test_done
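
The `decrypt_color` helper in the test script turns terminal escape sequences into readable tags so that the expected output can be written as plain text. As a small standalone sketch, the function can be copied into a shell and fed a hand-written line; the sample input here is made up for illustration and is not taken from the test. The leading `.` in each pattern matches the ESC byte, and `<BROWN>` is the test's own label for color code 36, which terminals usually render as cyan.

```sh
#!/bin/sh
# Copy of the helper from the test script, for experimenting on its own.
decrypt_color () {
	sed \
		-e 's/.\[1m/<WHITE>/g' \
		-e 's/.\[31m/<RED>/g' \
		-e 's/.\[32m/<GREEN>/g' \
		-e 's/.\[36m/<BROWN>/g' \
		-e 's/.\[m/<RESET>/g'
}

# Made-up sample: "old" in red followed by "new" in green.
# \033 is the octal escape for the ESC byte.
printf '\033[31mold\033[m\033[32mnew\033[m\n' | decrypt_color
# prints: <RED>old<RESET><GREEN>new<RESET>
```

Comparing the tagged text with `test_cmp` keeps raw escape bytes out of the `expect` file, which makes a failing comparison much easier to read.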