Have you ever wondered how utilities like Beyond Compare or diff compare files? They do it (I guess) by solving the longest common subsequence (LCS) problem.
After reading the Wikipedia article linked above, I had an overall view of the problem and of the possible solutions. So I decided to implement a Delphi class that does the string comparison trick, which is the basis for text file comparison.
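To make the LCS problem a bit more concrete, here is a minimal sketch of the textbook dynamic-programming table that computes the length of the longest common subsequence of two strings. It is not the TStringComparer implementation; the function name LcsLength is hypothetical, and the code assumes 1-based string indexing (the desktop Delphi default).

```pascal
program LcsSketch;

{$APPTYPE CONSOLE}
{$IFDEF FPC}{$MODE DELPHI}{$ENDIF} // only matters when building with Free Pascal

// Hypothetical helper: length of the longest common subsequence of A and B,
// computed with the classic O(Length(A) * Length(B)) table.
function LcsLength(const A, B: string): Integer;
var
  Table: array of array of Integer;
  I, J: Integer;
begin
  // SetLength zero-fills the table; row 0 and column 0 represent the
  // empty prefix and therefore stay at zero.
  SetLength(Table, Length(A) + 1, Length(B) + 1);
  for I := 1 to Length(A) do
    for J := 1 to Length(B) do
      if A[I] = B[J] then
        // Matching characters extend the LCS of the two shorter prefixes.
        Table[I, J] := Table[I - 1, J - 1] + 1
      else if Table[I - 1, J] >= Table[I, J - 1] then
        Table[I, J] := Table[I - 1, J]
      else
        Table[I, J] := Table[I, J - 1];
  Result := Table[Length(A), Length(B)];
end;

begin
  Writeln(LcsLength('ABCBDAB', 'BDCABA')); // prints 4 (one LCS is BCBA)
end.
```

Walking the same table backwards from the bottom-right corner is the usual way to recover which characters are common, added, or removed.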
Let me put it as follows: given two strings to be compared, I want to highlight in blue the characters added to the first string and in red the characters removed from it. The common (unchanged) characters will keep the default color.
For example:
String 1 = Delphi allows both structural and object oriented programming.
String 2 = Does Delphi allow object oriented programming?
Highlighted differences:
Does Delphi allows both structural and object oriented programming.?
The Delphi class looks like this:
type
  TDiff = record
    Character: Char;
    CharStatus: Char; //Possible values: [+, -, =]
  end;

  TStringComparer = class
  ……………
  public
    class function Compare(aString1, aString2: string): TList<TDiff>;
  end;
When you call TStringComparer.Compare, it creates and returns a generic list of TDiff records, which the caller must free. Each TDiff record holds a character and its status: added (CharStatus = '+'), removed (CharStatus = '-'), or unchanged, meaning present in both strings (CharStatus = '=').
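As a rough illustration of how such a list can be consumed outside a form, the sketch below renders a list of TDiff records as plain text, wrapping added characters in {+ } and removed ones in {- }. The helpers MakeDiff and DiffToText are hypothetical, and the list is built by hand as a stand-in for what Compare might return for 'cats' and 'cart'. The unit is named Generics.Collections here; newer Delphi versions call it System.Generics.Collections.

```pascal
program DiffListSketch;

{$APPTYPE CONSOLE}
{$IFDEF FPC}{$MODE DELPHI}{$ENDIF} // only matters when building with Free Pascal

uses
  Generics.Collections;

type
  TDiff = record
    Character: Char;
    CharStatus: Char; // '+', '-' or '='
  end;

// Hypothetical helper: builds one TDiff value.
function MakeDiff(aCharacter, aStatus: Char): TDiff;
begin
  Result.Character := aCharacter;
  Result.CharStatus := aStatus;
end;

// Hypothetical helper: renders the list as text, marking added and
// removed characters and leaving unchanged ones as they are.
function DiffToText(aDifferences: TList<TDiff>): string;
var
  Diff: TDiff;
begin
  Result := '';
  for Diff in aDifferences do
    case Diff.CharStatus of
      '+': Result := Result + '{+' + Diff.Character + '}';
      '-': Result := Result + '{-' + Diff.Character + '}';
    else
      Result := Result + Diff.Character;
    end;
end;

var
  Differences: TList<TDiff>;
begin
  // Hand-built stand-in for a Compare result; not produced by TStringComparer.
  Differences := TList<TDiff>.Create;
  try
    Differences.Add(MakeDiff('c', '='));
    Differences.Add(MakeDiff('a', '='));
    Differences.Add(MakeDiff('r', '+'));
    Differences.Add(MakeDiff('t', '='));
    Differences.Add(MakeDiff('s', '-'));
    Writeln(DiffToText(Differences)); // prints ca{+r}t{-s}
  finally
    Differences.Free;
  end;
end.
```

A text rendering like this is handy for logging or unit tests, where a rich edit control is not available.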
Let’s drop two edits (Edit1, Edit2), a rich edit (RichEdit1) and a button (Button1) on a Delphi form. To highlight the differences, put the following code in the button's OnClick event:
procedure TForm1.Button1Click(Sender: TObject);
var
  Differences: TList<TDiff>;
  Diff: TDiff;
begin
  //Yes, I know...this method could be refactored ;-)
  Differences := TStringComparer.Compare(Edit1.Text, Edit2.Text);
  try
    RichEdit1.Clear;
    RichEdit1.SelStart := RichEdit1.GetTextLen;
    for Diff in Differences do
      if Diff.CharStatus = '+' then
      begin
        RichEdit1.SelAttributes.Color := clBlue;
        RichEdit1.SelText := Diff.Character;
      end
      else if Diff.CharStatus = '-' then
      begin
        RichEdit1.SelAttributes.Color := clRed;
        RichEdit1.SelText := Diff.Character;
      end
      else
      begin
        RichEdit1.SelAttributes.Color := clDefault;
        RichEdit1.SelText := Diff.Character;
      end;
  finally
    Differences.Free;
  end;
end;
The result looks like the image below:
For the full implementation read further down. Note that various optimizations could be added to the code below, but I didn’t implement them. Anyway, I hope this helps. Feedback is welcome! Feel free to find and correct bugs ;-)