Data Record Matching

Pete Rowley (
Mon, 20 Apr 1998 20:39:56 +0200 (MET DST)

I have a problem which I suspect may be a good candidate for an AI
algorithm. I define good as something which would produce better (more
accurate) results than the same time spent coding a brute force
algorithm in C. The problem is this: I have two separate databases
which contain records relating to people. Some of those people will
have records in both systems, some in only one. I want to automatically
match up those records which relate to the same person. Here's the
catch: I can assume no coordination between those systems. Names may be
entered differently (e.g. Jim instead of James), the number and type of
the fields in the records may be very different, and there may be more
than one person with the same name.
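
For reference, a minimal sketch of the brute-force baseline for this
kind of matching might look like the following. It is written in Python
for brevity rather than C. The nickname table, the field names and the
0.85 threshold are all assumptions made up for illustration. It uses
plain string similarity from the standard library, not an AI technique.

```python
from difflib import SequenceMatcher

# Hypothetical nickname table; a real one would be far larger.
NICKNAMES = {"jim": "james", "jimmy": "james", "bob": "robert", "bill": "william"}


def normalize_name(name):
    """Lower-case a name and map known nicknames to a canonical form."""
    return " ".join(NICKNAMES.get(part, part) for part in name.lower().split())


def similarity(a, b):
    """Return a similarity ratio between 0.0 and 1.0 for two strings."""
    return SequenceMatcher(None, a, b).ratio()


def match_score(rec_a, rec_b, field_pairs):
    """Average the similarity over the fields present in both records.

    field_pairs is a list of (field_in_a, field_in_b, is_name) tuples that
    says which differently named fields are assumed to correspond.
    """
    scores = []
    for field_a, field_b, is_name in field_pairs:
        value_a, value_b = rec_a.get(field_a), rec_b.get(field_b)
        if not value_a or not value_b:
            continue  # skip fields missing from either record
        if is_name:
            value_a, value_b = normalize_name(value_a), normalize_name(value_b)
        else:
            value_a, value_b = value_a.lower(), value_b.lower()
        scores.append(similarity(value_a, value_b))
    return sum(scores) / len(scores) if scores else 0.0


def best_matches(db_a, db_b, field_pairs, threshold=0.85):
    """Compare every record in db_a with every record in db_b (brute force)
    and keep the best candidate for each db_a record if it clears the
    threshold. Returns a list of (index_a, index_b, score) tuples."""
    matches = []
    for i, rec_a in enumerate(db_a):
        scored = [(match_score(rec_a, rec_b, field_pairs), j)
                  for j, rec_b in enumerate(db_b)]
        if scored:
            score, j = max(scored)
            if score >= threshold:
                matches.append((i, j, score))
    return matches


# Illustrative records with different field names in each system.
db_a = [{"name": "Jim Smith", "city": "Boston"},
        {"name": "Alice Jones", "city": "Leeds"}]
db_b = [{"full_name": "James Smith", "town": "Boston"},
        {"full_name": "Robert Brown", "town": "Paris"}]
field_pairs = [("name", "full_name", True), ("city", "town", False)]

matches = best_matches(db_a, db_b, field_pairs)
```

Comparing every pair of records costs time proportional to the product
of the two database sizes. Two different people with the same name are
only told apart if some other shared field differs, which is exactly
where a smarter approach would have to do better.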

Given this problem, could I expect good results from some AI engine?
What kind of AI would be best suited to this - fuzzy logic? The ability
of the algorithm to get better at its job over time would be nice, but
not essential.

If you are a vendor who thinks they have a suitable, highly portable
engine that is easy to embed in software, or possibly a product that
generates C code, then I would like to hear from you.


Pete Rowley