Re: Data Metrics

Jake Milman (jmilman@amdocs.com)
Sun, 26 Jul 1998 16:58:14 +0200 (MET DST)

For non-numeric data (e.g. "distance" between RNA strands), the Tanimoto
distance is sometimes used. (Search the net for refs.)

Jake

Will Dwinnell <76743.1740@CompuServe.COM> wrote in article
<OI$N7Bns9GA.377@nih2naaa.prod2.compuserve.com>...
> "does anyone know about approaches to theories on the
> definition of a metric for data?
>
> Something like measuring the "distance" between two records,
> based on the data they contain..."
>
> This is a well-explored subject for purely numeric data. For
> mixed numeric/nominal or purely nominal data, there is less
> information. With numeric data, look for "distance metrics" and
> "similarity measures" in multivariate statistics. Some common
> examples (which may aid in a keyword search) are Euclidean
> distance (also known as geometric or L-1 distance), Manhattan
> distance (also known as city-block, absolute, maximum or
> L-infinity distance) and Mahalanobis distance. Purely nominal
> data is sometimes compared using string distances like the
> Levenshtein distance.
>
> --
> Will Dwinnell
>

############################################################################
This message was posted through the fuzzy mailing list.
(1) To subscribe to this mailing list, send a message body of
"SUB FUZZY-MAIL myFirstName mySurname" to listproc@dbai.tuwien.ac.at
(2) To unsubscribe from this mailing list, send a message body of
"UNSUB FUZZY-MAIL" or "UNSUB FUZZY-MAIL yoursubscription@email.address.com"
to listproc@dbai.tuwien.ac.at
(3) To reach the human who maintains the list, send mail to
fuzzy-owner@dbai.tuwien.ac.at
(4) WWW access and other information on Fuzzy Sets and Logic see
http://www.dbai.tuwien.ac.at/ftp/mlowner/fuzzy-mail.info
(5) WWW archive: http://www.dbai.tuwien.ac.at/marchives/fuzzy-mail/index.html