Re: Data Metrics

Robin B. Lake (rbl@hal.EPBI.CWRU.Edu)
Sun, 26 Jul 1998 18:51:17 +0200 (MET DST)

You asked:
>
> Hi,
> does anyone know about approaches to theories on the definition of
> a metric for data?
>
> Something like measuring the "distance" between two records, based
> on the data they contain, NOT on the repository (their "physical" distance
> on a hard disk plate, which wouldn't be that sensical anyway).
>
Is the data numeric? If so, look at the distribution function for
each data element. If not numeric, say text data, convert to numeric by
establishing a numeric measure for each text item from the set of
text items, using something like a similarity measure for text.

Rob Lake
Environmental Modeling Inc.
rbl@po.cwru.edu

############################################################################
This message was posted through the fuzzy mailing list.
(1) To subscribe to this mailing list, send a message body of
"SUB FUZZY-MAIL myFirstName mySurname" to listproc@dbai.tuwien.ac.at
(2) To unsubscribe from this mailing list, send a message body of
"UNSUB FUZZY-MAIL" or "UNSUB FUZZY-MAIL yoursubscription@email.address.com"
to listproc@dbai.tuwien.ac.at
(3) To reach the human who maintains the list, send mail to
fuzzy-owner@dbai.tuwien.ac.at
(4) WWW access and other information on Fuzzy Sets and Logic see
http://www.dbai.tuwien.ac.at/ftp/mlowner/fuzzy-mail.info
(5) WWW archive: http://www.dbai.tuwien.ac.at/marchives/fuzzy-mail/index.html