Date: Wed Mar 15 2000 - 02:45:54 MET


Sixth ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
August 20, 2000
Boston, MA, USA
Workshop Web Site:


Hillol Kargupta
Faculty of Computer Science
School of Electrical Engineering and Computer Science
Washington State University
Pullman, WA 99164-2752

Joydeep Ghosh
Department of Electrical and Computer Engineering
Engineering Science Building 143
The University of Texas at Austin
Austin, TX 78712-1084,USA

Philip Chan
Computer Science Department
Florida Institute of Technology
150 W. University Blvd.
Melbourne, FL 32901

Vipin Kumar
Department of Computer Science and Army High Performance Computing Research Center
4-192, EE/CSci Building,
University of Minnesota
Minneapolis, MN 55455

Zoran Obradovic
Faculty of Computer Science
School of Electrical Engineering and Computer Science
Washington State University
Pullman, WA 99164-2752

Program Committee

 Chandrika Kamath, Lawrence Livermore Laboratory
 Mohammed Zaki, Rensselaer Polytechnic Institute
 Vincent To-yee Ng, Hong Kong Polytechnic University
 Nandit Soparkar, University of Michigan
 Charles R. Musick, Lawrence Livermore Laboratory
 Dharmendra Modha, IBM Almaden Research Center
 Ruediger Wirth, DaimlerChrysler Corporation
 Sreenivas, Mahesh, Microsoft Corporation
 Yike Guo, Imperial College
 Kagan Tumer, NASA Ames Research Center
 Alok Choudhary, Northwestern University
 Larry Hall, University of South Florida
 Reagan Moore, San Diego Supercomputer Center
 Howard Ho, IBM Almaden Research Center

Electronic Publicity Chair

Byung-Hoon (Hooney) Park, Washington State University

Background Information

Information is most useful when it can be transformed into actionable
knowledge. Since the volume of data in most non-trivial processes is
typically large, the role of automated data analysis in modern information
processing applications is becoming increasingly important. The emerging
development of large, data rich distributed systems like the internet and
intranets has added another dimension to the problem---a plethora of
distributed information resources. Although the communication bandwidth is
increasing, by no means is it increasing at a rate that is even close to the
increase of available information. As a result, downloading large data sets
to a single site over limited bandwidth channels, followed by the
application of centralized data analysis algorithms may not be scalable for
the large, distributed data analysis applications of the future. Moreover,
this may not even be feasible because of security/privacy concerns, or
incompatibilities at the system or information representation levels.
Examples include information retrieval in a mobile computing environment,
situation monitoring using large sensor networks, distributed real-time
advanced practice of telemedicine, and e-commerce applications. We need to
rethink our fundamental approach toward data analysis in these emerging new
environments. Distributed knowledge discovery (DKD) accepts the fact that
data may be inherently distributed among different loosely coupled sites
connected by a network and the sites may have heterogeneous data. It offers
techniques to discover new knowledge through distributed data analysis and
modeling using minimal communication of information. Design and development
of distributed data analysis algorithms, architectures and their
scalability, efficiency, ability to work with different kinds of data and
computing platforms such as handheld devices to web server farms, security,
human-computer interaction are some of the open research problems in DKD.

The first workshop on Distributed Data Mining, held at the Fourth
International Conference on Knowledge Discovery and Data Mining (1998)
brought the interested researchers and practitioners together and created an
environment for crystallizing the fast growing field of DKD. The workshop
was focused on the state-of-the-art DKD algorithms, systems, and
application related issues. Approximately 40 participants attended the
workshop. The workshop had 13 presentations, including 3 invited talks. It
was sponsored by the Intel Corporation and Magnify Inc. It resulted in a
book (edited by Hillol Kargupta and Philip Chan) to be published by AAAI/MIT
Press in early Summer of 2000. With the rapid growth in DKD technology both
in terms of research issues and real-world applications, we expect that the
second workshop will be even more successful. Further details about the
(1998) workshop can be found at

Today after about two years, this field has gained a large boost in
momentum. The number of researchers and practitioners working in this area
has nearly tripled. More research issues are introduced and the field is
starting to crystallize its shape. We believe it is now the time for
hosting another workshop to discuss the state of the art research and
practice. This workshop will also extend its scope by including
complementary research in parallel data mining which is also an exciting
area and highly related to its distributed counterpart.

Workshop Description

The primary thrust of this workshop will be on knowledge discovery from
distributed data. However, high quality research on high performance
parallel data mining is also relevant to the workshop. The proposed workshop
will provide a platform to discuss both theoretical and applied research
issues in distributed and parallel knowledge discovery (DPKD). The topics
of interest include, but are not limited to:

   1) Theoretical foundation of DPKD.

   2) Methods and algorithms: Advanced distributed and parallel data
      analysis and knowledge discovery algorithms.

   3) Architectural issues: Architecture, control, security, and
      communication issues in DPKD.

   4) Distributed data analysis in mobile computing environments.

   5) Experimental systems: Large experimental systems, performance design

   6) Software agents and DPKD: Agent based approaches in DPKD. Agent
      interaction: cooperation, collaboration, negotiation, organizational

   7) Distributed and Parallel knowledge discovery from spatial data.

   8) Applications of DPKD: Application in business, science, engineering,
      medicine, and other disciplines.

   9) Human-computer interaction in DPKD: Human-computer interaction in
      DPKD, multi-user interaction in DPKD.

  10) Distributed Data analysis on the Internet.

Important Dates

March 1, 2000: Workshop Call for Papers.
May 15, 2000: Papers Due.
June 15, 2000: Acceptance Notification.
July 15, 2000: Revision Due.
August 20, 2000: Workshop.

Paper Submission

All papers must be submitted to the following address:

          Hillol Kargupta
          Faculty of Computer Science
          School of Electrical Engineering and Computer Science
          Washington State University
          Pullman, WA 99164-2752
          Phone: (509) 335-6602
          Fax: (509) 335-3818

Electronic submission (postscript, pdf, or MS Word format) is highly
encouraged. For hard-copy submission, please send three (3) copies of the
full paper to the above address.

Electronic submissions should be sent to


The accepted papers will be published in the workshop proceedings from ACM
Press. Selected papers from the workshop will be invited to submit papers in
a special issue on DPKD in an appropriate journal. The organizing committee
is currently in the process of finalizing this issue.


The 1998 workshop on distributed data mining was sponsored by Intel and
Magnify Inc. This year we are also seeking sponsors. This will be used to
support plenary speakers, graduate student travel assistance, and other
related expenses. For further details send e-mail to

This message was posted through the fuzzy mailing list.
(1) To subscribe to this mailing list, send a message body of
"SUB FUZZY-MAIL myFirstName mySurname" to
(2) To unsubscribe from this mailing list, send a message body of
(3) To reach the human who maintains the list, send mail to
(4) WWW access and other information on Fuzzy Sets and Logic see
(5) WWW archive:

This archive was generated by hypermail 2b25 : Thu Apr 06 2000 - 15:59:49 MET DST