The Queen's University of Belfast
Parallel Computer Centre

[Next] [Previous] [Top]

Introduction


Historical Perspective

Problems

What is data mining?

Definitions

the non trivial extraction of implicit, previously unknown, and potentially useful information from data

William J Frawley, Gregory Piatetsky-Shapiro and Christopher J Matheus

variety of techniques to identify nuggets of information or decision-making knowledge in bodies of data, and extracting these in such a way that they can be put to use in the areas such as decision support, prediction, forecasting and estimation. The data is often voluminous, but as it stands of low value as no direct use can be made of it; it is the hidden information in the data that is useful

Clementine User Guide

Techniques used

Summary

Comparison Data Mining and DBMS

Characteristics

of a data mining system

Who needs data mining?

Who(ever) has information fastest and uses it wins

Don McKeough,

former president of Coke Cola

Example

Philadelphia Police & Fire Credit Union

Data Mining

Applications

Data Mining Goals

Classification

if STATUS = married and INCOME > 10000

and HOUSE_OWNER = yes

then INVESTMENT_TYPE = good

Association

Sequence/Temporal

Data Mining and Machine Learning

Differences

Differences

Data Mining Process



Data Mining Process

Stages

Issues in Data Mining

Techniques

Knowledge Representation Methods



A neural net can be trained to identify the risk of cancer from a number of factors



Related Technologies

Data Warehousing

Definition

Characteristics of a data warehouse

defined by Bill Inmon (IS guru)

Data warehousing

Processes

Uses

of a data warehouse

Data Warehouse model



Structure of data inside the data warehouse

An example of levels of summarization of data

Criteria

for a data warehouse

Problems with data warehousing

these companies have slapped `data warehouse' labels on traditional transaction-processing products and co- opted the lexicon of the industry in order to be considered players in this fast-growing category

Chris Erickson, Red Brick

Data warehousing & OLTP

Similarities and Differences

OLTP systems

Data warehouse systems

OLAP

On-line Analytical processing

OLAP

common analytical operations

Knowledge acquisition

using data mining

Siftware History

Commercial Examples

Information Harvester Inc.

Red Brick Company

IBM

Data mining projects

UU - Jordanstown

UUJ Example

Policy lapse/renewal prediction

MKS

The Mining Kernel System



MKS

Conclusion

The future


[Next] [Previous] [Top]
All documents are the responsibility of, and copyright, © their authors and do not represent the views of The Parallel Computer Centre, nor of The Queen's University of Belfast.
Maintained by Alan Rea, email A.Rea@qub.ac.uk
Generated with CERN WebMaker