![]() |
Centre for Science and Engineering of Materials
|
|
SATOMGI Module 1 Lecturer: Dr. Peter Christen, Department of Computer Science,
ANU Module Description: Data mining is data analysis performed on very large databases with an emphasis on identifying and extracting novel, potentially useful, and understandable patterns and associations. Data mining is a multi-disciplinary field which uses a combination of machine learning, statistical analysis, modelling techniques, visualisation and database technology. In many cases information from several data sources needs to be matched, linked and aggregated in order to allow more detailed data analysis or mining. Similarly, detecting and removing duplicate records that relate to the same entity within one data set is of importance, as data quality affects any subsequent analysis or mining. The aim of such linkages is to match and aggregate all records relating to the same entity. Learning outcomes: On completion of this module, participants
should have gained a understanding of the basic concepts and techniques
used in data mining and data matching, including: Assumed knowledge:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Page last updated: 24 July 2007 Please direct all enquiries to: Webmaster CSEM Page authorised by: Director, CSEM |
| The Australian National University — CRICOS Provider Number 00120C |