Applied Statistics
International Conference, 19 - 22 September 2004
Ljubljana, Slovenia


Data Mining in Large Datasets

Brian Ripley
Department of Statistics
University of Oxford
Oxford, UK

Data mining used to mean over-analysing a data set, but that is now called `data dredging', and data mining is a newly popular term for finding structure in large-scale databases.

The talk will give an introduction to the area and illustrate it by two case studies from Magnetic Resonance Imaging of human brains, one on classifying tissue and one on determining where in the brain the subject is thinking.