Beyond statistical procedures for predictive modelling: Data mining algorithms and support for university research at QUT


Author(s): Duplock, Ray; Kelson, Neil A.
Date(s)

01/11/2010

Abstract

In a seminal data mining article, Leo Breiman [1] argued that to develop effective predictive classification and regression models, we need to move away from sole dependence on statistical algorithms and embrace a wider toolkit of modeling algorithms that includes data mining procedures. Nevertheless, many researchers still rely solely on statistical procedures when undertaking data modeling tasks; this sole reliance has led to the development of irrelevant theory and questionable research conclusions ([1], p.199). We will outline initiatives that the HPC & Research Support group is undertaking to engage researchers with data mining tools and techniques, including a new range of seminars, workshops, and one-on-one consultations covering data mining algorithms, the relationship between data mining and the research cycle, and the limitations and problems of these new algorithms. Organisational limitations and restrictions on these initiatives are also discussed.

Format

application/pdf

Identifier

http://eprints.qut.edu.au/38617/

Relation

http://eprints.qut.edu.au/38617/1/DMConferencePoster_September_2010.pdf

http://www.eresearch.edu.au/

Duplock, Ray & Kelson, Neil A. (2010) Beyond statistical procedures for predictive modelling: Data mining algorithms and support for university research at QUT. In eResearch Australasia 2010 : 21st Century Research : Where Computing Meets Data, 8th-12th November 2010, RACV Royal Pines, Gold Coast, Queensland. (Unpublished)

Source

Division of Technology, Information and Learning Support; High Performance Computing and Research Support

Keywords

#010499 Statistics not elsewhere classified #data mining #statistical procedures #HERN #predictive modelling

Type

Conference Item