PV056 Machine learning and data mining


Projects

Writing papers on machine learning

Projects in (2009) (2010) (2011)


Literature
Evaluation

Course materials

* = is not subject to the final exam

Slides are available at IS MU -> PV056 -> Teaching materials.

Weka
(homepage) (datasets) (wiki) (how do I use WEKA's classes in my own code)
(multi-instance classification in weka) and (data sets)
Time series in Weka (two filters in weka) (TClass) (from wpi)
Artificial neural network plugins etc.
BioWeka

R
Home page of the R-project
R-project at CUNI


INTRODUCTION TO MACHINE LEARNING

Introduction
Classification. Version spaces (animation)
Learning decision trees. C4.5 (animation 1) (animation 2)
Evaluation of results
Bayesian learning. Instance-based learning. SVM.
Network models. Neural nets. Winnow
Inductive logic programming (multi-relational learning) (intro) (advanced) (aleph)
Unsupervised learning. Cluster analysis. CLUSTER. AutoClass
Outlier and rare event detection (intro) (KDD 2010 Tutorial)

ADVANCED MACHINE LEARNING

Ensemble methods. Bagging. Boosting. Random forests
Regression trees (C&RT)
Active learning
Propositionalization
Learning from streams (ICDM 2006 Tutorial)
Learning from multi-instance data
Semi-supervised learning
Preference learning
Meta-learning (intro) (Data Mining Advisor)

DATA MINING

MINING IN STRUCTURED DATA

DATA MINING AND BIOINFORMATICS

VISUAL ANALYTICS


Links

Teaching

R Systems Data sets OLAP, data warehouses and data mining