Incremental Rule Learning and Border Examples Selection from Numerical Data Streams
Francisco J. Ferrer-Troyano (Computer Science Department, University of Seville, Spain)
Jesús S. Aguilar-Ruiz (Computer Science Department, University of Seville, Spain)
José C. Riquelme (Computer Science Department, University of Seville, Spain)
Abstract: Mining data streams is a challenging task that requires online systems based on incremental learning approaches. This paper describes a classification system based on decision rules that may store up-to-date border examples to avoid unnecessary revisions when virtual drifts are present in data. Consistent rules classify new test examples by covering and inconsistent rules classify them by distance as the nearest neighbour algorithm. In addition, the system provides an implicit forgetting heuristic so that positive and negative examples are removed from a rule when they are not near one another.
Keywords: classification, concept drift, data streams, decision rules, incremental learning
Categories: H.2.8, I.2.6, I.5.2