An official website of the European Union How do you know?      
European Commission logo
JRC Publications Repository Menu

Leveraging Sequential Pattern Information for Active Learning from Sequential Data

cover
This paper presents a novel active learning technique aimed at the selection of sequences for manual annotation from a database of unlabelled sequences. Supervised machine learning algorithms can employ these sequences to build better models than from using random sequences for training. The main contribution of the proposed method is the use of sequential pattern information contained in the database to select very representative and diverse sequences for annotation. These two characteristics ensure the proper coverage of the instance space of sequences and, at the same time, avoids over-fitting the trained model. The approach, called SPIAL (Sequential Pattern Information for Active Learning), uses sequential pattern mining algorithms to extract frequently occurring sub-sequences from the database and evaluates how representative and diverse each sequence is, based on this information. The output is a list of sequences for annotation sorted by representativeness and diversity. The algorithm is modular and, unlike current techniques, independent of the features taken into account by the machine learning algorithm that trains the model. Experiments done on well-known benchmarks involving sequential data show that the models trained using SPIAL increase their convergence speed while significantly reducing computation cost compared to other state-of-the-art algorithms. In addition, the sets of sequences proposed are smaller, which reduces annotation effort.
2021-03-31
INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS
JRC122281
1051-4651 (online),   
https://ailb-web.ing.unimore.it/icpr/paper/120/nn,    https://publications.jrc.ec.europa.eu/repository/handle/JRC122281,   
Language Citation
NameCountryCityType
Datasets
IDTitlePublic URL
Dataset collections
IDAcronymTitlePublic URL
Scripts / source codes
DescriptionPublic URL
Additional supporting files
File nameDescriptionFile type 
Show metadata record  Copy citation url to clipboard  Download BibTeX
Items published in the JRC Publications Repository are protected by copyright, with all rights reserved, unless otherwise indicated. Additional information: https://ec.europa.eu/info/legal-notice_en#copyright-notice