A local search approach for generating directional data
In application fields such as linguistic and computer vision there is an increasing need of reference data for the empirical analysis of new methods and the assessment of different algorithms. Current evaluations are based on few real-life collections or on artificial data generators built on models that are too simplistic to cover real scenarios and to allow researchers to identify crucial limitations of their algorithms. We propose a flexible approach to generate high-dimensional vectors, with directional properties controlled by the distribution of their pair-wise cosine distances. The generation method is formulated as a non-linear continuous optimization problem, which is solved with a computationally efficient local search algorithm. We show with an empirical study that our approach can create large high-dimensional data collections with desired properties in reasonable time.
CONSOLI Sergio;
TURCHI Marco;
PERROTTA Domenico;
2016-04-28
IOS PRESS
JRC93359
1088-467X,
http://content.iospress.com/articles/intelligent-data-analysis/ida813,
https://publications.jrc.ec.europa.eu/repository/handle/JRC93359,
10.3233/IDA-160813,
Additional supporting files
| File name | Description | File type | |