Extensions to the recently introduced concept of pairwise overlap between mixture components are proposed. The notion of overlap is useful for studying the systematic performance of clustering algorithms. Existing methods can be used for simulating elliptical data according to pre-specified overlap characteristics. First, an approach to simulating skewed clusters with a desired overlap is proposed. Next, an extension to measuring overlap in cluster-weighted models is considered. Thus, this paper provides important extensions to the existing methods for simulating heterogeneous data for studying the systematic performance of clustering algorithms.
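As an illustration of the overlap notion mentioned in the abstract, the sketch below estimates pairwise overlap between two univariate Gaussian mixture components by Monte Carlo. It assumes the commonly used definition of pairwise overlap as the sum of the two misclassification probabilities; the abstract itself does not state a formula, and the paper's extensions to skewed clusters and cluster-weighted models are not covered. All function and parameter names are hypothetical.

```python
import math
import random


def normal_pdf(x, mean, sd):
    """Density of a univariate normal distribution."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def misclassification_prob(w_i, mean_i, sd_i, w_j, mean_j, sd_j, n, rng):
    """Monte Carlo estimate of P(w_i * f_i(X) < w_j * f_j(X)) with X drawn from component i."""
    hits = 0
    for _ in range(n):
        x = rng.gauss(mean_i, sd_i)
        if w_i * normal_pdf(x, mean_i, sd_i) < w_j * normal_pdf(x, mean_j, sd_j):
            hits += 1
    return hits / n


def pairwise_overlap(w_i, mean_i, sd_i, w_j, mean_j, sd_j, n=20000, seed=0):
    """Assumed definition: overlap = (i misclassified as j) + (j misclassified as i)."""
    rng = random.Random(seed)  # fixed seed so the estimate is reproducible
    return (
        misclassification_prob(w_i, mean_i, sd_i, w_j, mean_j, sd_j, n, rng)
        + misclassification_prob(w_j, mean_j, sd_j, w_i, mean_i, sd_i, n, rng)
    )


if __name__ == "__main__":
    close = pairwise_overlap(0.5, 0.0, 1.0, 0.5, 2.0, 1.0)
    far = pairwise_overlap(0.5, 0.0, 1.0, 0.5, 10.0, 1.0)
    print(f"overlap, means 2 apart:  {close:.4f}")
    print(f"overlap, means 10 apart: {far:.4f}")
```

For two equally weighted components with unit variance and means a distance d apart, the exact overlap under this definition is 2Φ(−d/2), which gives a simple check on the estimate (about 0.317 for d = 2).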
MELNYKOV Volodymyr;
WANG Yang;
MELNYKOV Yana;
TORTI Francesca;
PERROTTA Domenico;
RIANI Marco;
2024-04-11
TAYLOR & FRANCIS INC
JRC129831
1061-8600 (online)
https://www.tandfonline.com/doi/full/10.1080/10618600.2023.2210338
https://publications.jrc.ec.europa.eu/repository/handle/JRC129831
10.1080/10618600.2023.2210338 (online)
This is a public document. You can share this publication.