Clustering is one of the main data mining techniques used to analyze and group data, but often applications have to deal with a very large amount of spatially distributed data for which most of the clustering algorithms available so far are impractical. In this paper we present P2PRASTER, a distributed algorithm relying on a gossip–based protocol for clustering that exploits the RASTER algorithm and has been designed to handle big data in a decentralized manner. The experiments carried out show that P2PRASTER returns perfect results under both optimal and non-optimal conditions, and also provides excellent scalability.
Grid-Based Contraction Clustering in a Peer-to-Peer Network
Epicoco, ItaloSecondo
Methodology
;Cafaro, Massimo
Penultimo
Methodology
;Pulimeno, MarcoUltimo
Methodology
2023-01-01
Abstract
Clustering is one of the main data mining techniques used to analyze and group data, but often applications have to deal with a very large amount of spatially distributed data for which most of the clustering algorithms available so far are impractical. In this paper we present P2PRASTER, a distributed algorithm relying on a gossip–based protocol for clustering that exploits the RASTER algorithm and has been designed to handle big data in a decentralized manner. The experiments carried out show that P2PRASTER returns perfect results under both optimal and non-optimal conditions, and also provides excellent scalability.File | Dimensione | Formato | |
---|---|---|---|
camera ready.pdf
solo utenti autorizzati
Descrizione: Atto di convegno
Tipologia:
Post-print referato (Refereed author’s manuscript)
Licenza:
NON PUBBLICO - Accesso privato/ristretto
Dimensione
297.74 kB
Formato
Adobe PDF
|
297.74 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.