Multidimensional balance-based cluster boundary detection for high-dimensional data

Xiaofeng CAO, Baozhi QIU, Xiangli LI, Zenglin SHI, Guandong XU, Jianliang XU

Research output: Contribution to journal > Articles > peer-review

9 Citations (Scopus)

Abstract

The balance of neighborhood space around a central point is an important concept in cluster analysis. It can be used to effectively detect cluster boundary objects. Existing neighborhood analysis methods focus on the distribution of data, i.e., they analyze the characteristics of the neighborhood space from a single perspective and therefore cannot capture rich data characteristics. In this paper, we analyze the high-dimensional neighborhood space from multiple perspectives. By modeling each dimension of a data point's k-nearest-neighbor (kNN) space as a lever, we apply the lever principle to compute the balance fulcrum of each dimension after proving its inevitability and uniqueness. Then, we model the distance between the projected coordinate of the data point and the balance fulcrum on each dimension and construct the DHBlan coefficient to measure the balance of the neighborhood space. Based on this theoretical model, we propose a simple yet effective cluster boundary detection algorithm called Lever. Experiments on both low- and high-dimensional data sets validate the effectiveness and efficiency of our proposed algorithm. Copyright © 2018 IEEE.
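
The per-dimension balance idea described in the abstract can be illustrated with a minimal sketch. This is not the paper's Lever algorithm or its DHBlan coefficient, whose exact definitions are not given on this page. It assumes each of the k nearest neighbors carries equal weight, so the balance fulcrum on each dimension is simply the mean of the neighbors' coordinates, and it scores a point by its summed per-dimension distance to those fulcrums, divided by the mean neighbor distance. The function name `balance_scores` and the normalization are hypothetical choices made for illustration only.

```python
import math


def balance_scores(points, k):
    """Return one imbalance score per point (illustrative, not DHBlan).

    A score near 0 means the point sits at the balance point of its
    k nearest neighbors on every dimension; larger scores suggest the
    neighborhood is lopsided, as expected near a cluster boundary.
    """
    scores = []
    for i, p in enumerate(points):
        # Distance from p to every other point, sorted ascending
        # (ties are broken by point index).
        others = sorted(
            (math.dist(p, q), j) for j, q in enumerate(points) if j != i
        )
        nearest = others[:k]
        if not nearest:
            scores.append(0.0)
            continue
        # Assumption: equal weights, so the fulcrum of each dimension's
        # "lever" is the mean neighbor coordinate on that dimension.
        fulcrum = [
            sum(points[j][d] for _, j in nearest) / len(nearest)
            for d in range(len(p))
        ]
        # Summed per-dimension offset between the point and the fulcrum.
        offset = sum(abs(p[d] - fulcrum[d]) for d in range(len(p)))
        # Assumption: divide by the mean neighbor distance so the score
        # does not depend on the overall scale of the data.
        scale = sum(dist for dist, _ in nearest) / len(nearest)
        scores.append(offset / scale if scale > 0 else 0.0)
    return scores


# Usage: a 5 x 5 grid, where corners lie on the boundary.
grid = [(float(i), float(j)) for i in range(5) for j in range(5)]
scores = balance_scores(grid, k=4)
print(scores[12], scores[0])  # center point vs. corner point
```

On this grid the center point scores 0 because its four nearest neighbors are symmetric around it, while a corner point scores higher because all of its neighbors lie to one side. Turning scores into boundary labels would still require a threshold or a top-ranked fraction, which this sketch leaves open.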

Original language: English
Pages (from-to): 1867-1880
Journal: IEEE Transactions on Neural Networks and Learning Systems
Volume: 30
Issue number: 6
Early online date: Oct 2018
DOIs: https://doi.org/10.1109/TNNLS.2018.2874458
Publication status: Published - Jun 2019

Citation

Cao, X., Qiu, B., Li, X., Shi, Z., Xu, G., & Xu, J. (2019). Multidimensional balance-based cluster boundary detection for high-dimensional data. IEEE Transactions on Neural Networks and Learning Systems, 30(6), 1867-1880. https://doi.org/10.1109/TNNLS.2018.2874458

Keywords

  • Balance principle
  • Cluster boundary
  • High-dimensional space
  • Unlimited lever
