Differentiating between oriental and European scripts by statistical features

Suk Wah Louisa LAM, Jie DING, Ching Y. SUEN

Research output: Contribution to journal › Article › peer-review

11 Citations (Scopus)

Abstract

Two types of techniques are usually adopted in language differentiation: token matching and statistical analysis. In this paper we present a method which uses a combined analysis of several discriminating statistical features, for the differentiation between European and oriental language scripts. When applied to more than 23 languages, it has proved to be effective in differentiating between documents printed in these different scripts. Copyright © 1998 World Scientific Publishing.
Original language: English
Pages (from-to): 63-79
Journal: International Journal of Pattern Recognition & Artificial Intelligence
Volume: 12
Issue number: 1
Publication status: Published - Feb 1998

Citation

Lam, L., Ding, J., & Suen, C. Y. (1998). Differentiating between oriental and European scripts by statistical features. International Journal of Pattern Recognition & Artificial Intelligence, 12(1), 63-79.

Keywords

  • Asian scripts
  • Script classification
  • Language differentiation
  • Oriental languages
  • Chinese characters
  • Roman scripts
