Classification of oriental and European scripts by using characteristic features

Suk Wah Louisa LAM

Research output: Chapter in Book/Report/Conference proceedingChapter

44 Citations (Scopus)

Abstract

Two types of techniques are usually adopted in language differentiation: token matching and statistical analysis. In this paper we present a method which uses a combined analysis of several discriminating statistical features for the differentiation between European and oriental language scripts. When applied to more than 23 languages, it has proved to be effective in classifying documents printed in these different scripts. Copyright © 1997 IEEE.
Original languageEnglish
Title of host publicationProceedings of the 4th International Conference on Document Analysis and Recognition
Place of PublicationUlm, Germany
PublisherIEEE
Pages1023-1027
ISBN (Print)0818678984, 0818678992
Publication statusPublished - 1997

Fingerprint

Statistical methods

Citation

Ding, J., Lam, L., & Suen, C. Y. (1997). Classification of oriental and European scripts by using characteristic features. In Proceedings of the 4th International Conference on Document Analysis and Recognition (pp. 1023-1027). Ulm, Germany: IEEE.