Two types of techniques are usually adopted in language differentiation: token matching and statistical analysis. In this paper we present a method which uses a combined analysis of several discriminating statistical features, for the differentiation between European and oriental language scripts. When applied to more than 23 languages, it has proved to be effective in differentiating between documents printed in these different scripts. Copyright © 1998 World Scientific Publishing.
|Journal||International Journal of Pattern Recognition & Artificial Intelligence|
|Publication status||Published - Feb 1998|
CitationLam, L., Ding, J., & Suen, C. Y. (1998). Differentiating between oriental and European scripts by statistical features. International Journal of Pattern Recognition & Artificial Intelligence, 12(1), 63-79.
- Asian scripts
- Script classification
- Language differentiation
- Oriental languages
- Chinese characters
- Roman scripts