The Beijing Sentence Corpus: A Chinese sentence corpus with eye movement data and predictability norms

Jinger PAN, Ming YAN, Eike M. RICHTER, Hua SHU, Reinhold KLIEGL

Research output: Contribution to journal › Articles › peer-review

19 Citations (Scopus)


This report introduces the Beijing Sentence Corpus (BSC), a Chinese sentence corpus of eye-tracking data with relatively clear word boundaries. In addition, we report predictability norms for each word in the corpus. Eye movement corpora are available for languages with alphabetic scripts, such as English, German, and French. However, there is no publicly available corpus for Chinese. Thus, to study predictive processes during reading in Chinese, it is necessary to establish such a corpus. Also, given the clear word boundaries in the sentences, the BSC is especially useful for providing evidence relevant to the theoretical debate on saccade target selection in Chinese. With the large-scale predictability norms, we conducted new analyses based on 60 BSC readers, testing the influences of launch word and target word properties while controlling for visual and oculomotor constraints, as well as sentence- and subject-level individual differences. We discuss implications for the guidance of eye movements in Chinese reading.

Copyright © 2021 The Psychonomic Society, Inc.
Original language: English
Pages (from-to): 1989-2000
Journal: Behavior Research Methods
Issue number: 4
Early online date: Nov 2021
Publication status: Published - Aug 2022


Pan, J., Yan, M., Richter, E. M., Shu, H., & Kliegl, R. (2022). The Beijing Sentence Corpus: A Chinese sentence corpus with eye movement data and predictability norms. Behavior Research Methods, 54(4), 1989-2000. doi: 10.3758/s13428-021-01730-2


  • Corpus analysis
  • Eye tracking
  • Chinese reading
  • Predictability

