Abstract
The Corpus of Mid-20th Century Hong Kong Cantonese (HKCC hereafter) is one of the very few Cantonese corpora that provides interactive spoken language data for Cantonese linguistic research. The first phase of HKCC was launched in 2012 with about 200,000 character tokens. The second phase of HKCC is much expanded with data from 60 movies, totaling about 800,000 character tokens.
While the primary purpose of the corpus was to support diachronic studies of Cantonese spoken half a century ago, the dialogic and interactive nature of the corpus data is also useful for other research issues. Besides basic information such as word lists, token frequency and sentences, HKCC, further processed by computer processing and analyses, can provide more useful and interesting quantitative and qualitative data. One such example is word collocation. In this talk, we will demonstrate how such information can be obtained from the second phase of HKCC, and its applications in Cantonese studies. Copyright © 2019 Workshop on Cantonese (WOC).
While the primary purpose of the corpus was to support diachronic studies of Cantonese spoken half a century ago, the dialogic and interactive nature of the corpus data is also useful for other research issues. Besides basic information such as word lists, token frequency and sentences, HKCC, further processed by computer processing and analyses, can provide more useful and interesting quantitative and qualitative data. One such example is word collocation. In this talk, we will demonstrate how such information can be obtained from the second phase of HKCC, and its applications in Cantonese studies. Copyright © 2019 Workshop on Cantonese (WOC).
Original language | English |
---|---|
Publication status | Published - Apr 2019 |
Event | 第十九屆粵語討論會:粵語研究:實證,實正! = The Nineteenth Workshop on Cantonese (WOC-19): Cantonese Study: An Empirical Approach - 香港理工大學, Hong Kong Duration: 13 Apr 2019 → 13 Apr 2019 |
Workshop
Workshop | 第十九屆粵語討論會:粵語研究:實證,實正! = The Nineteenth Workshop on Cantonese (WOC-19): Cantonese Study: An Empirical Approach |
---|---|
Abbreviated title | WOC-19 |
Country/Territory | Hong Kong |
Period | 13/04/19 → 13/04/19 |