Abstract
The life insurance questionnaire is a large document containing responses in a mixture of structured and unstructured data. The unstructured data poses issues for the user, in the form of extra input effort, and the insurance company, in the form of interpretation and analysis. In this work, we aim to address these problems by proposing a semi-supervised framework for clustering responses into categories using vector space embedding of responses and soft k-means clustering. Our experiments show that our method achieves adequate results. The resulting category clusters from our method can be used for analysis and to replace free text input questions with structured questions in the questionnaire. Copyright © 2018 IEEE.
Original language | English |
---|---|
Title of host publication | Proceedings of 2018 5th International Conference on Behavioral, Economic, and Socio-Cultural Computing, BESC 2018 |
Editors | Juan E. GUERRERO |
Place of Publication | Danvers, MA |
Publisher | IEEE |
Pages | 30-31 |
ISBN (Electronic) | 9781728102078 |
DOIs | |
Publication status | Published - Jul 2018 |