%0 Journal Article %A Fu Qunchao %A Han Xu %A Liu Jun %A Wang Hongsu %A Zhang Sanqian %T Sentence segmentation for classical Chinese based on LSTM with radical embedding %D 2019 %R 10.19682/j.cnki.1005-8885.2019.1001 %J 中国邮电高校学报(英文) %P 1-8 %V 26 %N 2 %X A low-than character feature embedding called radical embedding is proposed, and applied on a long-short term memory (LSTM) model for sentence segmentation of pre-modern Chinese texts. The dataset includes over 150 classical Chinese books from 3 different dynasties and contains different literary styles. LSTM-conditional random fields (LSTM-CRF) model is a state-of-the-art method for the sequence labeling problem. This model adds a component of radical embedding, which leads to improved performances. Experimental results based on the aforementioned Chinese books demonstrate better accuracy than earlier methods on sentence segmentation, especial in Tang’s epitaph texts (achieving an F1-score of 81.34%). %U https://jcupt.bupt.edu.cn/CN/10.19682/j.cnki.1005-8885.2019.1001