Mon-2-5-10 Focal Loss for Punctuation Prediction

Jiangyan Yi(Institute of Automation Chinese Academy of Sciences), Jianhua Tao(National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences), Zhengkun Tian(Institute of Automation, Chinese Academy of Sciences), Ye Bai(National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences) and Cunhang Fan(Institute of Automation, Chinese Academy of Sciences)
Abstract: Many approaches have been proposed to predict punctuation marks. Previous results demonstrate that these methods are effective.However, there still exists class imbalance problem during training. Most of the classes in the training set for punctuation prediction are non-punctuation marks. This will affect the performance of punctuation prediction tasks. Therefore, this paper uses a focal loss to alleviate this issue. The focal loss can down-weight easy examples and focus training on a sparse set of hard examples. Experiments are conducted on IWSLT2011 datasets. The results show that the punctuation predicting models trained with a focal loss obtain performance improvement over that trained with a cross entropy loss by up to 2.7% absolute overall F_1-score on test set. The proposed model also outperforms previous state-of-the-art models.
Student Information

Student Events

Travel Grants