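Research on Information Extraction of Airline Microblog Reviews: Taking Air China, China Southern Airlines and China Eastern Airlines as Example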

MA Xiaojun, LIU Yaxue, WEI Xiaoxue, LIU Yan, YU Yuanbo

Journal of Systems Science and Mathematical Sciences, 2017, Vol. 37, Issue 4: 1072-1091.

DOI: 10.12341/jssms13142

Abstract

As more passengers choose air travel for its comfort and speed, demand for air transportation in China has grown year after year. Airlines gain more room for profit but also face intense competition. Extracting opinion information from user comments about airlines can help airlines improve service quality and user experience, and can also give passengers a reference when choosing an airline. This paper is the first to use user comments about airlines on the Sina Weibo platform as its base data, applying conditional random fields (CRF) for opinion information extraction. In related studies, experts and researchers have mostly annotated feature objects and feature words manually on the basis of prior knowledge, and have rarely analyzed what users actually focus on in the comment corpus at hand. This paper therefore first applies the TF-IDF algorithm to extract keywords before manual annotation, identifying the users' concerns in the corpus. An average F-value above 93% demonstrates the effectiveness of the model and provides a new direction for follow-up research.
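As a rough illustration of the keyword step and the evaluation metric mentioned in the abstract, the sketch below ranks terms in a small tokenized corpus by summed TF-IDF weight and computes an F-measure from precision and recall. It is not the authors' implementation: the smoothed IDF formula, the function names `tfidf_keywords` and `f_measure`, and the sample comments are all assumptions made for illustration. Real Weibo comments would first need Chinese word segmentation, which is omitted here, and the abstract does not say how the reported F-value is averaged.

```python
import math
from collections import Counter


def tfidf_keywords(docs, top_k=5):
    """Rank terms across a tokenized corpus by summed TF-IDF weight."""
    n_docs = len(docs)
    # Document frequency: number of comments that contain each term.
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))

    scores = Counter()
    for tokens in docs:
        if not tokens:
            continue
        tf = Counter(tokens)
        for term, count in tf.items():
            # Length-normalized term frequency times a smoothed IDF
            # (the smoothing is an assumption, not taken from the paper).
            idf = math.log((1 + n_docs) / (1 + df[term])) + 1.0
            scores[term] += (count / len(tokens)) * idf
    return [term for term, _ in scores.most_common(top_k)]


def f_measure(precision, recall):
    """Harmonic mean of precision and recall (the F1 score)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical, already-segmented comments (not from the paper's corpus).
comments = [
    ["flight", "delay", "service", "poor"],
    ["meal", "good", "service", "friendly"],
    ["flight", "delay", "again", "delay"],
]
keywords = tfidf_keywords(comments, top_k=3)
```

In a workflow like the one the abstract describes, the top-ranked terms would serve as candidate feature objects and feature words to guide manual annotation before a CRF model is trained.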

Cite this article

MA Xiaojun, LIU Yaxue, WEI Xiaoxue, LIU Yan, YU Yuanbo. Research on Information Extraction of Airline Microblog Reviews: Taking Air China, China Southern Airlines and China Eastern Airlines as Example. Journal of Systems Science and Mathematical Sciences, 2017, 37(4): 1072-1091. https://doi.org/10.12341/jssms13142
