A Latent Space Model for Weighted Keyword Co-occurrence Networks with Applications in Knowledge Discovery in Statistics
报告人简介
潘蕊,中央财经大学统计与数学学院教授、博士生导师,中央财经大学龙马学者青年学者。主要研究领域为网络结构数据的统计建模、时空数据的统计分析等。在Annals of Statistics、Journal of the American Statistical Association、Journal of Business & Economic Statistics等期刊发表论文30余篇。著有中文专著《数据思维实践》《网络结构数据分析与应用》。主持国家自然科学基金项目、全国统计科学研究项目等。具有丰富的统计案例创作经验。曾获得中央财经大学青年教师教学基本功比赛二等奖,首届中国高校财经慕课联盟“同课异构”课程思政教学竞赛一等奖。
内容简介
Keywords are widely recognized as pivotal in conveying the central idea of academic articles. In this article, we construct a weighted and dynamic keyword co-occurrence network and propose a latent space model for analyzing it. Our model has two special characteristics. First, it is applicable to weighted networks; however, most previous models were primarily designed for unweighted networks. Simply replacing the frequency of keyword co-occurrence with binary values would result in a significant loss of information. Second, our model can handle the situation where network nodes evolve over time, and assess the effect of new nodes on network connectivity. We utilize the projected gradient descent algorithm to estimate the latent positions and establish the theoretical properties of the estimators. In the real data application, we study the keyword co-occurrence network within the field of statistics. We identify popular keywords over the whole period as well as within each time period. For keyword pairs, our model provides a new way to assess the association between them. Finally, we observe that the interest of statisticians in emerging research areas has gradually grown in recent years. Supplementary materials for this article are available online.