
Methods and Theory on Model Selection and Model Averaging

Source: 08-30

Time: 9:50-12:15, Sept. 15, Sept. 22, Sept. 29, Oct. 9, 2022

Venue: Conference Room 1, Jin Chun Yuan West Bldg. (近春园西楼第一会议室); Zoom Meeting ID: 271 534 5558, Passcode: YMSC

Speaker: Prof. Yuhong Yang (University of Minnesota)

Description

Model selection and its diagnosis are foundational elements of modern statistical and machine learning applications, serving the purpose of obtaining reliable information and reproducible results. In this short course, we introduce the principles and theory of model selection and model averaging and their applications in high-dimensional regression. Model selection methods include information criteria (AIC, BIC, etc.), cross-validation, penalized regression (LASSO, SCAD, MCP), and more. We will learn to understand their differences, connections, performance, limitations, and proper uses, as well as approaches to achieving the best performance without knowing which method is best for the data at hand. In addition, we will study new tools for characterizing model selection reliability. When model selection uncertainty is high, model averaging/combining typically offers more accurate prediction and more reliable conclusions. Theoretical results covered include model selection consistency, consistent cross-validation, adaptive minimax optimal regression learning in high-dimensional regression, and optimality properties of model averaging methods.
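
As a rough illustration of two ideas named in the description, the sketch below compares two candidate regression models by AIC and BIC and then combines their predictions with Akaike weights. It is not taken from the course materials: the simulated data, the two candidate models, and every function name are hypothetical, and Akaike weighting is only one simple form of model averaging chosen here for brevity.

```python
import math
import random

def fit_constant(xs, ys):
    """Intercept-only model: always predict the sample mean of y."""
    mean_y = sum(ys) / len(ys)
    return (lambda x: mean_y), 1  # (predictor, number of regression parameters)

def fit_linear(xs, ys):
    """Simple linear regression fitted by closed-form least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return (lambda x: intercept + slope * x), 2

def criteria(predict, k, xs, ys):
    """Gaussian-error AIC and BIC, up to a constant shared by all models."""
    n = len(xs)
    rss = sum((y - predict(x)) ** 2 for x, y in zip(xs, ys))
    base = n * math.log(rss / n)
    return base + 2 * k, base + k * math.log(n)

def akaike_weights(aics):
    """Weights proportional to exp(-delta_AIC / 2), normalized to sum to 1."""
    best = min(aics)
    raw = [math.exp(-(a - best) / 2) for a in aics]
    total = sum(raw)
    return [r / total for r in raw]

# Hypothetical data: a linear trend plus Gaussian noise.
random.seed(0)
xs = [i / 10 for i in range(50)]
ys = [1.0 + 0.5 * x + random.gauss(0, 0.3) for x in xs]

models = {"constant": fit_constant(xs, ys), "linear": fit_linear(xs, ys)}
scores = {name: criteria(f, k, xs, ys) for name, (f, k) in models.items()}

# Model selection: pick the candidate with the smallest criterion value.
aic_choice = min(scores, key=lambda m: scores[m][0])
bic_choice = min(scores, key=lambda m: scores[m][1])

# Model averaging: weight each candidate's prediction by its Akaike weight.
weights = dict(zip(scores, akaike_weights([s[0] for s in scores.values()])))

def averaged_prediction(x):
    return sum(weights[name] * models[name][0](x) for name in models)

print("AIC picks:", aic_choice, "| BIC picks:", bic_choice)
print("Akaike weights:", weights)
```

With a strong signal, as in this toy setup, both criteria agree and nearly all of the weight falls on one model. Averaging matters more when the candidates score similarly, which is the high-uncertainty situation the description refers to.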
