Speaker: Associate Professor Tao Luo, Shanghai Jiao Tong University
Time: October 27, 2024, 16:00 to 17:00
Venue: Conference Room 2-2, Mathematics Building
Title: The Theory of Parameter Condensation in Neural Networks
Abstract: In this talk, we will first introduce the phenomenon of parameter condensation in neural networks, which refers to the tendency of certain parameters to converge towards the same values during training. Then, for certain types of networks, we prove that condensation occurs in the early stages of training. We further analyze which hyperparameters and training strategies influence parameter condensation. In some cases, we also provide a phase diagram that delineates whether parameter condensation occurs. We will also briefly discuss the relationship between parameter condensation and generalization ability. Finally, for the late stage of training, we study the set of global minima and present a detailed analysis of its geometric structure and convergence properties.
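Biography:
Tao Luo received his bachelor's degree in 2012 from the inaugural Science Class (now Zhiyuan College) of Shanghai Jiao Tong University. He received his Ph.D. in 2017 from the Hong Kong University of Science and Technology under the supervision of Professor Yang Xiang. His doctoral research focused on the modeling and analysis of crystal dislocations and epitaxial growth, and he received the Hong Kong Mathematical Society Best Thesis Award. From 2017 to 2020 he was a Golomb Visiting Assistant Professor in the Department of Mathematics at Purdue University. Since 2020 he has been a tenure-track associate professor in the School of Mathematical Sciences and the Institute of Natural Sciences at Shanghai Jiao Tong University. His research areas are the mathematical theory of machine learning and of materials science.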
Host: Professor Fei Wang