BEGIN:VCALENDAR VERSION:2.0 PRODID:-//Drupal iCal API//EN X-WR-CALNAME:Events items teaser X-WR-TIMEZONE:America/Toronto BEGIN:VTIMEZONE TZID:America/Toronto X-LIC-LOCATION:America/Toronto BEGIN:DAYLIGHT TZNAME:EDT TZOFFSETFROM:-0500 TZOFFSETTO:-0400 DTSTART:20250309T070000 END:DAYLIGHT BEGIN:STANDARD TZNAME:EST TZOFFSETFROM:-0400 TZOFFSETTO:-0500 DTSTART:20241103T060000 END:STANDARD END:VTIMEZONE BEGIN:VEVENT UID:686a3c1ad4845 DTSTART;TZID=America/Toronto:20250404T153000 SEQUENCE:0 TRANSP:TRANSPARENT DTEND;TZID=America/Toronto:20250404T163000 URL:/combinatorics-and-optimization/events/tutte-colloq uium-aukosh-jagannath SUMMARY:Tutte colloquium-Aukosh Jagannath CLASS:PUBLIC DESCRIPTION:Summary \n\nTITLE:: The training dynamics and local geometry of high-dimensional\nlearning\n\nSPEAKER:\n Aukosh Jagannath\n\nAFFILIATION: \n University of À¶Ý®ÊÓÆµ\n\nLOCATION:\n MC 5501\n\nABSTRACT:Many modern d ata science tasks can be expressed as optimizing\na complex\, random funct ions in high dimensions. The go-to methods for\nsuch problems are variatio ns of stochastic gradient descent (SGD)\,\nwhich perform remarkably well —c.f. the success of modern neural\nnetworks. However\, the rigorous ana lysis of SGD on natural\,\nhigh-dimensional statistical models is in its i nfancy. In this talk\,\nwe study a general model that captures a broad ran ge of learning\ntasks\, from Matrix and Tensor PCA to training two-layer n eural\nnetworks to classify mixture models. We show the evolution of natur al\nsummary statistics along training converge\, in the high-dimensional\n limit\, to a closed\, finite-dimensional dynamical system called their\nef fective dynamics. We then turn to understanding the landscape of\ntraining from the point-of-view of the algorithm. We show that in this\nlimit\, th e spectrum of the Hessian and Information matrices admit an\neffective spe ctral theory: the limiting empirical spectral measure and\noutliers have e xplicit characterizations that depend only on these\nsummary statistics. I will then illustrate how these techniques can be\nused to give rigorous d emonstrations of phenomena observed in the\nmachine learning literature su ch as the lottery ticket hypothesis and\nthe \"spectral alignment\" phenom enona. This talk surveys a series of\njoint works with G. Ben Arous (NYU)\ , R. Gheissari (Northwestern)\, and\nJ. Huang (U Penn).\n\nThis talk is ba sed on joint work with Saeed Ghadimi and Henry\nWolkowicz from University of À¶Ý®ÊÓÆµ and Diego Cifuentes and Renato\nMonteiro from Georgia Tech.\n\ n \n\n \n DTSTAMP:20250706T090426Z END:VEVENT END:VCALENDAR