Tutorial

Thursday, Aug. 24, 2023

Time: 2:00 p.m. — 6:00 p.m.

Location: 华东师范大学普陀校区思羣堂

Host: TBA

Zhihua Zhang

Peking University

Title: 构建人工智能的基座模型：技术、挑战和未来

Abstract: 自从OpenAI发布了ChatGPT，大语言模型(LLM)引起了社会各界广发关注和遐想，同时也衍生了各种大模型的应用场景开发热潮。大语言模型的构建是一个复杂而又精细的巨系统，它不仅牵涉到数据质量、算力分配，而且同样取决于工程技艺、算法实现细节等。这个报告主要讨论构建大模型的一些技术问题，比如, 大模型基本组件，数据清洗，分词(Tokenization), 对齐(Alignment) 等。同时从Scaling Law和Compression角度来讨论理解大模型的机理。最后报告也试图分享个体或学术届在大模型研发的机会和作为，以及未来通用人工智能的潜在方向。

Yuling Jiao

Wuhan University

Title: Theoretical Study on Deep Learning: Approximation, Generalization, Optimization, Representation and Generation

Abstract: In the first part of this talk, I will discuss some theoretical studies on deep learning with a focus on approximation, generalization, optimization, and representation. In particular, I will cover error analysis with over-parameterization. In the second part, I will delve into sampling and generative learning via and SDE and ODE.