A111
Course Coordinator: 
Gerald Pao
Nonlinear Time Series Analysis and Manifold Learning
Description: 

Over the last 50 years or so nonlinear dynamics and chaos has revolutionized our understanding of the complexity of the natural world. However, the vast majority of this understanding, has come from the study of toy models that captured the flavor of real problems but were not models of real systems. As such dynamical systems theory has become more an area of mathematics than natural experimental science. Much of this is because experiments were difficult to perform and we lived in a data limited world. Now with the advent of big data and increases in computing performance in the 3rd decade of the 21st century we are no longer data-limited and can take full advantage of the theory of dynamical systems for the analysis of observed data. This data science course is focused on the analysis of dynamical systems, mostly from nonlinear time series, to find hidden properties and causal relationships in complex systems. The course teaches the students what can practically be done using the intellectual framework of chaos theory and dynamical systems using a data driven approach to maximally extract information from time series and their mathematical properties hidden in the geometry of their embeddings.

Aim: 
Course Content: 

week Topic Module exercises
1 Introduction to time series & prediction Autocorrelation; Prediction horizons; Linear models; Correlation/synchrony Exercises: Python/R Plotting of time series and their basic statistical properties
2 Takens theorem & Embeddings Simplex algorithm Exercises: Embedding dimension and time constant search
3 Non-linearity & noise S-MAP algorithm Exercises: S-MAP exercises
4 Denoising time series Hilbert-Huang transform, Fourier transform, Kalman filter Exercises: Hilbert-Huang transform, Fourier transform, Kalman filter
5 Causal inference Convergent crossmapping (CCM) Exercises: Convergent crossmapping
6 Scaling up and multivariate models CCM and multivariate models Exercises: BLOCK_LNLP & multivariate simplex
7 Dimensionality reduction PCA, UMAP, t-SNE, MDS, Isomap, PHATE, CC
8 Early warning signs of critical transitions and anomaly detection in streaming data
9 Student presentations
From the student presentation we will select the most promising dataset to analyze and work on a group project that should produce publishable data that will be jointly authored by the entire class.
10 - 12 Group project

Course Type: 
Elective
Credits: 
2
Assessment: 

Student will present a proposed data set and perform the analyses and present in class

Text Book: 

Analysis of Observed Chaotic Data, Henry DI Abarbanel; Springer
Nonlinear Time Series Analysis, Holger Kantz & Thomas Schreiber; Cambridge

Reference Book: 
Prior Knowledge: 
Python and/or R programming, linear algebra, B49 Dynamical systems
Notes: 

Alternate years course: AY2023