1. Awards
- CALS Outstanding Graduate Teaching Assistant in the Department of Statistics and Data Science, 2021-2022
- CALS Outstanding Graduate Teaching Assistant in the Department of Statistics and Data Science, 2018-2019
2. Teaching Experience
2.1. Visiting Assistant Professor
2.1.1. (upcoming) Data Science For All STSCI/CS/INFO/ORIE 1380, Spring 2024
This course provides an introduction to data science using the statistical programming language R. We focus on building skills in inferential thinking and computational thinking, guided by the practical questions we seek to answer from data sets arising in medicine, economics and other social sciences. The course starts with essential R programming principles, and how to use R for data manipulation, visualization, and sampling. These techniques are then used to summarize and visualize real data sets, draw meaningful conclusions from those data, and assess the uncertainty surrounding those conclusions. Throughout the process, students will learn to develop hypotheses about their data, and use simulations and statistical techniques to test these hypotheses. The course also covers how to use the Tidyverse open-source R packages to clean and organize complex data sets, and create high quality graphics for data visualization.
2.1.2. Statistical Methods I BTRY 6010, Fall 2023
Develops and uses statistical methods to analyze data arising from a wide variety of applications. Topics include descriptive statistics, point and interval estimation, hypothesis testing, inference for a single population, comparisons between two populations, one- and two-way analysis of variance, comparisons among population means, analysis of categorical data, and correlation and regression analysis. Introduces interactive computing through statistical software. Emphasizes basic principles and criteria for selection of statistical techniques.
2.2. Graduate Teaching Assistant
At Cornell University, I have been a teaching assistant with responsibilities of leading discussion/lab sections, holding office hours, and grading work, in the following classes.
2.2.1. Data Science For All STSCI/CS/INFO/ORIE 1380, Spring 2023
This course provides an introduction to data science using the statistical programming language R. We focus on building skills in inferential thinking and computational thinking, guided by the practical questions we seek to answer from data sets arising in medicine, economics and other social sciences. The course starts with essential R programming principles, and how to use R for data manipulation, visualization, and sampling. These techniques are then used to summarize and visualize real data sets, draw meaningful conclusions from those data, and assess the uncertainty surrounding those conclusions. Throughout the process, students will learn to develop hypotheses about their data, and use simulations and statistical techniques to test these hypotheses. The course also covers how to use the Tidyverse open-source R packages to clean and organize complex data sets, and create high quality graphics for data visualization.
2.2.2. Linear Models with Matrices STSCI 4030/5030, Fall 2022
The focus of this course is the theory and application of the general linear model expressed in its matrix form. Topics will include: least squares estimation, multiple linear regression, coding for categorical predictors, residual diagnostics, anova decomposition, polynomial regression, model selection techniques, random effects and mixed models, maximum likelihood estimation and distributional theory assuming normal errors. Homework assignments will involve computation using the R statistical package.
2.2.3. Theory of Statistics STSCI 4090/5090, Spring 2022
Introduction to classical theory of parametric statistical inference that builds on the material covered in BTRY 3080. Topics include sampling distributions, principles of data reduction, likelihood, parameter estimation, hypothesis testing, interval estimation, and basic asymptotic theory.
2.2.4. Statistical Methods I BTRY 6010, Fall 2021, Fall 2020, Fall 2019, Fall 2018
Develops and uses statistical methods to analyze data arising from a wide variety of applications. Topics include descriptive statistics, point and interval estimation, hypothesis testing, inference for a single population, comparisons between two populations, one- and two-way analysis of variance, comparisons among population means, analysis of categorical data, and correlation and regression analysis. Introduces interactive computing through statistical software. Emphasizes basic principles and criteria for selection of statistical techniques.
2.2.5. Bayesian Data Analysis BTRY 4780, Spring 2020
Bayesian data analysis uses probability theory as a kind of calculus of inference, specifying how to quantify and propagate uncertainty in data-based chains of reasoning. Students will learn the fundamental principles of Bayesian data analysis, and how to apply them to varied data analysis problems across science and engineering. Topics include: basic probability theory, Bayes's theorem, linear and nonlinear models, hierarchical and graphical models, basic decision theory, and experimental design. There will be a strong computational component, using a high-level language such as R or Python, and a probabilistic language such as BUGS or Stan.
2.2.6. Statistical Methods II BTRY 6020, Spring 2019
Continuation of BTRY 6010. Emphasizes the use of multiple regression analysis, analysis of variance, and related techniques to analyze data in a variety of situations. Topics include an introduction to data collection techniques; least squares estimation; multiple regression; model selection techniques; detection of influential points, goodness-of-fit criteria; principles of experimental design; analysis of variance for a number of designs, including multi-way factorial, nested, and split plot designs; comparing two or more regression lines; and analysis of covariance. Emphasizes appropriate design of studies before data collection, and the appropriate application and interpretation of statistical techniques. Practical applications are implemented using a modern, widely available statistical package.