# Instructor

Professor Nathan Taback

Office: SS6027C

# Class Time

Classroom sessions: T 14:00-16:00, R 10:00-11:00

# Course Content

This course will provide an introduction to the fundamental concepts of the design of scientific studies, including the design of experiments and observational studies. Students will become acquainted with statistical methods used to design and analyze experiments and observational studies. In particular, this course will cover: experiments versus observational studies, clinical trial design, comparing several groups using a completely randomized design, randomized blocks, Latin squares, incomplete block designs, factorial designs, causal inference in randomized and non-randomized studies, and adjusting for selection bias using propensity score methods.

The learning objectives of this course are:

• Understand the ideas, principles, and considerations that are common to the design and analysis of scientific studies including the statistical design of experiments and observational studies.
• Develop a statistical toolbox of methods for the design and analysis of experiments and observational studies.
• Identify appropriate uses and interpretations of experimental designs and observational studies, including their strengths and limitations.

## Topics

### Experiments, observational studies, and causal inference

Experiments versus observational studies, and causal inference in randomized experiments.

### Selection Bias in Observational Studies

Causal inference in randomized experiments versus observational studies. Introduction to the propensity score and three ways to use the propensity score to adjust for selection bias: matching, subclassification, and direct regression adjustment.

### Probability and Statistics

Mathematical statistics used in experimental design.

### Comparing Several Groups

Comparing several groups in an experimental and observational setting and deciding whether differences that are found are likely to be real or due to chance.

### Power and Sample Size

Power and sample size will be introduced for several designs. Applications will include the design and analysis of clinical trials with continuous or binary endpoints.

### Blocking Techniques

Blocked designs, Latin squares, randomized incomplete block designs.

### Factorial Designs

Factorial, blocked factorial, and fractional factorial designs will be discussed.

### Split Plot Designs

Split plot designs will be discussed as an example of restricted randomization in the design of experiments.

# Course Books

## Optional

1. Statistics for Experimenters: Design, Innovation, and Discovery. Box, G.E.P., Hunter, J.S., Hunter, W.G. Wiley 2nd Ed. 2005

2. Design and Analysis of Experiments. Dean, A., and Voss, D. Springer. 1999. UofT link to electronic copy: http://go.utlib.ca/cat/2573215

3. Design of Observational Studies. Rosenbaum, P. R. Springer 2010. UofT link to electronic copy: http://go.utlib.ca/cat/7890274

4. Experiments: planning, analysis, and optimization. Wu, C.F.J., Hamada, M.S. Wiley, 2009, 2nd ed.

5. Causal Inference for Statistics, Social, and Biomedical Sciences. Imbens and Rubin. Cambridge University Press, 2015. UofT link to electronic copy: http://go.utlib.ca/cat/10127748

NB: Textbooks 2, 3, and 5 are available electronically through the UofT library (i.e., electronic copies of these textbooks are available at no extra cost).

# Course Materials, including lecture notes

# Evaluation

Students will be evaluated according to the University Assessment and Grading Practices Policy.

Undergraduate students will be evaluated according to the following marking scheme.

| Assessment | Weight | Date | Time |
|---|---|---|---|
| Term Test #1 | 20% | Oct. 8 | 14:00 |
| Term Test #2 | 20% | Nov. 19 | 14:00 |
| Draft/proposal project | 5% | TBD | TBD |
| Final project | 10% | Dec. 5 | 23:59 |
| Final Exam | 45% | Scheduled by Faculty | |

The tests will be written during class time (14:10 – 15:40) in a location to be announced.

You are allowed a two-sided 8-1/2" x 11" (standard letter size) hand-written aid sheet on the term test and a two-sided hand-written aid sheet on the final exam. You must bring your student identification to the term tests and the final exam.

You will not need to know R syntax on the tests and exam, but you will need to know how to interpret output from R.

# Class Schedule

## Marking concerns

Any requests to have marked work re-evaluated must be made in writing to the instructor within one week of the date the work was returned. The request must contain a justification for consideration.

## Missed Tests

• If a test is missed for a valid reason, you must submit documentation to the course instructor.

• If a test is missed for a valid medical reason, you must submit the University of Toronto Verification of Student Illness or Injury form to your instructor within one week of the test.

• The form will only be accepted as valid if the form is filled out according to the instructions on the form.

• The form must indicate that the degree of incapacitation on academic functioning is moderate, serious, or severe in order to be considered a valid medical reason for missing the term test. If the form indicates that the degree of incapacitation on academic functioning is negligible or mild then this will NOT be considered a valid medical reason.

• If a test is missed for a valid reason then half the weight of the test will be shifted to the other midterm and half will be shifted to the final exam. In this case the other term test will be worth 30% and the final exam will be worth 55%.

• If a student misses BOTH term tests for any reason then an oral exam with members of the teaching team will be scheduled at a mutually convenient time in lieu of the two term tests worth 40%.

• Students must complete at least one midterm test or oral exam. If a student misses both midterm tests and does not take an oral exam before the end of term then a grade of zero will be assigned to the term work.

• Other reasons for missing a test will require prior approval by your instructor. If prior approval is not received for a non-medical reason, then you will receive a term test grade of zero.
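
The reweighting rule for a single missed term test can be checked with a few lines of R. This is only an illustrative sketch: the function name `reweight_missed_test` and the labels in `weights` are hypothetical, and the numbers are taken from the marking scheme above.

```r
# Weights from the marking scheme (percent of the final grade)
weights <- c(test1 = 20, test2 = 20, proposal = 5, project = 10, exam = 45)

# Hypothetical helper: shift the weight of one missed term test,
# half to the other term test and half to the final exam
reweight_missed_test <- function(weights, missed) {
  other <- setdiff(c("test1", "test2"), missed)
  half <- weights[[missed]] / 2
  weights[[other]] <- weights[[other]] + half
  weights[["exam"]] <- weights[["exam"]] + half
  weights[[missed]] <- 0
  weights
}

new_weights <- reweight_missed_test(weights, missed = "test1")
new_weights        # the other term test becomes 30, the final exam 55
sum(new_weights)   # the weights still total 100
```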

## Late Project Submission

If the draft/proposal project or final project is submitted after the due date then a late penalty of 20% per day (i.e., 24 hours) will be applied to the part of the project handed in late. For example, if the draft/proposal project is submitted 5 days late (including weekend days) then you will receive a grade of zero for the draft/proposal.
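
The late penalty arithmetic can be sketched in R as follows. The function `late_penalty` is hypothetical, and it assumes that every started 24-hour period counts as a full day; the policy above does not say how partial days are treated, so confirm with the instructor.

```r
# Hypothetical helper: total late penalty as a fraction of the late part.
# Assumption: each started 24-hour period counts as one full day late.
late_penalty <- function(hours_late, penalty_per_day = 0.20) {
  days_late <- ceiling(max(hours_late, 0) / 24)
  min(penalty_per_day * days_late, 1)   # the penalty cannot exceed 100%
}

late_penalty(0)        # submitted on time: no penalty
late_penalty(30)       # two started days late: 40% penalty
late_penalty(5 * 24)   # five days late: 100% penalty, i.e., a grade of zero
```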

# Computing

We will use R for all examples. R is freely available for download at http://cran.r-project.org for Windows, Mac, and Linux operating systems. For the tests and exam, you will need to know how to interpret output from R.

## Jupyter Notebook

The Jupyter Notebook is an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text. (Ref: https://jupyter.org)

R can be run in a Jupyter notebook in any web browser by logging into https://utoronto.syzygy.ca with your UTORid.

To get started using R in a Jupyter notebook see this page

## RStudio

RStudio is a fantastic integrated development environment (IDE) for R. It is freely available at https://www.rstudio.com/products/rstudio/

I am assuming that students have never used R before. I will provide you with the R syntax for all examples in lecture, which should be sufficient for you to complete the practice problems.
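
As a rough illustration of the kind of R output you would be expected to interpret, the sketch below compares three groups with a one-way analysis of variance. It uses the `PlantGrowth` data set that ships with R, chosen here only for illustration; it is not taken from the course materials.

```r
# PlantGrowth ships with R: plant weights under a control and two treatments
data(PlantGrowth)

# One-way ANOVA comparing the three groups
fit <- aov(weight ~ group, data = PlantGrowth)

# ANOVA table: degrees of freedom, sums of squares, F statistic, and p-value
summary(fit)

# Group means, for reading alongside the ANOVA table
tapply(PlantGrowth$weight, PlantGrowth$group, mean)
```

On a test you would be given output like the table printed by `summary(fit)` and asked what it says about differences between the groups, not asked to write the code.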

