Week 01: Tools for Computing and Reporting
-
1. set up computing tools R, RStudio, and MikTex.
2. use some R commands and functions.
3. create an R Markdown document.
4. install and load R packages.
- Introduction: class structure, topics, assessments, and logistics.
- Install R [free download] , RStudio[free download] , and MiTex[download]
- Getting started with R: basic operations, vectors, data frames (R data sets).
- Install and load R libraries
- Instruction for Installing R and RStudio and load packages (Appendices A, B, and C.)
- Quick Tour of R Markdown
- Getting started with R, RStudio, RMarkdown, RPubs, and GitHub. [HTML] [RMD] [PDF]
- RMarkdown template of statistical report . [HTML] [RMD] [PDF]
- Setting-up GitHib [HTML] [RMD] [TXT]
- Writing Mathematical Equations in RMarkdown [HTML] [RMD] [PDF]
- Reading Assignments
- Practice R code in Chapters 1-2.
- Introduction to R Markdown
- A (very) short introduction to R
- Create an R Markdown Document and convert to HTML, Word and PDF, respectively
- No written assignment due for this week!
Week 02: Nonparametric Bootstrap Method
-
1. justify when to use sampling distribution and bootstrap sampling distribution.
2. construct bootstrap confidence interval of population parameters such as means, standard deviation, etc.
3. write basic R code to perform bootstrap analysis and make basic plots.
- Review of simple random sampling (SRS)
- Sampling from empirical distribution: Bootstrap sampling
- Construction of Bootstrap confidence intervals for the population mean
- R Applications
- Class note: [HTML] , [PDF] , [RMD]
- Data Sets Used in Class Note : (1). WCU Heights [txt, Simulated Data] (2). Iris Data [txt]
- Lab note: Basics of R Graphics [HTML] , [PDF] , [RMD]
- Reading Assignments
- Chapter 4 of Roff's eBook (pages 66-70 and 74-76.)
- Class Note: Understand the concept and practice examples
- Written Assignments
- Week #2 Assignment: Application of Bootstrap Method [Link].
- Wee #2 Assignment Due: Sunday, 11:30 PM
Week 03: Simple Linear Regression Models
-
1. perform residual analysis of the SLR.
2. Interpret the regression coefficient and goodness of fit measures.
3. write R basic R code to perform bootstrap regression analysis.
- Linear regression model – structure and assumptions.
- Diagnostics, R square, and interpretations
- Bootstrap Regression
- Course Materials
- Reading Assignments
- Chapter 2 of Ciaburro's eBook
- Chapter 4 of Sahay's Book (using Excel and MINITAB but with good interpretations)
- Week #3 Assignment : [HTML] [RMD]
- Due: 11:30 PM, Sunday.
- D2L Drop Box(link)
Week 04: Multiple Linear Regression Models
-
1. use factor() to handle categorical variable.
2. perform variable selection: step-wise method
3. perform residual analysis of the SLR.
4. write a mini report to summarize the output of MLR.
- Dummy variables
- Assumptions, goodness-of-fit, and diagnostics
- Variable selection methods
- Summarizing and interpretations
- Notes and R Applications:
- Reading Assignments
- Chapter 3 of Ciaburro's eBook
- Chapter 5 of Sahay's Book: multiple linear regression)
- Blog post: with R code
- Week #4 Assignment : [HTML] [RMD] [PDF]
- Due: 11:30 PM, Sunday.
- D2L Drop Box (link)
Week 05: Bootstrapping Regression Models
-
1. take bootstrap records
2. use bootstrap residuals to define bootstrap responses
3. justify the validity of inferences based on bootstrap and the normal regression modeling
4. write a formal report.
- Bootstrapping records
- Bootstrapping residuals
- Bootstrap confidence intervals of regression coefficients
- R Applications: Bootstrap MLR with R
- Reading Assignments
- Chapter 3 of Ciaburro's eBook
- Chapter 5 of Sahay's Book: multiple linear regression)
- Bootstrap Rehression: with R code
- Report of Project #1 : [HTML] [RMD] [PDF]
- Due: 11:30 PM, Sunday.
- D2L Drop Box(link)
Week 06: Simple Logistic Regression Models
-
1. correct identify the response variable for a logistic model
2. fit a logistic model with R
3. interpret of regression coefficients
4. make prediction with the logistic regression model.
- Practical question and model formulation
- Model structure - log odds regression
- Interpretation of the regression coefficients
- R Applications: Case study – logistic regression with R
- Class note: Simple Logistic Regression [HTML] [RMD] [PDF]
- Data Sets Used in Class Note: [Pima Indians Diabetes2]
- Read sections 2.5 and 2.6 of the required textbook.
- Week #6 Assignment : [HTML] [RMD] [PDF]
- Due on Sunday : 11:30 PM
- D2L Drop Box (login)
Week 07: Multiple Logistic Regression Models
-
1. inspect and transform predictor variables
2. perform automatical vaiable selection in loditic regression with R
3. interpret of odds ratios
4. interpret prediction and validation performance.
- Models with only categorical predictor variables–Dummy variable
- Variable selection methods and criteria
- Interpretation of coefficients
- R Applications: Case study – Multiple logistic regression with R
- Notes
- Chapter 4 of Ciaburro's eBook: Pages 134 - 154. Very nice interpretation for the odds ratios
- Week #7 Assignment
[HTML]
[RMD]
[PDF]
- Due: 11:30 PM, Sunday.
- D2L Drop Box (link)
Week 08: Logistic Regression: Predictive Modeling
-
1. calculate the sensitivity and specificity
2. calculate predictive error from the confusion matrix
3. perform k-fold cross-validation for model selection
4. plot ROC curve.
- Estimating ture positive and false positive rates
- Confusion matrix and predictive error
- Cross-validation and ROC curve
- Notes
- Reading Assignments
- Chapter 4 of Ciaburro's eBook: Pages 134 - 154. Very nice interpretation for the odds ratios
- This blog discussed most of the topics we covered this week
- Variable Selection in Logistic Regression with R code.
- Week #8 Assignment : [HTML] [RMD] [PDF]
- Due: 11:30 PM, Sunday.
- D2L Drop Box (link)
Week 09: Poisson Regression Models
-
1. perform the regression analysis on counts
2. perform the regression analysis on rates
3. Interpret Poisson regression coefficients
- structure of Poisson model
- regression on counts and rates
- model-based visualization
- Notes
- Reading Assignments
- This blog discussed Poisson regression with R codes
- A nice article with more than what we covered this week.
- Week #9 Assignment: [HTML] [RMD] [PDF]
- Due: Sunday.
- D2L Drop Box (link)
Week 10: Dispersed Poisson Regression Models
-
1. calculate dispersion parameter
2. build Quasi-Poisson regression models on counts and rates
3. interpret quasi-Poisson models
- Dispersion parameter
- Dispersed Poisson regression on counts and rates
- Modeling diagnostics
- Notes
- SQL with SAS: I encourage you to think about using the free SAS Studio via SAS® OnDemand for Academics [link]. You need to create a profile and use it to run your SAS code in the SAS cloud.
- Basics of Relational Databases [PPT]
- Basics of SQL[SAS code in txt format].
Three relational data tables used in the code are: [plots.csv], [species.csv], [surveys.csv]
- Reading Assignments
- This blog discussed Poisson regression with R codes
- A nice article with more than what we covered this week.
- Week #10 Assignment : [HTML] [RMD] [PDF]
- Due:Sunday.
- D2L Drop Box (link)
Week 11: Concepts of Time Series
-
1. identify the types of times series.
2. create time series objects.
3. build the baseline forecast models.
4. calculate the acuracy measures.
- Technical terms of TS
- Type of TS data
- Time series objects and the baseline forecasting methods
- Calculation of accuracy measures.
- Class Note:
- Reading Assignments
- scan the first 2 chapters for the concepts used in the class note
- An excellent time series textbook: chapters 2 and 3.
- Week #11 Assignment : [HTML] [RMD] [PDF]
- Due: Sunday.
- D2L Drop Box(link)
Week 12: Moving Average and LOESS Smoothing
-
1. decompose time series into trend, seasonal, and random commponents.
2. use STL frame work to decompose TS.
3. forecast with decomposition.
- Components time series
- Classical decomposition of TS
- STL decomposition of TS
- Forecast with decomposition.
- Class Note
- Reading Assignments
- Week #12 Assignment : [HTML] [RMD] [PDF]
- Due: Sunday.
- D2L Drop Box(link)
Week 13: Exponential Smoothing Models
-
1. recognize and identify ETS models.
2. fit SES, trend and seasonal models to appropriate data.
3. calculate prediction errors.
- ETS framework
- Simple exponential smoothing model
- Holt's trend models
- Holt-Winter's trend and seasonal smoothing models.
- R Applications: Case study
ASSIGNMENTS:
- Reading Assignments
- Week #13 HW : [HTML] [RMD] [PDF]
- Due: 11:30 PM, Wednesday.
- D2L Drop Box(link)
Week 14: Finalizing Project Report and Prepare Oral Presentation
- Specifics
- Prepare a Powerpoint presentation and submit it after the oral presentation.
- Keep your presentation within 15 minutes including Q&A.
- Project Presentation Rubrics [link]
- Tentative Schedule (alphabetical order in the first name)