Fall 2024
1. Platforms and Tools for Data Science
2. Technologies for Communication
3. Data Acquisition and Integration
4. Use of EDA and Data Visualization
5. Methods for Feature Engineering
6. Statistical Models for Data Science
7. Methods of Cross-validation
8. From Statistics to Machine Learning
9. Overview of Supervised ML Algorithms
10. Ensemble Supervised ML Algorithms
11. Common Unsupervised Algorithms
12. Algorithms for Anomaly Detection
13. Model-based Feature Extraction
14. Presentation Preparation
Module 01
Toggle Menu
Syllabus
DS Tools
RamCloud (SAS)
SAS Studio
Tableau Public
RMarkdown
Github
RPubs
Data
Zoom
Office Hours
Monday
3:00PM-5:00PM
Tuesday
11:00AM-12:00PM
Thursday
2:00PM-5:00PM
D2L
Email Me
1. Platforms and Technical Tools for Data Science
Topics, Class Notes and Code
Assignments and Side Readings
Course logistics, policies and project guideline document:
[PDF]
Software and Platforms
Computing:
R/Rstudio and SAS Studio
Viz: R graphic libraries & Tableau Public
Repository and collaboration: GitHub
Free web server: RPubs
Tech writing and reporting: Markdown and LaTex:
[Intro to RMarkdown]
Data Science - A Big Picture
Class Note 1
: What is data science?
[PPT]
Supplemental Note
:
[HTML]
[PDF]
[RMD]
Data science is still in high demand:
the U.S. Bureau of Labor Statistics.
Data Science Ethics:
[ASA: Ethical Guidelines for Statistical Practice]
Installation
R and RStudio
If you have not used R for 6 months, please install the newer version.
Tableau Public
MikTex
Accounts Creation
SAS Studio [via SAS Cloud]
Tableau Public
GitHub
RPubs
More on Markdown
Basic Markdown Syntax
Math Equation in Markdown
[HTML]
[PDF]
No written assignment for this week.