Biostatistics (BioE 439)
This course is designed for bioinformatics students. The industry standard in statistical software, which will be taught in this course, is R and S++. The application of basic algorithms and the theory behind the statistical analysis will be covered. Extensive examples and small projects will be used in order to learn how to use R and Java to accomplish bioinformatics tasks. Topics covered also include: sample analysis, interval-censored survival data analysis, longitudinal data analysis, multivariate analysis, theory of distributions in statistics, and experiment and design.
Topics covered:
- descriptive statistics
- hypothesis testing
- estimation
- confidence intervals
- t-tests
- chi-squared tests
- analysis of variance
- linear regression and correlation
Syllabus:
- Lecture 1: Introduction to Biostatistics
- Lecture 2: Descriptive Statistics
- Lecture 3: Basic Probability Concepts (1)
- Lecture 4: Basic Probability Concepts (2)
- Lecture 5: Probability distribution (1)
- Lecture 6: Probability distribution (2)
- Lecture 7: Some important sampling distributions (1)
- Lecture 8: Some important sampling distributions (2)
- Lecture 9: Estimation (1)
- Lecture 10: Estimation (2)
- Lecture 11: Hypothesis testing (1)
- Lecture 12: Hypothesis tests (2)
- Lecture 13: Hypothesis tests (3)
- Lecture 14: Analysis of variance (1)
- Lecture 15: Analysis of variance (2)
- Lecture 16: Analysis of variance (3)
- Lecture 17: Simple linear regression and correlation (1)
- Lecture 18: Simple linear regression and correlation (2)
- Lecture 19: Multiple regression and correlation (1)
- Lecture 20: Multiple linear regression and correlation (2)
- Lecture 21: Chi-square distribution and the analysis of frequencies (1)
- Lecture 22: Chi-square distribution and the analysis of frequencies (2)
- Lecture 23: Chi-square distribution and the analysis of frequencies (3)
We will normally post 2 lectures per week (on Mondays).
Prerequisites:
- Math 210 (Calculus III)
- Programming knowledge of Java (preferred), Perl or C++
Textbook:
W.Q. Daniel, Biostatistics – A foundation for analysis in the health sciences, Eighth edition, Wiley, ISBN 0-471-45654-3
Grading:
- Homework
- worth 100 points will be assigned each week.
- The homework will be posted on Wednesday and will be due the following Wednesday.
- For any week, we may post homework early. This will not affect the due date of the homework.
- Late homework will be accepted until the first Friday following the due date with a penalty of 20 points per day late. Homework will not be accepted after Friday.
- Comprehensive project (due on the same day as the final exam)
- Midterm and final exams
