Lifetime data (survival data) are commonly encountered in
epidemiology, biostatistics, biomedicical and clinical studyies. Different from other type of data, survival times usualy have skewed distribution and include censoring observations. For example, in a cancer study, a patient may leave the country and stop visiting clinical center before responding to treatment. Statistical softwares SAS and R will be used to handle problems when hand calculation is not feasible. Techeniques developped in survival analysis are also used in reliability research of business and engineering. This class is useful for Actuarial Exams and important for biostatistics.
Instructor: Jimin Ding;
Office: Cupples I, Room 112A;
Email: jmding@math.wustl.edu
TA/Grader: Chunlin Fan email: cfan@artsci.wustl.edu
Office Hours: Mon. 4:00-5:30pm or by appointment
Topics covered:
- Survival and hazard functions, suitable parametric distributions for lifetime data, life-table techinque and population stndardization, Kaplan-Meier (product climit) survival curve estimation, nonparametric hypothesis testing for censored data, log-rank test, parametric regression models for lifetime data, logistic regression, Cox proportional hazard models, Cox model with time-independent and time-dependent covariates.
Prerequisites:
Math 309 and Math 320, or equivlanets.
Textbook:
Elisa Lee and John Wang
Statistical Methods for Survival Data Analysis, 3rd edition
John Wiley & Son, 2003, ISBN 0-471-36997-7
Exams:
In-class midterm will be given on Oct.18 (Thur.).
Final will be a take-home-exam and due on Dec. 13 (Thur.) by 5 pm. (The problem assignments will be given at least 2 weeks before the due date.) If you turn in your final within 24 hours after due date, the grade will be scaled by 60%. No final after 5 pm. of Dec. 14 will be graded.
Grades:
- There will be around five homework sets, one midterm, and a
takehome final. Grades will be based on the homework sets (around 50%),
on the midterm (around 20%), and on the takehome final (around 25%).
Electonic files of your programs are required within 24 hours of due day.
Efiles should be sent to TA through email directly and will be used to only random check.
If all calculation are done by hand, please also email TA saying no program is used. Submitting efiles/ sending emails will be counted for 5% of your grade.
Collaboration:
- Collaboration on homework is allowed and can be helpful (and fun).
Collaboration on homework is encouraged, both for using the computer and
for doing problems. However, you must do all written work by yourself,
both computer programs and answers to homework questions. You must also
write, enter, and run all programs yourself.
- If you collaborate with someone on a homework, list his or her
name in a note at the top of the first part of your homework.
Collaboration on Takehome Tests: There should be NO COLLABORATION on takehome tests, other than for
the mechanics of using the computer.
WARNING:
Make a copy of each homework before you hand it in !!
It may not be returned before you need to refer to it for the next
homework (or for the next test).
Format of your homework:
-
- Part I: Answer all questions part by part in writing or printing. Cite the output from appendix to support your argument and conclusion.
- If the answer to a problem requires a table or a plot that you need to refer to in your answers, add page numbers to your homework and make
references in part I by page number, such as ``The scatterplot
for part (c) is on page #X in the SAS output below.''
Alternatively, you could copy outputs and
include it in part I along with annotations as well as in
part III, but references by page number will usually be enough.
- If a problem asks you to do a statistical test, EXPLAIN CLEARLY what
the null hypothesis H_0 is, what test you used, what the P-value is, and
whether the data is significant, highly significant, or neither.
-
- Part II: Attach all SAS or R programs as appendix if you have used any of them to make your argument or conclusion in part I.
- All programs should be structured, or have enough comments, so that
someone who looks at the program a year from now can easily tell what the
program is doing. For program in SAS, it is even better if descriptive comments can be put in
title (or title2 or title3) statements, since
these will appear in the SAS output as well as in the SAS program.
Programs may be graded for understandability.
-
- Part III: Attach nessary outputs from your programs to support part I.
Some useful links and references:
Guide to USING SAS by Prof. Stanley Sawyer
SAS Online Printed Manuals: detailed decriptions about SAS procedures, including all available options and related statistical theories. See SAS Online Manuals by Prof. Sawyer for more details.
Using the SAS Windowing Environment: A Quick Tutorial,L. Hatcher, SAS Institute Press, 2001.
The statistical analysis of failure time data, 2nd Edition, J. D. Kalbfleisch and R. L. Prentice (2002), John Wiley & Sons.
Survival analysis, John Klein and Melvin Moeschberger, 2nd Ed., Springer, 2003. (http://www.biostat.mcw.edu/homepgs/klein/book.html)
Survival analysis: a self-learning text, David G. Kleinbaum, Springer, 1996.
Survival analysis using the SAS system: a practical guide, Paul D. Allison, SAS Institute Press,1995.
Good books for reviewing elementary statistics:
A Data-Based Approach to Statistics,R. L. Iman,
Duxbury Press, 1994.
Statistics and Data Analysis
from Elementary to Intermediate, A. J. Tamhane and D. D. Dunlop, Prentice-Hall, 2000.