SIRE 506 Design and Analysis in Genetic Studies of Human Diseases
Principles of genetics studies of complex and Mendelian disorders in human population, basic concepts in genetic epidemiology; concepts of study design, basic statistical analysis, interpretation, inference, strengths and limitations of study designs; overview of related laboratory techniques, genotyping technology, sequencing technology, concepts of linkage analysis, genetic association studies , genome-wide association studies, next-generation sequencing technologies for genetic studies, analysis of genetic data using computer software, concepts of application of genome technology in personalized medicine and public health
Academic Year: 2024
Date: Aug 19 – Dec 9, 2024
Course Learning Objectives
- Differentiate mendelian and complex genetic diseases, and explain the factors influencing the disease expression, such as heritability, penetrance, variable expressivity, genetic heterogeneity, phenocopy, pleiotropy.
- Can perform exploratory data analysis and create a descriptive summary statistics report similar to the “Table 1 summary of study population demographics” using R.
- Can systematically describe the various methodological approaches in epidemiological research.
- Can differentiate the descriptive from analytic study design, experimental vs observational, explain the different between cross-sectional study and case-control study, and highlight the strengths and weaknesses through real-life examples.
- Can explain what ecological study design is, and when ecological fallacy can happen. And, properly choose the optimal design to answer the questions in genetic research.
- Can design a genetic association study to investigate common genetic diseases, including creating a case record form to collect data. Drawing an example from genome-wide association study of height
- Can explain the advantage of cohort studies over case-control studies, and give some examples of how cohort study design is used in genetic studies. Drawing examples from gene expression study to identify SPM in HNSCC and the three studies in Manolio 2009 (https://www.nature.com/articles/ng0109-5).
- Can perform basic calculation of common descriptive statistic estimates e.g. mean, standard deviation, variance, standard error. Can calculate the posterior probability using the Bayes Theorem.
- Can choose the appropriate statistical tests for quantitative variables and categorical variables, set up the hypothesis testing process, and interpret the statistical significance of the results from the p-value and 95% confidence interval.
- Can perform clustering analysis and interpret the results from Heirachical Clustering and K-Mean methods
- Can propely recode the data for additive, dominant, recessive, and co-dominant genetic model; interpret the results from each model.
- Can perform linear regression analysis and interpret the results of a multivariate linear regression analysis
- Can analyze and interpret the results from the linear regression analysis from the study of quantitative phenotypes
- Can perform logistic regression analysis and interpret the results of a multivariate logistic regression analysis
- Can analyze and interpret the results from the logistic regression analysis from a case-control study of binary phenotypes
- Can describe the objectives of common study designs used in genetic studies, namely GSEA, RNASeq, ChipSeq, Methylation, GWAS/WGS, eQTL.
- Can perform quality control check of high-throughput SNP genotyping and create a report
- Can perform GWAS and write a report on the results.
- Can perform GSEA and pathway analysis
- Can prioritize variants for report according to ACMG recommendation
- Can explain how the finding from genetic studies are used in precision medicine and public health settings
Scoring
- Formative Exam: 20%
- Homework: 50%
- Github Learning Profiles (5%)
- Exploratory Data Analysis Report (10%)
- Prediction of the Titanic survivor (10%)
- Genetic association analysis (10%)
- Variant Prioritization (10%)
- Overall quality of code – reproducibility (5%)
- Final Exam: 30%
Grading
- A > 90%
- B+ [80-90)
- B [70-80)
- C < 70
Course coordinator:
Asst. Prof. Dr. Bhoom Suktitipat, MD, PhD
Graduate Program in Medical Bioinformatics
Department of Biochemistry, Faculty of Medicine Siriraj Hospital &
Integrative Computational BioScience Center, Mahidol University
Instructors
- Assoc. Prof. Dr. Vardodom Charoensawan, PhD
- Dr.Piroon Jenjaroenpong, PhD
Reference Books
- Laird NM, Lange C. The Fundamentals of Modern Statistical Genetics. Springer Science & Business Media; 2010. 226 p.
- Thomas DC. Statistical methods in genetic epidemiology. Oxford University Press; 2004. 458 p.
- Ziegler A, König IR. A statistical approach to genetic epidemiology: Concepts and Applications. John Wiley & Sons; 2010. 522 p.
- Rosner, Bernard. 2015. Fundamentals of Biostatistics. 8th edition. Boston, MA: Cengage Learning.
Handouts
Please see this link