Difference between revisions of "BIST Introduction to Statistics 2016"

From Bioinformatics Core Wiki
(Created page with "__TOC__ == BIST "Introduction to Biostatistics" Course == ==== Online Resources ==== * Nature Web-collection "Statistics for Biologists": http://www.nature.com/collections/qg...")
 
Line 15: Line 15:
 
* VIB "Basic statistics in R" course. Tutorial and links.    https://www.bits.vib.be/index.php/training/180#download
 
* VIB "Basic statistics in R" course. Tutorial and links.    https://www.bits.vib.be/index.php/training/180#download
  
==== Comparison of two samples ====
+
* [https://www.edx.org/course/introduction-r-programming-microsoft-dat204x-0 Self-paced online course from Microsoft "Intro to R programming"]
* The t-test, paired or unpaired, in R >t.test (x,y, paired=TRUE). The t-test provides an exact test for the equality of the means of two normal populations with unknown, but equal, variances. The latter can be checked with F-test, or in R >var.test(x,y). https://en.wikipedia.org/wiki/Student's_t-test#Paired_samples
+
 
+
* Non-parametric tests. No assumption about variances and normality.
+
** Independent samples. The Wilcoxon rank-sum test, aka Mann-Witney test. https://en.wikipedia.org/wiki/Mann–Whitney_U_test. In R, >wilcox.test(x,y). H0= Ranks of means of two samples are not different.
+
** Paired samples. The Wilcoxon signed-Rank Test. In R, >wilcox.test(x,y, paired=TRUE). See http://vassarstats.net/textbook/ch12a.html.
+
** The Kolmogorov-Smirnov test. In R, >ks.test(x,y). https://en.wikipedia.org/wiki/Kolmogorov–Smirnov_test. If two samples have the same mean but different variance or/and shape/distribution, this test can spot it. It is more powerful than the Wilcoxon test. The statistic is calculated by finding the maximum absolute value of the differences between the two sample cumulative distribution functions. See http://www.physics.csbsju.edu/stats/KS-test.html.
+
 
+
==== Comparison of two microbiome samples ====
+
* New (2012) biostatistical methods for the analysis of microbiome data based on a fully parametric approach using all the data. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3527355/#pone.0052078-LaRosa1
+
** The use of a fully parametric model for these data has the benefit over alternative non-parametric approaches such as bootstrapping and permutation testing, in that this model is able to retain more information contained in the data.
+
** R package "HMP" is available. http://cran.r-project.org/web/packages/HMP/HMP.pdf. To install it: > source("http://www.bioconductor.org/biocLite.R"); biocLite("HMP")
+
 
+
== Online courses ==
+
* Ten rules for online learning. http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1002631
+
 
+
* Self-paced online course "Intro to Linux" https://www.edx.org/course/introduction-linux-linuxfoundationx-lfs101x-2
+
* Self-paced online course from Microsoft "Intro to R programming" https://www.edx.org/course/introduction-r-programming-microsoft-dat204x-0
+
 
* Self-paced online course "Introduction to Cloud Computing" https://www.edx.org/course/introduction-cloud-computing-ieeex-cloudintro-x-0
 
* Self-paced online course "Introduction to Cloud Computing" https://www.edx.org/course/introduction-cloud-computing-ieeex-cloudintro-x-0
 
* Self-paced online course from Microsoft "Data Science and Machine Learning Essentials" https://www.edx.org/course/data-science-machine-learning-essentials-microsoft-dat203x-0
 
* Self-paced online course from Microsoft "Data Science and Machine Learning Essentials" https://www.edx.org/course/data-science-machine-learning-essentials-microsoft-dat203x-0

Revision as of 09:04, 28 April 2016

BIST "Introduction to Biostatistics" Course

Online Resources

Bioinformatics Core Facility @ CRG — 2011-2024