Name		Name	Last commit message	Last commit date
parent directory ..
attachments		attachments
ANOVA.ipynb		ANOVA.ipynb
Mann-Whitney_U-test.ipynb		Mann-Whitney_U-test.ipynb
README.md		README.md
one_sample_t-test.ipynb		one_sample_t-test.ipynb
one_sample_z-test.ipynb		one_sample_z-test.ipynb
statistical_design.ipynb		statistical_design.ipynb
stats_helpers.py		stats_helpers.py
two-way_ANOVA.ipynb		two-way_ANOVA.ipynb
two_sample_equivalence_test.ipynb		two_sample_equivalence_test.ipynb
two_sample_t-test.ipynb		two_sample_t-test.ipynb

README.md

Statistical analysis examples

This directory contains notebooks with self-contained examples of common statistical analysis techniques.

The purpose is to provide at least one example for each of the test covered in the Inventory of statistical test recipes.

List of recipes

Z-tests:
- One sample $z$-test: one_sample_z-test.ipynb
Proportion tests
- One-sample $z$-test for proportions
- Binomial test
- Two-sample $z$-test for proportions
T-tests
- One-sample $t$-test: one_sample_t-test.ipynb
- Welch's two-sample $t$-test: two_sample_t-test.ipynb
- Two-sample $t$-test with pooled variance (not important)
- Paired $t$-test
Chi-square tests
- Chi-square test for goodness of fit
- Chi-square test of independence
- Chi-square test for homogeneity
- Chi-square test for the population variance
ANOVA tests
- One-way analysis of variance (ANOVA): ANOVA.ipynb
- Two-way ANOVA
Nonparametric tests
- Sign test for the population median
- One-sample Wilcoxon signed-rank test
- Mann-Whitney U-test: Mann-Whitney_U-test.ipynb
- Kruskal–Wallis analysis of variance by ranks
Resampling methods
- Simulation tests
- Two-sample permutation test
- Permutation ANOVA
Miscellaneous tests
- Equivalence tests: two_sample_equivalence_test.ipynb
- Kolmogorov–Smirnov test
- Shapiro–Wilk normality test

Template

For each statistical testing recipe, the notebook follows the same structure:

Data
Assumptions
Hypotheses
Power calculations
Test statistic
Sampling distribution
Examples
- Example 0: synthetic data when H0 is true
- Example 1: synthetic data when H0 is false
- Examples 2...n: other examples
Effect size estimates
Related tests
Discussion
Links

Why use synthetic data

We use "fake" data for the examples 0 and 1 in order to illustrate the canonical data type each statistical test is designed to detect. This is a good "sanity check" to use for any statistical analysis technique: before trying on your real-world dataset, try it on synthetic data to make sure it works as expected (is able to detect a difference when a difference exists, and correctly fails to reject H0 when no difference exists).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

examples

examples

README.md

Statistical analysis examples

List of recipes

Template

Why use synthetic data

Files

examples

Directory actions

More options

Directory actions

More options

Latest commit

History

examples

Folders and files

parent directory

README.md

Statistical analysis examples

List of recipes

Template

Why use synthetic data