protogene-analysis

This repo contains the code to identify a yeast strain as beneficial, neutral, or deleterious based on colony size measurements.

analysis_pipeline.m

This script contains the code to identify the effect of a genetic perturbation in a yeast strain based on colony size measurements.

Input data for this script are from tables in a MySQL database.

This pipeline uses various functions in the sql_functions folder to perform the calculations and statistical analysis.

The pipeline contains colony size and growth rate analysis. However, the workflow described below only covers the colony size analysis.

Clean up data by removing values and fixing sample swaps.
Calculate fitness by taking the normalized colony size at the time point where colony size stops increasing.
Calculate fitness statistics (mean, median).
Perform statistical tests between the target and control strain fitnesses.
Correct p-values for multiple testing hypothesis using q-values.
Calculate the effect size thresholds as the 5th and 95th percentile of the control strain fitness.
Classify yeast strains as beneficial/neutral/deleterious based on q-value threshold and effect size.

This script contains the code to bulk upload raw colony sizes to a MySQL database.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Matlab-Colony-Analyzer-Toolkit @ d58d142		Matlab-Colony-Analyzer-Toolkit @ d58d142
bean-matlab-toolkit @ ab1885c		bean-matlab-toolkit @ ab1885c
sql_functions		sql_functions
utilities		utilities
.gitmodules		.gitmodules
README.md		README.md
analysis_pipeline.m		analysis_pipeline.m
upload_raw_cs_to_db.m		upload_raw_cs_to_db.m