Features tf-idf: text feature extraction scikit tf=idf Models & Methods cosine similarity for vector space models Bag of Words (used in Bayesian Spam Filtering, document term frequency models) Statistical Definitions Accuracy & Precision (relevant to classification, measuring efficacy of learned models)