Principal component analysis (PCA) in Python can speed up model training by reducing the number of features, and it can project high-dimensional data down to two or three dimensions for visualization. This tutorial covers both uses with scikit-learn.
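As a minimal sketch of the visualization use case, the snippet below projects a four-feature data set onto two principal components with scikit-learn. The choice of the bundled iris data set, the standardization step, and the variable names are assumptions made for illustration, not details taken from the tutorial.

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Load a small example data set: 150 samples, 4 numeric features (assumed here for illustration).
X, y = load_iris(return_X_y=True)

# PCA is sensitive to feature scale, so standardize each feature first.
X_scaled = StandardScaler().fit_transform(X)

# Keep the two directions of greatest variance.
pca = PCA(n_components=2)
X_2d = pca.fit_transform(X_scaled)

# Fraction of the total variance retained by the two components.
retained = pca.explained_variance_ratio_.sum()
print(X_2d.shape, round(retained, 3))
```

The two columns of `X_2d` can be passed to a scatter plot, or used as a smaller feature matrix for a downstream model when training time matters.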
Logistic regression is a classification technique that finds the best-fitting model to describe the relationship between a categorical dependent variable and one or more independent variables in a data set.
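To make the idea concrete, here is a small sketch of logistic regression with a single independent variable, fitted by plain gradient descent on the log loss. The toy data (hours studied versus pass/fail), the function names, and the learning rate and epoch count are all hypothetical; in practice a library implementation such as scikit-learn's `LogisticRegression` would normally be used instead.

```python
import math

def sigmoid(z):
    """Map any real number to a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(xs, ys, lr=0.1, epochs=2000):
    """Fit weight w and bias b by gradient descent on the mean log loss."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        # Gradient of the mean log loss with respect to w and b.
        errors = [sigmoid(w * x + b) - y for x, y in zip(xs, ys)]
        grad_w = sum(e * x for e, x in zip(errors, xs)) / n
        grad_b = sum(errors) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

# Hypothetical toy data: hours studied (independent) and pass/fail (dependent).
hours = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0]
passed = [0, 0, 0, 0, 1, 1, 1, 1]

w, b = fit_logistic(hours, passed)

def predict_proba(x):
    """Estimated probability of the positive class for input x."""
    return sigmoid(w * x + b)

print(round(predict_proba(1.0), 3), round(predict_proba(3.5), 3))
```

The fitted model returns a probability, and a class label follows from comparing that probability with a threshold such as 0.5.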
Term frequency-inverse document frequency (TF-IDF) is an NLP technique that measures how important each word is to a document relative to a collection of documents. Here’s how to create your own.
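A from-scratch version can be sketched in a few lines of standard-library Python. This sketch assumes the textbook definition, with term frequency as count divided by document length and inverse document frequency as log(N / document frequency). Whitespace tokenization, the function name `tf_idf`, and the three-sentence corpus are illustrative assumptions. Library implementations such as scikit-learn's `TfidfVectorizer` use a smoothed variant by default, so their numbers will differ.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: score} dict per document using unsmoothed TF-IDF."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents does each term appear?
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores

# Hypothetical three-document corpus.
corpus = [
    "the cat sat on the mat",
    "the dog chased the cat",
    "the bird sang",
]
scores = tf_idf(corpus)
print(scores[0])
```

Under this definition, a word that appears in every document, such as "the" here, gets a score of zero, while a word unique to one document scores highest within it.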