
Collinearity in Transaction Dataset

Dataset Summary


  • 2,000 customer records: age, in-store credit, email on file (yes or no), distance to store

  • Online and in-store sales data

  • Satisfaction survey responses: ratings of service and products


Content


1. Visualize associations between variables with scatterplots and histograms.

2. Measure associations using covariance.

3. Transform non-normally distributed data.

4. Compute correlation coefficients for ordinal responses.

5. Fit a linear model.

6. Check for collinearity between variables.

7. Remediate collinearity, using the variance inflation factor (VIF) as the measure.
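
Steps 2, 5, 6 and 7 can be sketched in base R as follows. This is a minimal illustration on simulated data, not the author's analysis: the data frame `df`, the variables `x1`, `x2`, `x3`, `y`, and the helper `vif_manual` are all hypothetical names introduced here. The helper computes VIF as 1 / (1 - R^2), where R^2 comes from regressing each predictor on the others; for numeric predictors, `car::vif()` applied to the fitted model should report the same quantity.

```r
# Simulated stand-in data (hypothetical; not the transaction dataset itself).
set.seed(42)
n  <- 2000
x1 <- rnorm(n)
x2 <- x1 + rnorm(n, sd = 0.1)   # almost a copy of x1, so the two are collinear
x3 <- rnorm(n)                  # independent predictor
y  <- 1 + 2 * x1 + 0.5 * x3 + rnorm(n)
df <- data.frame(y, x1, x2, x3)

# Step 2: covariance depends on scale; correlation rescales it to [-1, 1].
cov_x1_x2 <- cov(df$x1, df$x2)
cor_x1_x2 <- cor(df$x1, df$x2)

# Step 5: fit a linear model with all predictors.
fit_full <- lm(y ~ x1 + x2 + x3, data = df)

# Steps 6 and 7: VIF for each predictor, computed from first principles.
# Requires at least two predictors so each one has something to regress on.
vif_manual <- function(data, predictors) {
  sapply(predictors, function(p) {
    others <- setdiff(predictors, p)
    r2 <- summary(lm(reformulate(others, response = p), data = data))$r.squared
    1 / (1 - r2)
  })
}

vif_full <- vif_manual(df, c("x1", "x2", "x3"))

# One possible remedy: drop one of the collinear pair and re-check.
fit_reduced <- lm(y ~ x1 + x3, data = df)
vif_reduced <- vif_manual(df, c("x1", "x3"))

print(round(vif_full, 1))
print(round(vif_reduced, 1))
```

A common rule of thumb treats a VIF above roughly 5 to 10 as a sign of problematic collinearity, though the cutoff is a convention rather than a hard rule. Dropping a variable is only one remedy; combining the correlated variables into a single composite is another.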


Tools


R + RStudio


Library


library(gridExtra)

library(ggplot2)

library(car)

library(gpairs)

library(corrplot)

library(gplots)

library(psych)

library(plotly)


Crystal Wang @ 2017
