Articles on: Training models

Column effect importance

Column effect importance in machine learning refers to the measurement of the effect of each feature (column) in a dataset on the outcome being predicted. The purpose is to understand how each feature influences the prediction and to identify which features have the greatest impact on the model's accuracy.

For example, consider a dataset that contains information about marketing and sales. The columns might include the number of clicks, the gender of customers, the location, and the facebook campaign id. Column effect importance would help determine which of these factors has the most significant effect on the ROAS, and how each feature interacts with the other features in the dataset.

A Sample Column Importance Graph

NextBrain performs SHAP analysis to quantify the contribution of each feature to the prediction of an instance. These values can be positive or negative, indicating the increase or decrease in the predicted outcome that is due to a particular feature. Overall, SHAP values provide a flexible and powerful tool for understanding and interpreting the predictions of machine learning models, and can be useful in a wide range of applications. You will get recommendations about which columns contribute poorly to the prediction on the Model Summary panel.

A Sample Shap Analysis

There is no ideal ratio for column effect importance, as the impact of each feature on the model's predictions can vary greatly based on the specific dataset and type of model being used. It's important to carefully evaluate and interpret the results of column effect importance calculations to make informed decisions about which features to keep or remove from a model.

Updated on: 02/02/2023

Was this article helpful?

Share your feedback


Thank you!