I'm using Azure Machine Learning Studio to predict a column with a Two-Class Boosted Decision Tree and a Split Data module.
The diagram that I have assembled can be found here:
I'd like to find out which column in the dataset influences the prediction the most. In other words, which column changes the prediction result more than any other column in the dataset.
Sorry if this has been asked before, but I couldn't find a proper answer to this simple question.
Recommended answer

As mentioned before, Permutation Feature Importance does the trick. Attach the Permutation Feature Importance module to the training module, click its output port, and select Visualize to see the module's results. The figure above shows the list of features sorted in descending order of their permutation importance scores.
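To make the idea behind the score concrete, here is a minimal from-scratch sketch in plain Python. It is not the Azure module's implementation. The function names (`accuracy`, `permutation_importance`), the toy data, and the stand-in `model` are all assumptions made for illustration. The sketch shuffles one column at a time and records how much accuracy drops; a larger drop means the model relies more on that column.

```python
import random

def accuracy(model, X, y):
    """Fraction of rows for which the model predicts the correct label."""
    return sum(model(row) == label for row, label in zip(X, y)) / len(y)

def permutation_importance(model, X, y, n_repeats=10, seed=0):
    """Mean accuracy drop per column when that column's values are shuffled."""
    rng = random.Random(seed)
    baseline = accuracy(model, X, y)
    importances = []
    for col in range(len(X[0])):
        drops = []
        for _ in range(n_repeats):
            # Shuffle a copy of this column, leaving all other columns intact.
            shuffled = [row[col] for row in X]
            rng.shuffle(shuffled)
            X_perm = [row[:col] + [v] + row[col + 1:] for row, v in zip(X, shuffled)]
            drops.append(baseline - accuracy(model, X_perm, y))
        importances.append(sum(drops) / n_repeats)
    return importances

# Toy data (hypothetical): the label depends only on column 0; column 1 is noise.
data_rng = random.Random(42)
X = [[data_rng.random(), data_rng.random()] for _ in range(200)]
y = [int(row[0] > 0.5) for row in X]

# Stand-in for a trained classifier; in practice this would be your trained model.
model = lambda row: int(row[0] > 0.5)

scores = permutation_importance(model, X, y)
# Column indices sorted by descending importance, like the module's visualization.
ranking = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
print(ranking, scores)
```

Shuffling column 1 leaves every prediction unchanged here, so its score is zero, while shuffling column 0 breaks the link between feature and label and the accuracy drops sharply.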
A word of advice: be careful when interpreting permutation scores if you have highly correlated features.
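The following plain-Python sketch shows one way correlated features can distort the scores. Everything in it is hypothetical: the `importance` helper, the toy data, and the two stand-in models are assumptions for illustration, not anything from Azure ML Studio. The same signal is presented once as a single column and once as two identical columns that the model averages. In the duplicated case each copy receives a lower score, even though the underlying signal is just as important.

```python
import random

def accuracy(predict, X, y):
    """Fraction of rows for which the prediction matches the label."""
    return sum(predict(row) == label for row, label in zip(X, y)) / len(y)

def importance(predict, X, y, col, n_repeats=20, seed=0):
    """Mean accuracy drop when column `col` is shuffled."""
    rng = random.Random(seed)
    baseline = accuracy(predict, X, y)
    total = 0.0
    for _ in range(n_repeats):
        values = [row[col] for row in X]
        rng.shuffle(values)
        X_perm = [row[:col] + [v] + row[col + 1:] for row, v in zip(X, values)]
        total += baseline - accuracy(predict, X_perm, y)
    return total / n_repeats

# Toy signal (hypothetical): the label is 1 when the signal exceeds 0.5.
data_rng = random.Random(1)
signal = [data_rng.random() for _ in range(500)]
y = [int(s > 0.5) for s in signal]

# Case 1: the signal appears in a single column.
X_single = [[s] for s in signal]
single_model = lambda row: int(row[0] > 0.5)

# Case 2: the signal is duplicated into two perfectly correlated columns,
# and a stand-in model averages them before thresholding.
X_dup = [[s, s] for s in signal]
dup_model = lambda row: int((row[0] + row[1]) / 2 > 0.5)

score_single = importance(single_model, X_single, y, col=0)
score_dup = [importance(dup_model, X_dup, y, col=c) for c in (0, 1)]
print(score_single, score_dup)
```

When one copy is shuffled, the untouched copy still carries part of the information, so the accuracy drop for each duplicated column is smaller than for the single column. A low score on a correlated feature therefore does not necessarily mean the feature is unimportant.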
For more info, see: standupdata/category/permutation-feature-importance/ gallery.cortanaintelligence/Experiment/Permutation-Feature-Importance-5