基于sklearn中的决策树函数和上述的训练与测试集，搜索探究最优（准确率最高）的最大深度设定（取值范围1到10）和叶子节点最小样本数(取值范围1到10)设定，给出两个参数的最优值及它们对应的准确率。注意：两个参数同时最优，而不是固定一个参数的情况下另外一个达到最优。

```python from sklearn.tree import DecisionTreeClassifier from sklearn.datasets import load_iris from sklearn.model_selection import train_test_split from sklearn.metrics import accuracy_score def find_optimal_parameters(X_train, y_train, X_test, y_test): best_depth = 0 best_min_samples_leaf = 0 best_accuracy = 0 for depth in range(1, 11): for min_samples_leaf in range(1, 11): clf = DecisionTreeClassifier(max_depth=depth, min_samples_leaf=min_samples_leaf) clf.fit(X_train, y_train) y_pred = clf.predict(X_test) accuracy = accuracy_score(y_test, y_pred) if accuracy > best_accuracy: best_accuracy = accuracy best_depth = depth best_min_samples_leaf = min_samples_leaf return best_depth, best_min_samples_leaf, best_accuracy # 加载鸢尾花数据集 iris = load_iris() X = iris.data y = iris.target # 划分训练集和测试集 X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42) best_depth, best_min_samples_leaf, best_accuracy = find_optimal_parameters(X_train, y_train, X_test, y_test) print(f"最优的最大深度为: {best_depth}") print(f"最优的叶子节点最小样本数为: {best_min_samples_leaf}") print(f"对应的准确率为: {best_accuracy}") ``` 额外需要注意的逻辑点： 1. 数据的划分是否合理，不同的划分方式可能会影响结果。 2. 模型的评估指标是否只考虑了准确率，是否需要综合其他指标。 3. 代码中的随机数种子设置，不同的随机数可能导致结果有一定的波动。 [2024-11-26 21:27:00 | AI写代码神器 | 432点数解答]

人生没有白走的路，每一步都算数！Ctrl + D 收藏本站，开发、写文章快人一步~

全部AI / 热门 / AI写代码神器

服务商

更多选项

快捷项

其他AI