您现在的位置是：首页 > Python

当前栏目

sklearn cross validation_python sklearn

sklearn,cross,validation,python

2025-03-13 21:27:30 时间

大家好，又见面了，我是你们的朋友全栈君。

KFold通过提供index来给你确定不同组的训练集以及测试的index，来构造交叉验证数据集。

参数(n, n_folds=3, shuffle=False, random_state=None)
n为总数
n_folds为分为多少个交叉验证集
shuffle为是否随机
random_state设置随机因子

from sklearn.cross_validation import KFold

import numpy as np

X = np.arange(24).reshape(12,2)

array([[ 0,  1],
       [ 2,  3],
       [ 4,  5],
       [ 6,  7],
       [ 8,  9],
       [10, 11],
       [12, 13],
       [14, 15],
       [16, 17],
       [18, 19],
       [20, 21],
       [22, 23]])

1.shuffle=False

kf = KFold(12,n_folds=5,shuffle=False)
for i,(train_index,test_index) in enumerate(kf):
    print(i,train_index,test_index)

0 [ 3  4  5  6  7  8  9 10 11] [0 1 2]
1 [ 0  1  2  6  7  8  9 10 11] [3 4 5]
2 [ 0  1  2  3  4  5  8  9 10 11] [6 7]
3 [ 0  1  2  3  4  5  6  7 10 11] [8 9]
4 [0 1 2 3 4 5 6 7 8 9] [10 11]

kf = KFold(12,n_folds=5,shuffle=False)
for i,(train_index,test_index) in enumerate(kf):
    print(i,train_index,test_index)

0 [ 3  4  5  6  7  8  9 10 11] [0 1 2]
1 [ 0  1  2  6  7  8  9 10 11] [3 4 5]
2 [ 0  1  2  3  4  5  8  9 10 11] [6 7]
3 [ 0  1  2  3  4  5  6  7 10 11] [8 9]
4 [0 1 2 3 4 5 6 7 8 9] [10 11]

2.shuffle=True,俩次不同了

kf = KFold(12,n_folds=5,shuffle=True)
for i,(train_index,test_index) in enumerate(kf):
    print(i,train_index,test_index)

0 [ 0  1  2  3  4  6  7  8 11] [ 5  9 10]
1 [ 0  2  3  4  5  8  9 10 11] [1 6 7]
2 [ 0  1  2  3  4  5  6  7  9 10] [ 8 11]
3 [ 0  1  2  5  6  7  8  9 10 11] [3 4]
4 [ 1  3  4  5  6  7  8  9 10 11] [0 2]

kf = KFold(12,n_folds=5,shuffle=True)
for i,(train_index,test_index) in enumerate(kf):
    print(i,train_index,test_index)

0 [ 0  3  4  6  7  8  9 10 11] [1 2 5]
1 [ 1  2  5  6  7  8  9 10 11] [0 3 4]
2 [ 0  1  2  3  4  5  8  9 10 11] [6 7]
3 [ 0  1  2  3  4  5  6  7  8 10] [ 9 11]
4 [ 0  1  2  3  4  5  6  7  9 11] [ 8 10]

3.shuffle=True,random_state赋值，俩次又相同了

kf = KFold(12, n_folds=5, shuffle=True, random_state=5)
for i,(train_index,test_index) in enumerate(kf):
    print(i,train_index,test_index)

0 [ 0  1  3  4  6  8  9 10 11] [2 5 7]
1 [ 0  1  2  3  5  6  7  8 10] [ 4  9 11]
2 [ 0  2  3  4  5  6  7  9 10 11] [1 8]
3 [ 1  2  3  4  5  6  7  8  9 11] [ 0 10]
4 [ 0  1  2  4  5  7  8  9 10 11] [3 6]

kf = KFold(12, n_folds=5, shuffle=True, random_state=5)
for i,(train_index,test_index) in enumerate(kf):
    print(i,train_index,test_index)

0 [ 0  1  3  4  6  8  9 10 11] [2 5 7]
1 [ 0  1  2  3  5  6  7  8 10] [ 4  9 11]
2 [ 0  2  3  4  5  6  7  9 10 11] [1 8]
3 [ 1  2  3  4  5  6  7  8  9 11] [ 0 10]
4 [ 0  1  2  4  5  7  8  9 10 11] [3 6]

发布者：全栈程序员栈长，转载请注明出处：https://javaforall.cn/193297.html原文链接：https://javaforall.cn

猜你喜欢

mac如何卸载python_如何彻底卸载pycharm
Python 调用接口进行文件上传的踩坑记录
pycharm查找替换快捷键_python查找指定字符串并替换
Python-drf前戏38-前端Vue
Python学习笔记：几种排序算法
python中itchat_python打招呼的代码
python学习
python报错no module named_pycharm报错no module named
使用Python暴力激活成功教程密码
miniconda可以运行python吗_pycharm怎么配置anaconda环境
【过程记录】python环境离线迁移
Python基础13-模块的使用
一个提供公告和打赏功能的 django 应用插件 django-tctip
django在docker环境中断点调试
pycharm无法使用第三方库_python第三方库安装失败
SPC（Statistical Process Control 统计过程控制）图——Python+JS实现
推荐！国内气象人开发的基于Python的Grads文件解析利器
Python基础24-MySQL模块pymysql
Python基础08-名称空间与作用域
Python 学生信息管理系统——文章中源码100%真实有效—–如何将类、初始化属性、模块、循环判断、静态方法等一系列知识点结合起来做一个项目「建议收藏」