使用训练好的sklearn svm.SVC的模型,去对测试数据预测的时候报错了
2022/04/11

  • 代码:
tfidfTrain = TfidfVectorizer(stop_words=stopWordList,min_df= 0.02).fit_transform(dataTrain_list)tfidfTest = TfidfVectorizer(stop_words=stopWordList,min_df= 0.02,max_features = 605 #之前没加也是同样的报错#觉得是训练集和测试集的特征数量不一样才报错,#打算控制测试集的特征和训练集一样,#结果还是报错).fit_transform(dataTest_list)#用训练集训练模型
model = OneVsRestClassifier(svm.SVC(kernel='linear'))
clf = model.fit(tfidfTrain, labelTrain_list)ytest_pred = clf.predict(tfidfTest)
  • 报错:
    ValueError: X has 597 features, but SVC is expecting 605 features as input.
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-14-8d3a4697f978> in <module>()30 31 print("对测试集进行模型评估:")
---> 32 ytest_pred = clf.predict(tfidfTest)33 print("classification_report: ")34 print(classification_report(labelTest_list, ytest_pred))~\AppData\Roaming\Python\Python37\site-packages\sklearn\multiclass.py in predict(self, X)441             argmaxima = np.zeros(n_samples, dtype=int)442             for i, e in enumerate(self.estimators_):
--> 443                 pred = _predict_binary(e, X)444                 np.maximum(maxima, pred, out=maxima)445                 argmaxima[maxima == pred] = i~\AppData\Roaming\Python\Python37\site-packages\sklearn\multiclass.py in _predict_binary(estimator, X)98         return estimator.predict(X)99     try:
--> 100         score = np.ravel(estimator.decision_function(X))101     except (AttributeError, NotImplementedError):102         # probabilities of the positive class~\AppData\Roaming\Python\Python37\site-packages\sklearn\svm\_base.py in decision_function(self, X)754         transformation of ovo decision function.755         """
--> 756         dec = self._decision_function(X)757         if self.decision_function_shape == "ovr" and len(self.classes_) > 2:758             return _ovr_decision_function(dec < 0, -dec, len(self.classes_))~\AppData\Roaming\Python\Python37\site-packages\sklearn\svm\_base.py in _decision_function(self, X)512         # NOTE: _validate_for_predict contains check for is_fitted513         # hence must be placed before any other attributes are used.
--> 514         X = self._validate_for_predict(X)515         X = self._compute_kernel(X)516 ~\AppData\Roaming\Python\Python37\site-packages\sklearn\svm\_base.py in _validate_for_predict(self, X)596                 order="C",597                 accept_large_sparse=False,
--> 598                 reset=False,599             )600 ~\AppData\Roaming\Python\Python37\site-packages\sklearn\base.py in _validate_data(self, X, y, reset, validate_separately, **check_params)583 584         if not no_val_X and check_params.get("ensure_2d", True):
--> 585             self._check_n_features(X, reset=reset)586 587         return out~\AppData\Roaming\Python\Python37\site-packages\sklearn\base.py in _check_n_features(self, X, reset)399         if n_features != self.n_features_in_:400             raise ValueError(
--> 401                 f"X has {n_features} features, but {self.__class__.__name__} "402                 f"is expecting {self.n_features_in_} features as input."403             )ValueError: X has 597 features, but SVC is expecting 605 features as input.

还没好好看源码,先记录一下

2022/04/12:

  • 解决:

把测试数据的fit_transform()改为transform()就可以了

查找解决方法过程:

  • Python scikit svm “ValueError: X has 62 features p
  • TfidfVectorizer.fit_transfrom和tfidf.transform有什么区别?

ValueError: X has 597 features, but SVC is expecting 605 features as input.相关推荐

  1. ValueError: X has 2 features, but LogisticRegression is expecting 5 features as input.

    问题背景 用python的sklearn库做逻辑回归模型训练后,用一些数值去预测结果时报错,已知是多项式的逻辑回归模型 用的是二阶的边界函数画出来的抛物线 代码如下 pay1和pay2是用来预测y结果 ...

  2. 【TensorFlow】ValueError: Shape must be rank 1 but is rank 0 for ' ’ with input shapes: [].问题

    基于TensorFlow训练mnist数据集出现如下错误: 检测代码,发现是偏置设置格式错误导致. 1.错误代码: # 定义权重和偏置 n_input = 784 n_output = 10 weig ...

  3. SVM ValueError: y should be a 1d array, got an array of shape (1, 250) instead. Found input variable

    阅读前请看一下:我是一个热衷于记录的人,每次写博客会反复研读,尽量不断提升博客质量.文章设置为仅粉丝可见,是因为写博客确实花了不少精力.希望互相进步谢谢!! 文章目录 阅读前请看一下:我是一个热衷于记 ...

  4. ML-支持向量:SVM、SVC、SVR、SMO原理推导及实现

    目录 1.导出目标 2拉格朗日转换 3对偶问题: 4求对偶问题 5 求b 6 得出模型 6.1 f(x)的约束条件: 7 核函数 7.1 软间隔 7.2 松弛变量: 7.3 KKT约束 8 SMO求a ...

  5. Bag of Features (BOF)图像检索算法

    FROM:http://blog.csdn.net/loadstar_kun/article/details/39450137 1.首先,我们用surf算法生成图像库中每幅图的特征点及描述符. 2.再 ...

  6. ACM-ICPC 2018 徐州赛区网络预赛 Features Track(STL二维map)

    Morgana is learning computer vision, and he likes cats, too. One day he wants to find the cat moveme ...

  7. pytorch简单代码实现deep dream图(即CNN特征可视化 features visualization)

      本文给出完整代码实现CNN特征的可视化输入图像,也就是简单的deep dream图,有助于更好的理解CNN工作原理,并掌握用梯度上升法生成满足要求输入图像的技术.更清晰美观的deep dream图 ...

  8. Android Kernel Features

    2019独角兽企业重金招聘Python工程师标准>>> Android Kernel Features Jump to: navigation, search Contents 1 ...

  9. vSphere 6.5 High Availability New Features – Proactive HA(vSphere 6.5高可用性新功能–主动HA)

    vSphere 6.5 High Availability New Features – Proactive HA vSphere 6.5高可用性新功能–主动HA http://www.vmwarea ...

  10. 【NVMe2.0b 14-8】Set Features(下篇)

    Set Features 5.27.1.13Host Controlled Thermal Management (Feature Identifier 10h), (可选) 5.27.1.14Non ...

最新文章

  1. 【iOS Tips】002-实现“简单单例模式”的几种方法?
  2. 告别“臃肿”,选择微服务(文末福利)
  3. p服务器不响应,无法加载资源:服务器响应状态为500
  4. 栈结构的总结【数据结构F】
  5. 揭秘5位爬藤“牛娃” 他们吸引藤校的到底是什么?
  6. Magento: 友好显示库存 User friendly product availability message in Magento
  7. CSS / CSS3(新增)选择器及优先级原则
  8. Markdown数学公式大全
  9. RestTemplate 配置http连接池
  10. Linux命令详解词典高频命令(1)
  11. java中点击按钮读取和写入文件实现登录和注册
  12. TypeError: Cannot set property ‘styles‘ of undefined
  13. UiPath常用元素识别
  14. JavaScript正则表达式学习笔记之一 - 理论基础
  15. linux gbk 语言包安装,linux 安装gbk字符集
  16. 2012 快速减肥 美女舒淇的懒人减肥法
  17. go语言不用complex包实现二值运算复数计算器
  18. 为什么选择高防DNS云解析?
  19. 凝思系统机器名怎么查看_凝思操作系统Custom Linx安装教程
  20. Spring Boot 集成 Activiti7(工作流)

热门文章

  1. True Liars (POJ - 1417)带权并查集+dp路径
  2. 作为学生,我是怎么半年赚到人生第一个三十万的
  3. HTML中Form表单的使用
  4. SIM800C实验记录之通过GPRS上网
  5. 关于react、vue的相关问题
  6. Form_Load():不要甩锅给我
  7. ES文件浏览器访问电脑共享文件夹,输入正确的密码、账号,始终在登录界面
  8. closest()方法简介
  9. python实现火车票查询_Python 实现火车票查询工具
  10. 基于FPGA的数字混频器设计与仿真