ValueError: X has 597 features, but SVC is expecting 605 features as input.
I got this error when using a trained sklearn svm.SVC model to predict on the test data.
2022/04/11
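- Code:
from sklearn import svm
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.multiclass import OneVsRestClassifier

tfidfTrain = TfidfVectorizer(stop_words=stopWordList, min_df=0.02).fit_transform(dataTrain_list)
tfidfTest = TfidfVectorizer(
    stop_words=stopWordList,
    min_df=0.02,
    # The same error occurred before max_features was added.
    # I assumed the error came from the training and test sets having
    # different numbers of features, so I tried to force the test set
    # to have as many features as the training set.
    # The error still occurred.
    max_features=605,
).fit_transform(dataTest_list)

# Train the model on the training set
model = OneVsRestClassifier(svm.SVC(kernel='linear'))
clf = model.fit(tfidfTrain, labelTrain_list)
ytest_pred = clf.predict(tfidfTest)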
- Error:
ValueError: X has 597 features, but SVC is expecting 605 features as input.
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-14-8d3a4697f978> in <module>()
     30 
     31 print("对测试集进行模型评估:")
---> 32 ytest_pred = clf.predict(tfidfTest)
     33 print("classification_report: ")
     34 print(classification_report(labelTest_list, ytest_pred))

~\AppData\Roaming\Python\Python37\site-packages\sklearn\multiclass.py in predict(self, X)
    441             argmaxima = np.zeros(n_samples, dtype=int)
    442             for i, e in enumerate(self.estimators_):
--> 443                 pred = _predict_binary(e, X)
    444                 np.maximum(maxima, pred, out=maxima)
    445                 argmaxima[maxima == pred] = i

~\AppData\Roaming\Python\Python37\site-packages\sklearn\multiclass.py in _predict_binary(estimator, X)
     98         return estimator.predict(X)
     99     try:
--> 100         score = np.ravel(estimator.decision_function(X))
    101     except (AttributeError, NotImplementedError):
    102         # probabilities of the positive class

~\AppData\Roaming\Python\Python37\site-packages\sklearn\svm\_base.py in decision_function(self, X)
    754         transformation of ovo decision function.
    755         """
--> 756         dec = self._decision_function(X)
    757         if self.decision_function_shape == "ovr" and len(self.classes_) > 2:
    758             return _ovr_decision_function(dec < 0, -dec, len(self.classes_))

~\AppData\Roaming\Python\Python37\site-packages\sklearn\svm\_base.py in _decision_function(self, X)
    512         # NOTE: _validate_for_predict contains check for is_fitted
    513         # hence must be placed before any other attributes are used.
--> 514         X = self._validate_for_predict(X)
    515         X = self._compute_kernel(X)
    516 

~\AppData\Roaming\Python\Python37\site-packages\sklearn\svm\_base.py in _validate_for_predict(self, X)
    596                 order="C",
    597                 accept_large_sparse=False,
--> 598                 reset=False,
    599             )
    600 

~\AppData\Roaming\Python\Python37\site-packages\sklearn\base.py in _validate_data(self, X, y, reset, validate_separately, **check_params)
    583 
    584         if not no_val_X and check_params.get("ensure_2d", True):
--> 585             self._check_n_features(X, reset=reset)
    586 
    587         return out

~\AppData\Roaming\Python\Python37\site-packages\sklearn\base.py in _check_n_features(self, X, reset)
    399         if n_features != self.n_features_in_:
    400             raise ValueError(
--> 401                 f"X has {n_features} features, but {self.__class__.__name__} "
    402                 f"is expecting {self.n_features_in_} features as input."
    403             )

ValueError: X has 597 features, but SVC is expecting 605 features as input.
I haven't read the source code properly yet; just recording the problem for now.
2022/04/12:
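- Solution:
Changing fit_transform() to transform() for the test data fixes it. The test data must be transformed by the same TfidfVectorizer that was fitted on the training data, so that vectorizer has to be kept in a variable rather than created inline. Fitting a second vectorizer on the test set learns a different vocabulary (597 terms here instead of the 605 learned from the training set), and max_features only caps the vocabulary size; it does not make the two vocabularies match.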
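The fix can be sketched roughly as follows. This is a minimal example under stated assumptions, not the original notebook: the toy documents and labels are hypothetical stand-ins for dataTrain_list, labelTrain_list and dataTest_list, and stop_words is omitted because stopWordList is not shown in the post.

```python
from sklearn import svm
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.multiclass import OneVsRestClassifier

# Hypothetical toy data standing in for dataTrain_list / labelTrain_list / dataTest_list.
train_docs = [
    "cats purr and chase mice",
    "dogs bark and chase cats",
    "stocks rise as markets rally",
    "markets fall as stocks slide",
    "rain and wind hit the coast",
    "sunny weather warms the coast",
]
train_labels = ["pets", "pets", "finance", "finance", "weather", "weather"]
test_docs = ["dogs chase mice", "stocks rally", "unseen words only here"]

# Keep a reference to the vectorizer so its vocabulary can be reused later.
vectorizer = TfidfVectorizer(min_df=0.02)
tfidf_train = vectorizer.fit_transform(train_docs)  # learns vocabulary and IDF weights
tfidf_test = vectorizer.transform(test_docs)        # reuses them without refitting

# Train on the training matrix, then predict on the test matrix.
clf = OneVsRestClassifier(svm.SVC(kernel="linear")).fit(tfidf_train, train_labels)
y_test_pred = clf.predict(tfidf_test)

print(tfidf_train.shape, tfidf_test.shape)  # both matrices have the same column count
print(y_test_pred)
```

Words in the test set that were never seen during training are simply ignored by transform(), so the test matrix always has the column count the classifier expects.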
Pages consulted while looking for the solution:
- Python scikit svm “ValueError: X has 62 features p
- What is the difference between TfidfVectorizer.fit_transform and tfidf.transform?
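To make the difference between the two calls concrete, here is a small illustration. The documents are made up for the example; only TfidfVectorizer and its fit_transform(), transform() and vocabulary_ members come from scikit-learn.

```python
from sklearn.feature_extraction.text import TfidfVectorizer

# Made-up documents, chosen so the two sets use different words.
train_docs = ["red apple", "green apple", "red grape"]
test_docs = ["green banana", "yellow banana"]

# fit_transform() learns a vocabulary from the data it is given.
fitted = TfidfVectorizer()
x_train = fitted.fit_transform(train_docs)

# A second vectorizer fitted on the test set learns its own vocabulary,
# so its columns no longer correspond to the training columns.
refit = TfidfVectorizer()
x_test_wrong = refit.fit_transform(test_docs)

# transform() on the already fitted vectorizer keeps the training columns.
x_test_right = fitted.transform(test_docs)

print(sorted(fitted.vocabulary_))  # ['apple', 'grape', 'green', 'red']
print(sorted(refit.vocabulary_))   # ['banana', 'green', 'yellow']
print(x_train.shape, x_test_wrong.shape, x_test_right.shape)  # (3, 4) (2, 3) (2, 4)
```

The 4-versus-3 column mismatch here is the same kind of mismatch as the 605-versus-597 one in the error message.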