报错解决:SyntaxError: Non-UTF-8 code starting with ‘\xe7‘
今天抓取数据时使用re对数据进行提取时遇到的问题:syntaxError: Non-UTF-8 code starting with '\xe7',意思是有的中文字符无法转成utf-8的形式,如图所示:
这个是因为抓取的数据中有的中文字符识别不了,相应的数据如下:
"""
class="sale-num">0</span>件</div> </div> </dd> </dl> <dl class="item " data-id="658360334789"> <dt class="photo"> <a class="J_TGoldData" href="//item.taobao.com/item.htm?id=658360334789" target="_blank" data-gold-url="/inshopse" data-gold-data='{"gokey":"at_bucketid=&srppage=1&scid=&lf_aclog=20-658360334789-24-null-1672116384&?src=shopsystem--33.7.235.200&sort=popular:des&q=%E5%A8%BC%EE%86%BD%E7%A5%A6%E6%BF%82%E5%AE%A0%EE%97%8A%E7%BC%87%E7%95%8C%E7%B2%A7%E9%8F%88%EF
%BF%BD&tab=all&ss_bucket=18&rank_src=inshop_pc_tb&buyernick=tb089523630&shop_id=103889497&navigator=property&s=0&n=24&app=inshop&outfmt=json&stats_click=&rn=bf364af4330ca3ede3b1b5c333e63464", "cna": "BNQVHHXMNzMCATrRitnbrIZf","bc_type":"c" }' > <img src="//img.alicdn.com/bao/uploaded/i3/1672116384/O1CN01FXdpY61x1vFNFCEMr_!!0-item_pic.jpg_240x240.jpg" > </a> </dt> <dd class="detail"> <a class="item-name J_TGoldData" href="//item.taobao.com/item.htm?id=658360334789" target="_blank" data-gold-url="/inshopse" data-gold-data='{"gokey":"at_bucketid=&srppage=1&sci
d=&lf_aclog=20-658360334789-24-null-1672116384&?src=shopsystem--33.7.235.200&sort=popular:des&q=%E5%A8%BC%EE%86%BD%E7%A5%A6%E6%BF%82%E5%AE%A0%EE%97%8A%E7%BC%87%E7%95%8C%E7%B2%A7%E9%8F%88%EF%BF%BD
&tab=all&ss_bucket=18&
rank_src=inshop_pc_tb&buyernick=tb089523630&shop_id=103889497&navigator=property&s=0&n=24&app=inshop&outfmt=json&stats_click=&rn=bf364af4330ca3ede3b1b5c333e63464", "cna": "BNQVHHXMNzMCATrRitnbrIZf","bc_type":"c" }' > 秋季街头短款外套2021年新款女纯色百搭拉链连帽长袖卫衣直筒上衣</a> <div class="attribute"> <div class="cprice-area"><span class="symbol">¥</span><span class="c-price">89.00</span></div> <div class="sprice-area"><span class="symbol">¥</span><span class="s-price">158.00 </span></div> <!--rsdata.showSaleData: true--> <div class="sale-area">已售:<span class="sale-num">1</span>件</div> </div> </dd> </dl> <dl class="item last" data-id="6585372
76231"> <dt class="photo"> <a class="J_TGoldData" href="//item.taobao.com/item.htm?id=658537276231" target="_blank" data-gold-url="/inshopse" data-gold-data='{"gokey":"at_bucketid=&srppage=1&scid=&lf_aclog=21-658537276231-24-null-1672116384&?src=shopsystem-
-33.7.235.200&sort=popular:des&q=%E5%A8%BC%EE%86%BD%E7%A5%A6%E6%BF%82%E5%AE%A0%EE%97%8A%E7%BC%87%E7%95%8C%E7%B2%A7%E9%8F%88%EF%BF%BD&tab=all&ss_bucket=18&rank_src=inshop_pc_tb&buyernick=tb089523630&shop_id=103889497&navigator=property&s=0&n=24&app=inshop&outfmt=json&stats_click=&rn=bf364af4330ca3ede3b1b5c333e63464", "cna": "BNQVHHXMNzMCATrRitnbrIZf","bc_type":"c" }' > <img src="//img.alicdn.com/bao/uploaded/i4/1672116384/O1CN010VjOa51x1vFLT2NET_!!0-item_pic.jpg_240x240.jpg" > </a> </dt> <dd class="detail"> <a class="item-name J_TGoldData" href="//item.taobao.com/item.htm?id=658537276231" target="_blank" data-gold-url="/inshopse" d
ata-gold-
data='{"gokey":"at_bucketid=&srppage=1&scid=&lf_aclog=21-658537276231-24-null-1672116384&?src=shopsystem-
-33.7.235.200&sort=popular:des&q=%E5%A8%BC%EE%86%BD%E7%A5%A6%E6%BF%82%E5%AE%A0%EE%97%8A%E7%BC%87%E7%95%8C%E7%B2%A7%E9%8F%88%EF%BF%BD&tab=all&ss_bucket=18&rank_src=inshop_pc_tb&buyernick=tb089523630&shop_id=103889497&navigator=property&s=0&n=24&app=inshop&outfmt=json&stats_click=&rn=bf364af4330ca3ede3b1b5c333e63464", "cna": "BNQVHHXMNzMCATrRitnbrIZf","bc_type":"c" }' > 2022秋冬欧美街头短款修身棉服女立领拉链加厚外套保暖休闲棉衣</a> <div class="attribute"> <div class="cprice-area"><span class="symbol">¥</span><span class="c-price">98.00</span></div> <div class="sprice-area"><span class="symbol">¥</span><span class="s-price">159.00 </span></div> <!--rsdata.showSaleData: true--> <div class="sale-area">已售:<span class="sal
e-num">100+</span>件</div> </div> </dd> </dl> </div> <div class="item3line1"> <dl class="item " data-id="659042741963"> <dt class="photo"> <a class="J_TGoldData" href="//item.taobao.com/item.htm?id=659042741963" target="_blank" data-gold-url="/inshopse" data-gold-data='{"gokey":"at_bucketid=&srppage=1&scid=&lf_aclog=22-659042741963-24-null-1672116384&?src=shopsystem--33.7.235.200&sort=popular:des&q=%E5%A8%BC%EE%86%BD%E7%A5%A6%E6%BF%82%E5%AE%A0%EE%97%8A%E7%BC%87%E7%95%8C%E7%B2%A7%E9%8F%88%EF%BF%BD&tab=all&ss_bucket=18&rank_src=inshop_pc_tb&buyernick=tb089523630&shop_id=103889497&navigator=property&s=0&n=24&app=inshop&outfmt=
json&stats_click=&rn=bf364af4330ca3ede3b1b5c333e63464", "cna": "BNQVHHXMNzMCATrRitnbrIZf","bc_type":"c" }' > <img src="//img.alicdn.com/bao/uploaded/i4/1672116384/O1CN01kJIsAT1x1vFLvhtUu_!!0-item_pic.jpg_240x240.jpg" > </a> </dt> <dd class="detail"> <a class="item-name J_TGoldData" href="//item.taobao.com/item.htm?id=659042741963" target="_blank" data-gold-url="/inshopse" data-gold-data='{"gokey":"at_bucketid=&srppage=1&
;scid=&lf_aclog=22-659042741963-24-null-1672116384&?src=shopsystem--33.7.235.200&sort=popular:des&q=%E5%A8%BC%EE%86%BD%E7%A5%A6%E6%BF%82%E5%AE%A0%EE%97%8A%E7%BC%87%E7%95%8C%E7%B2%A7%E9%8F%88%EF%BF%BD&tab=all&ss_bucket=18&rank_src=inshop_pc_tb&buyernick=tb089523630&shop_id=103889497&navigator=property&s=0&n=24&app=inshop&outfmt=json&stats_click=&rn=bf364af4330ca3ede3b1b5c333e63464", "cna": "BNQVHHXMNzMCATrRitnbrIZ
f","bc_type":"c" }' > 2021冬新款镂空麻花条纹针织开衫女长袖连帽宽松拉链纯色休闲外套</a> <div class="attribute"> <div class="cprice-area"><span class="symbol">¥</span><span class="c-price">99.00</span></div> <div class="sprice-area"><span class="symbol">¥</span><span class="s-price">175.00 </span></div> <!--rsdata.showSaleData: true--> <div class="sale-area">已售:<span class="sale-num">0</span>件</div> </div> </dd> </dl> <dl class="item " data-id="659055837980"> <dt class="photo"> <a class="J_TGoldData" href="//item.taobao.com/item.htm?id=659055837980" target="_blank" data-gold-url="/inshopse" data-gold-data='{"gokey":"at_bucketid=&srppage=1&scid=&lf_aclog=23-659055837980-24-null-1672116384&?src=shopsystem--33.7.235.200&sort=popular:des&q=%E5%A8%BC%EE%86%BD%E7%A5%A6%E6%BF%82%E5%AE%A0%EE%97%8A%E7%BC%87%E7%95%8C%E7%B2%A7%E9%8F%88%EF%BF%BD&tab=all&ss_bucket=18&rank_src=inshop_pc_tb&buyernick=tb089523630&shop_id=103889497&navigator=property&s=0&n=24&app=inshop&outfmt=json&stats_click=&rn=bf364af4330ca3ede3b1b5c333e63464", "cna": "BNQVHHXMNzMCATrRitnbrIZf","bc_type":"c" }' > <img src="//img.alicdn.com/bao/uploaded/i3/1672116384/O1CN01KNlfAg1x1vFOCFwTe_!!0-item_pic.jpg_240x240.jpg" > </a> </dt> <dd class="detail"> <a class="item-name J_TGoldData" href="//item.taobao.com/item.htm?id=659055837980" target="_blank" data-gold-url="/inshopse" data-gold-data='{"gokey":"at_bucketid=&srppage=1&scid=&lf_aclog=23-659055837980-24-null-1672116384&?src=shopsystem--33.7.235.200&sort=popular:des&q=%E5%A8%BC%EE%86%BD%E7%A5%A6%E6%BF%82%E5%AE%A0%EE%97%8A%E7%BC%87%E7%95%8C%E7%B2%A7%E9%8F%88%EF%BF%BD&tab=all&ss_bucket=18&rank_src=inshop_pc_tb&buyernick=tb089523630&shop_id=103889497&navigator=property&s=0&n=24&app=inshop&outfmt=json&stats_click=&rn=bf364af4330ca3ede3b1b5c333e63464", "cna": "BNQVHHXMNzMCATrRitnbrIZf","bc_type":"c" }' > 2021欧美街头风高腰紧身纯色短款上衣女长袖圆领拉链开衫外套洋气</a> <div class="attribute"> <div class="cprice-area"><span class="symbol">¥</span><span class="c-price">65.00</span></div> <div class="sprice-area"><span class="symbol">¥</span><span class="s-price">142.00 </span></div> <!--rsdata.showSaleData: true--> <div class="sale-area">已售:<span class="sale-num">6</span>件</div> </div> </dd> </dl> <dl class="item last" data-id="659059105719"> <dt class="photo"> <a class="J_TGoldData" href="//item.taobao.com/item.htm?id=659059105719" target="_blank" data-gold-url="/inshopse" data-gold-data='{"gokey":"at_bucketid=&srppage=1&scid=&lf_aclog=24-659059105719-24-null-1672116384&?src=shopsystem--33.7.235.200&sort=popular:des&q=%E5%A8%BC%EE%86%BD%E7%A5%A6%E6%BF%82%E5%AE%A0%EE%97%8A%E7%BC%87%E7%95%8C%E7%B2%A7%E9%8F%88%EF%BF%BD&tab=all&ss_bucket=18&rank_src=inshop_pc_tb&buyernick=tb089523630&shop_id=103889497&navigator=property&s=0&n=24&app=inshop&outfmt=json&stats_click=&rn=bf364af4330ca3ede3b1b5c333e63464", "cna": "BNQVHHXMNzMCATrRitnbrIZf","bc_type":"c" }' > <img src="//img.alicdn.com/bao/uploaded/i1/1672116384/O1CN01YqbdA81x1vFJe0PMa_!!0-item_pic.jpg_240x240.jpg" > </a> </dt> <dd class="detail"> <a class="item-name J_TGoldData" href="//item.taobao.com/item.htm?id=659059105719" target="_blank" data-gold-url="/inshopse" data-gold-data='{"gokey":"at_bucketid=&srppage=1&scid=&lf_aclog=24-659059105719-24-null-1672116384&?src=shopsystem--33.7.235.200&sort=popular:des&q=%E5%A8%BC%EE%86%BD%E7%A5%A6%E6%BF%82%E5%AE%A0%EE%97%8A%E7%BC%87%E7%95%8C%E7%B2%A7%E9%8F%88%EF%BF%BD&tab=all&ss_bucket=18&rank_src=inshop_pc_tb&buyernick=tb089523630&shop_id=103889497&navigator=property&s=0&n=24&app=inshop&outfmt=json&stats_click=&rn=bf364af4330ca3ede3b1b5c333e63464", "cna": "BNQVHHXMNzMCATrRitnbrIZf","bc_type":"c" }' > 2022秋冬新款短款卫衣女欧美街头宽松套头运动健身半拉链翻领上衣</a> <div class="attribute"> <div class="cprice-area"><span class="symbol">¥</span><span class="c-price">69.00</span></div> <div class="sprice-area"><span class="symbol">¥</span><span class="s-price">155.00 </span></div> <!--rsdata.showSaleData: true--> <div class="sale-area">已售:<span class="sale-num">76</span>件</div> </div> </dd> </dl> </div> <div class="pagination"> <a class="disable">上一页</a> <a class="page-cur">1</a> <a class="J_SearchAsync" href="//shop103889497.taobao.com/search.htm?input_charset=gbk&mid=w-23677803207-0&wid=23677803207&path=%2Fsearch.htm&search=y&searcy_type=item&s_from=newHeader&ssid=s5-e&keyword=%E6%BD%AE%E6%B5%81%E5%A5%B3%E8%A3%85%E7%BE%BD%E7%BB%92%E6%9C%3F&pageNo=2#anchor">2</a> <a class="J_SearchAsync" href="//shop103889497.taobao.com/search.htm?input_charset=gbk&mid=w-23677803207-0&wid=23677803207&path=%2Fsearch.htm&search=y&searcy_type=item&s_from=newHeader&ssid=s5-e&keyword=%E6%BD%AE%E6%B5%81%E5%A5%B3%E8%A3%85%E7%BE%BD%E7%BB%92%E6%9C%3F&pageNo=3#anchor">3</a> <a class="J_SearchAsync next" href="//shop103889497.taobao.com/search.htm?input_charset=gbk&mid=w-23677803207-0&wid=23677803207&path=%2Fsearch.htm&search=y&searcy_type=item&s_from=newHeader&ssid=s5-e&keyword=%E6%BD%AE%E6%B5%81%E5%A5%B3%E8%A3%85%E7%BE%BD%E7%BB%92%E6%9C%3F&pageNo=2#anchor">下一页</a> <form action="//shop103889497.taobao.com/search.htm" method="get"> <input type="hidden" name="input_charset" value="gbk"> <input type="hidden" name="mid" value="w-23677803207-0"> <input type="hidden" name="wid" value="23677803207"> <input type="hidden" name="path" value="%2Fsearch.htm"> <input type="hidden" name="search" value="y"> <input type="hidden" name="searcy_type" value="item"> <input type="hidden" name="s_from" value="newHeader"> <input type="hidden" name="ssid" value="s5-e"> 到第 <input type="text" value="1" size="3" name="pageNo"> 页 <button type="submit">确定</button> </form> <!--END OF pagination--> </div> </div> </div>
"""
解决方法:在脚本最上方添加
# coding=utf-8即可
报错解决:SyntaxError: Non-UTF-8 code starting with ‘\xe7‘相关推荐
- 【npm i 报错解决方法】npm ERR! code ERESOLVEnpm ERR!npm ERR! While resolving: by-web@1.2.2npm ERR!
[npm i 报错解决方法]npm ERR! code ERESOLVE npm ERR! ERESOLVE unable to resolve dependency tree npm ERR! np ...
- Python-PyCharm 报错解决:ImportError: cannot import name 'InteractiveConsole' from 'code'
此文首发于我的个人博客:Python-PyCharm 报错解决:ImportError: cannot import name 'InteractiveConsole' from 'code' - z ...
- 已解决(Python语法报错)SyntaxError: invalid syntax
已解决(Python语法报错)SyntaxError: invalid syntax 文章目录 报错信息 报错翻译 报错原因 解决方法 千人全栈VIP答疑群联系博主帮忙解决报错 报错信息 粉丝群里面一 ...
- JS报错解决:SyntaxError: Unexpected token 《 in JSON at position 0
ThinkPHP5.1的环境要求如下: PHP >= 5.6.0 PDO PHP Extension MBstring PHP Extension 最近下载了tp 5.1.19来玩,造轮子难免会 ...
- 报错解决:TypeError: Object type class 'str' cannot be passed to C code
此文首发于我的个人博客:报错解决 TypeError Object type class 'str' cannot be passed to C code - zhang0peter的个人博客 下午在 ...
- Reids报错解决:Job for redis-server.service failed because the control process exited with error code.
此文首发于我的个人博客:Reids报错解决 Job for redis-server.service failed because the control process exited with er ...
- no identity found Command /usr/bin/codesign failed with exit code 1 报错解决方法
no identity found Command /usr/bin/codesign failed with exit code 1 报错解决方法 参考文章: (1)no identity foun ...
- Redis 启动报错 QForkMasterInit: system error caught. error code=0x000005af 解决
title: Redis 启动报错 QForkMasterInit system error caught error code=0x000005af 解决 date: 2022-03-16 16:2 ...
- 码云git push报错 DeployKey does not support push code 解决办法
码云git push报错 DeployKey does not support push code 解决办法 首先生成公钥去码云添加公钥有具体教程 添加公钥 一顿操作之后测试一下 git push 嗯 ...
最新文章
- 北京大学自考计算机应用本科,北京大学自学考试本科2019年还能报考吗
- vc6.0连接mysql数据库
- 用C#创建COM组件全过程
- 微软彻底拥抱 Python!
- CentOS7.9下实战安装MySQL5.7
- QT添加MySQL驱动依赖
- 产品经理,设计师,前端工程师必备的绘图工具(原型图,思维导图,UML,流程图,架构图)
- 创建型模式之简单工厂模式
- 北京中国石油大学计算机考研分数线,中国石油大学(北京)2018年考研复试基本分数线...
- 打开Word提示向程序发送命令时出现问题怎么办
- dh模型表matlab,建立DH模型的三种方法以及区别
- 实现一个简单的Database1
- Pale Moon 15.3 - Firefox“苍月”优化版发布
- body与html 会有间隙,css – thead和tbody之间的间距
- 【转】SpringMVC的工作原理图
- 编程资料 -C# 多线程
- 计算机导论以python为舟大纲,清华大学出版社-图书详情-《计算机科学导论——以Python为舟(第3版)》...
- 可视化系列讲解:css2.5D动画->帧动画
- 【论文笔记】A Reinforcement Learning Method for Multi-AGV Scheduling in Manufacturing
- 题目 2224: Glenbow Museum
热门文章
- 使用 SAP UI5 Smart Chart 控件轻松绘制十数种不同类型的专业图表试读版
- 哈尔滨工业大学计算机考研难吗,哈尔滨工业大学(专业学位)计算机技术考研难吗...
- quartus ii 增量编译
- 《测量助理》最新版本V3.0.220618发布更新
- trans系列是sci几区_如何看SCI期刊属于几区
- java解决包依赖冲突
- 使用ipp静态库,ipp-samples在linux下的make过程
- 我的awk常用命令备忘 xargs备忘
- Java 姓名脱敏的一点点改进 针对大于三个字 或叠字
- Linux ssh无密登陆