Scraping images from a website with Python

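I've recently been working on animal recognition for a bot. Judging by the test data released by the organizers, the model is completely lost on simple line drawings of animals and recognition is very poor, so I need to scrape some line-drawing images myself.
As practice, I hand-wrote a scraper for the images on one particular website:

Here is the code: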

# -*- coding: utf-8 -*-
# Python 3
import os
import re
import time
import urllib.request


def schedule(a, b, c):
    """Show download progress.

    a: number of data blocks downloaded so far
    b: size of each data block
    c: size of the remote file
    """
    if c <= 0:  # size unknown (no Content-Length), nothing to report
        return
    per = 100.0 * a * b / c
    if per > 100:
        per = 100
    print('%.2f%%' % per)


def getHtml(url):
    page = urllib.request.urlopen(url)
    # read() returns bytes; decode so the regex below can work on text
    html = page.read().decode('utf-8', 'ignore')
    return html


def downloadImg(html):
    reg = r'src="(.+?\.jpg)" pic_ext'
    imgre = re.compile(reg)
    imglist = re.findall(imgre, html)
    # Name the folder after today's date
    t = time.localtime(time.time())
    foldername = '%d-%d-%d' % (t.tm_year, t.tm_mon, t.tm_mday)
    picpath = 'H:\\getpic\\pic\\%s' % foldername  # local download directory
    if not os.path.exists(picpath):  # create the directory if it does not exist
        os.makedirs(picpath)
    x = 0
    image = None  # stays None when no image matches the pattern
    for imgurl in imglist:
        target = picpath + '\\%s.jpg' % x
        print('Downloading image to location: ' + target + '\nurl=' + imgurl)
        image = urllib.request.urlretrieve(imgurl, target, schedule)
        x += 1
    return image


if __name__ == '__main__':
    print('''
***************************************
**      Welcome to use Spider        **
**     Created on  2016-09-08        **
**       @author:dcx                 **
***************************************''')
    html = getHtml("http://tieba.baidu.com/p/2460150866")
    downloadImg(html)
    print("Download has finished.")
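
To see what the regular expression in downloadImg actually picks up, here is a small offline check. The HTML snippet and the example.com URLs are made up for illustration; they only imitate the `pic_ext` attribute the pattern relies on, and no network access is needed.

```python
import re

# Hypothetical markup imitating what the pattern expects; not real page data.
# Each tag sits on its own line because "." does not match a newline.
sample_html = '\n'.join([
    '<img class="BDE_Image" src="http://example.com/a/cat.jpg" pic_ext="jpeg">',
    '<img src="http://example.com/b/logo.jpg">',
    '<img class="BDE_Image" src="http://example.com/c/dog.jpg" pic_ext="jpeg">',
])

# Same pattern as in downloadImg
img_pattern = re.compile(r'src="(.+?\.jpg)" pic_ext')
matched_urls = img_pattern.findall(sample_html)

for url in matched_urls:
    print(url)
```

Only the tags that carry `pic_ext` right after `src` are matched, so the logo image is skipped. This is also why the pattern is tied to one site's markup and probably needs adjusting for any other page.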

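However, what I actually want to scrape is Baidu Images. Those images come from many different websites, so the code above can't be used directly; I would need to get the URL of each image. I looked through all sorts of material and still couldn't figure it out, so in the end I simply pressed Ctrl+S to save the web page, which also saves the images on it.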
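
For the "get the URL of each image" step, one general option is to parse the `<img>` tags instead of relying on a site-specific regex. The following is only a minimal sketch using the standard library: the class name `ImgSrcCollector`, the helper `collect_image_urls`, and the sample HTML are all hypothetical, and it only works on HTML that is already in hand. Pages that load their results with JavaScript (which may well be the case for Baidu Images) might not contain the image URLs in the static HTML at all.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ImgSrcCollector(HTMLParser):
    """Collect the src of every <img> tag, resolved against the page URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.urls = []

    def handle_starttag(self, tag, attrs):
        if tag != 'img':
            return
        src = dict(attrs).get('src')
        if src:
            # urljoin makes relative paths absolute and leaves absolute URLs unchanged
            self.urls.append(urljoin(self.base_url, src))


def collect_image_urls(html, base_url):
    """Return all image URLs found in html (hypothetical helper)."""
    collector = ImgSrcCollector(base_url)
    collector.feed(html)
    return collector.urls


# Made-up HTML for illustration; the images point at different hosts
sample = '''
<img src="http://img.example.org/sketch/cat.png">
<img src="/static/dog.jpg">
<p>no image here</p>
<img alt="missing src">
'''
image_urls = collect_image_urls(sample, 'http://www.example.com/gallery/index.html')
for u in image_urls:
    print(u)
```

Each collected URL could then be passed to urlretrieve one by one, the same way the script above does for a single site.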

I'd still like to learn how to scrape the data from Baidu Images, though. I'll keep at it. To be continued!
