selenium 模拟登陆古诗文网含验证码

ocr.py / 阿里云市场

import base64
import json
import urllib.request
from urllib import parse
import ssl
ssl._create_default_https_context = ssl._create_unverified_contextdef get_code():#修改API说明修改接口地址host = 'https://imgurlocr.market.alicloudapi.com/urlimages'method = 'POST'appcode = 'c657ecb2f1cd4f779ff4f8bf3ebb0af1'querys = ''bodys = {}url = host# 组装本地需要识别的 图片fp = open('./code.jpg', 'rb')res = base64.b64encode(fp.read()).decode()bodys['image'] = 'data:image/jpeg;base64,' + respost_data = urllib.parse.urlencode(bodys).encode(encoding='UTF8')request = urllib.request.Request(url, post_data)#根据API的要求，定义相对应的Content-Typerequest.add_header('Content-Type', 'application/x-www-form-urlencoded; charset=UTF-8')request.add_header('Authorization', 'APPCODE ' + appcode)ctx = ssl.create_default_context()ctx.check_hostname = Falsectx.verify_mode = ssl.CERT_NONEresponse = urllib.request.urlopen(request, context=ctx)content = response.read()if content:res = json.loads(content.decode('UTF-8'))code = res['result'][0]['words']return code

模拟登陆

import timefrom selenium import webdriverfrom .ocr import get_codechrome_path = '/Users/apple/soft/chromedriver'driver = webdriver.Chrome(executable_path=chrome_path)driver.get('https://so.gushiwen.org/user/login.aspx?from=http://so.gushiwen.org/user/collect.aspx')driver.find_element_by_id('email').send_keys('290793992zb@163.com')
time.sleep(1)
driver.find_element_by_id('pwd').send_keys('python123_')
time.sleep(1)
driver.find_element_by_id('imgCode').screenshot('./code.jpg')
time.sleep(1)
# 通过接口 获取 验证码信息
code = get_code()# 填写验证码
driver.find_element_by_id('code').send_keys(code)
time.sleep(1)# 点击登陆
driver.find_element_by_id('denglu').click()

selenium 模拟登陆古诗文网含验证码相关推荐

python爬虫之古诗文网中验证码的识别并登录----第三方平台
目标网站:古诗文网目标网址:http://so.gushiwen.org/user/collect.aspx 任务要求: (1)通过selenium的方式模拟该网站的登录,并成功输入用户名和密码: ...
爬虫day01(上午) 模拟登录古诗文网
前言:今天是学习爬虫的第一天,因为看的教学视频比较老,所以很多案例都不能用了,于是我自己发挥动手操作,做了个比视频里更有含金量的练习,由于与视频案例大有不同,所以期间发生了点问题,经过探索现已解决,留 ...
selenium模拟登陆去哪儿网
序言在模拟网页的表单登陆的时候,比较头疼的一个问题就是图片验证码的情况,碰到了验证码,比如像普通的文字图片类型的验证码,目前一个比较好的思路就是,通过selenium自身提供的截图功能,对指定的图片 ...
python 裁判文书网_python - 用selenium模拟登陆裁判文书网，系统报错找不到元素。...
问题 from selenium import webdriver from selenium.webdriver.common.desired_capabilities import Desire ...
用python实现古诗文网个人主页爬取
#coding=gbk #为了解决编码问题加入的coding=gbk from chaojiying import Chaojiying_Client import requests from lxm ...
用机器学习sklearn+opencv-python过古诗文网4位数字+字母混合验证码
目录获取验证码图片用opencv-python处理图片制作训练数据集训练模型识别验证码编写古诗文网的登录爬虫代码总结与提高源码下载在本节我们将使用sklearn和opencv-pyt ...
python爬虫模拟登录古诗文网站
爬取目标网站https://so.gushiwen.cn/user/login.aspx?from=http://so.gushiwen.cn/user/collect.aspx?type=s 工具: ...
Python使用网络抓包的方式，利用超级鹰平台识别验证码登录爬取古诗文网、上篇--识别验证码
Python使用网络抓包的方式,利用超级鹰平台识别验证码登录,<爬取古诗文网>. 上篇–识别验证码序言: 哈喽,各位小可爱们,我又来了,这次我新学习到的内容是python爬虫识别验证码. ...
python爬虫-古诗文网验证码识别
文章目录一.前期准备二.示例代码一.前期准备古诗文网验证码识别,是通过对古诗文网登陆界面的验证码图片进行识别的,利用专门的验证码识别网站,可以提取验证码图片中的验证码网站推荐:超级鹰注册登 ...

selenium 模拟登陆古诗文网含验证码

ocr.py / 阿里云市场

模拟登陆

selenium 模拟登陆古诗文网含验证码相关推荐

最新文章

热门文章

selenium 模拟登陆 古诗文网 含验证码

ocr.py / 阿里云市场

模拟登陆

selenium 模拟登陆 古诗文网 含验证码相关推荐

最新文章

热门文章

selenium 模拟登陆古诗文网含验证码

selenium 模拟登陆古诗文网含验证码相关推荐