Landon Campbell

11 months ago

Hi,

Pretty new to Scrapy, so forgive me if this is obvious. We're running
Scrapy 0.24.2 (under Portia/Slybot), with ProxyMiddleware enabled and a
fairly large pool of proxies. Any time I request an HTTPS URL, I receive a
"Could not open CONNECT tunnel" error, which ultimately causes the spider
to close. In my development environment, I'm running Scrapy 0.24.4
(Portia/Slybot), through the same proxies, and I do NOT have this problem.
Is this simply a Scrapy version issue, or is it something else? Can't
figure out why it's OK one place but not the other. Any thoughts would be
appreciated.

Thanks,
Landon

--
You received this message because you are subscribed to the Google Groups "scrapy-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to scrapy-users+***@googlegroups.com.
To post to this group, send email to scrapy-***@googlegroups.com.
Visit this group at http://groups.google.com/group/scrapy-users.
For more options, visit https://groups.google.com/d/optout.

Travis Leleu

11 months ago

Why don't you upgrade to 0.24.4 on your production environment?

Landon Campbell

11 months ago

Upgrading is an option, but I prefer to know *why* something is happening.
If this is a known issue that's been fixed, great. Otherwise, if anybody
has an explanation, that would be appreciated.

Daniel Fockler

11 months ago

I've generally seen this error on sites that are using SSL. I'm not sure
about the specifics, but it's because the SSL handler in Scrapy can't
manage the connection with whatever site you are working with.

Travis Leleu

11 months ago

Are you running through a proxy?  IIRC, there is some funkiness when trying
to connect via https when your proxy is an http-only proxy.
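
For context: to fetch an HTTPS URL through an HTTP proxy, the client first
sends the proxy a CONNECT request and only starts the TLS handshake once the
proxy answers with a 2xx status. As far as I understand it, "Could not open
CONNECT tunnel" means that step did not succeed. Below is a minimal sketch for
probing a single proxy outside Scrapy. The function names and the proxy and
target host arguments are hypothetical placeholders, not anything from Scrapy
itself, and it assumes the proxy needs no authentication.

```python
import socket


def build_connect_request(target_host, target_port=443):
    """Build the raw CONNECT request a client sends to an HTTP proxy."""
    authority = "{}:{}".format(target_host, target_port)
    lines = [
        "CONNECT {} HTTP/1.1".format(authority),
        "Host: {}".format(authority),
        "",
        "",
    ]
    return "\r\n".join(lines).encode("ascii")


def parse_status_code(response_bytes):
    """Extract the numeric status code from the proxy's first response line."""
    status_line = response_bytes.split(b"\r\n", 1)[0].decode("ascii", "replace")
    parts = status_line.split(" ", 2)
    if len(parts) < 2 or not parts[1].isdigit():
        raise ValueError("unexpected proxy response: {!r}".format(status_line))
    return int(parts[1])


def probe_proxy(proxy_host, proxy_port, target_host, timeout=10):
    """Return the status code the proxy gives for a CONNECT to target_host:443."""
    sock = socket.create_connection((proxy_host, proxy_port), timeout=timeout)
    try:
        sock.sendall(build_connect_request(target_host))
        return parse_status_code(sock.recv(4096))
    finally:
        sock.close()
```

Running probe_proxy against the same proxy from both environments would show
whether the proxy itself refuses the tunnel (a non-2xx code) or whether the
failure happens later, on the client side.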

I use crawlera, which has an alternative endpoint (you connect via http to
crawlera, pass the encoded https url, and the proxy connects via https to
the target server).  You may need to configure to do http to your proxy,
https from your proxy to the target server.

Without more specifics of your situation, I'm afraid that's all the help I
can give.  You might try and make sure all your SSL type libraries are
up-to-date, as I've run into errors when out of date libs prevent the SSL
handshake, borking everything.
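
One way to compare the SSL-related libraries between two environments is to
print their versions side by side. This is only a sketch: the module list is
an assumption about what a Scrapy install of that era typically depends on
(Twisted, pyOpenSSL imported as OpenSSL, cryptography), and the helper name is
made up for illustration.

```python
import importlib
import ssl


def ssl_stack_versions(modules=("scrapy", "twisted", "OpenSSL", "cryptography")):
    """Return a dict mapping each module name to its version string or an error note."""
    # The OpenSSL build that Python's own ssl module is linked against.
    versions = {"ssl (stdlib)": ssl.OPENSSL_VERSION}
    for name in modules:
        try:
            mod = importlib.import_module(name)
        except Exception as exc:  # missing or broken installs are both worth seeing
            versions[name] = "import failed: {}".format(exc)
            continue
        versions[name] = str(getattr(mod, "__version__", "unknown"))
    return versions


if __name__ == "__main__":
    for name, version in sorted(ssl_stack_versions().items()):
        print("{}: {}".format(name, version))
```

Running it in both production and development and diffing the output should
show whether anything besides the Scrapy version differs.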

Landon Campbell

11 months ago

Travis,

Yes, we are using proxies, about 100 of them, but I don't *think* that's
the issue, as I'm able to crawl these sites successfully using those
proxies from my local Ubuntu. I think your point regarding SSL type
libraries is promising, but being new to Python, I'm not sure which
libraries those would be. Do you have any suggestions for which libraries I
might investigate?

Thanks,
Landon

Reposted from: https://my.oschina.net/airship/blog/628812
