完整报错如下:

Traceback (most recent call last):File "<stdin>", line 6, in <module>File "/home/appleyuchi/bigdata/spark-2.3.1-bin-hadoop2.7/python/pyspark/sql/readwriter.py", line 703, in saveself._jwrite.save()File "/home/appleyuchi/bigdata/spark-2.3.1-bin-hadoop2.7/python/lib/py4j-0.10.7-src.zip/py4j/java_gateway.py", line 1257, in __call__File "/home/appleyuchi/bigdata/spark-2.3.1-bin-hadoop2.7/python/pyspark/sql/utils.py", line 63, in decoreturn f(*a, **kw)File "/home/appleyuchi/bigdata/spark-2.3.1-bin-hadoop2.7/python/lib/py4j-0.10.7-src.zip/py4j/protocol.py", line 328, in get_return_value
py4j.protocol.Py4JJavaError: An error occurred while calling o92.save.
: java.lang.ClassNotFoundException: com.mysql.cj.jdbc.Driverat java.net.URLClassLoader.findClass(URLClassLoader.java:381)at java.lang.ClassLoader.loadClass(ClassLoader.java:424)at java.lang.ClassLoader.loadClass(ClassLoader.java:357)at org.apache.spark.sql.execution.datasources.jdbc.DriverRegistry$.register(DriverRegistry.scala:45)at org.apache.spark.sql.execution.datasources.jdbc.JDBCOptions$$anonfun$6.apply(JDBCOptions.scala:79)at org.apache.spark.sql.execution.datasources.jdbc.JDBCOptions$$anonfun$6.apply(JDBCOptions.scala:79)at scala.Option.foreach(Option.scala:257)at org.apache.spark.sql.execution.datasources.jdbc.JDBCOptions.<init>(JDBCOptions.scala:79)at org.apache.spark.sql.execution.datasources.jdbc.JDBCOptions.<init>(JDBCOptions.scala:35)at org.apache.spark.sql.execution.datasources.jdbc.JdbcRelationProvider.createRelation(JdbcRelationProvider.scala:60)at org.apache.spark.sql.execution.datasources.SaveIntoDataSourceCommand.run(SaveIntoDataSourceCommand.scala:45)at org.apache.spark.sql.execution.command.ExecutedCommandExec.sideEffectResult$lzycompute(commands.scala:70)at org.apache.spark.sql.execution.command.ExecutedCommandExec.sideEffectResult(commands.scala:68)at org.apache.spark.sql.execution.command.ExecutedCommandExec.doExecute(commands.scala:86)at org.apache.spark.sql.execution.SparkPlan$$anonfun$execute$1.apply(SparkPlan.scala:131)at org.apache.spark.sql.execution.SparkPlan$$anonfun$execute$1.apply(SparkPlan.scala:127)at org.apache.spark.sql.execution.SparkPlan$$anonfun$executeQuery$1.apply(SparkPlan.scala:155)at org.apache.spark.rdd.RDDOperationScope$.withScope(RDDOperationScope.scala:151)at org.apache.spark.sql.execution.SparkPlan.executeQuery(SparkPlan.scala:152)at org.apache.spark.sql.execution.SparkPlan.execute(SparkPlan.scala:127)at org.apache.spark.sql.execution.QueryExecution.toRdd$lzycompute(QueryExecution.scala:80)at org.apache.spark.sql.execution.QueryExecution.toRdd(QueryExecution.scala:80)at org.apache.spark.sql.DataFrameWriter$$anonfun$runCommand$1.apply(DataFrameWriter.scala:654)at org.apache.spark.sql.DataFrameWriter$$anonfun$runCommand$1.apply(DataFrameWriter.scala:654)at org.apache.spark.sql.execution.SQLExecution$.withNewExecutionId(SQLExecution.scala:77)at org.apache.spark.sql.DataFrameWriter.runCommand(DataFrameWriter.scala:654)at org.apache.spark.sql.DataFrameWriter.saveToV1Source(DataFrameWriter.scala:273)at org.apache.spark.sql.DataFrameWriter.save(DataFrameWriter.scala:267)at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)at java.lang.reflect.Method.invoke(Method.java:498)at py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:244)at py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:357)at py4j.Gateway.invoke(Gateway.java:282)at py4j.commands.AbstractCommand.invokeMethod(AbstractCommand.java:132)at py4j.commands.CallCommand.execute(CallCommand.java:79)at py4j.GatewayConnection.run(GatewayConnection.java:238)at java.lang.Thread.run(Thread.java:748)

解决方案:

mv mysql-connector-java-8.0.20.jar $SPARK_HOME/jars/

驱动文件mysql-connector-java-8.0.20.jar是从maven仓库下载的:

https://mvnrepository.com/artifact/mysql/mysql-connector-java/8.0.20

注意,这个报错的设置,需要搞清楚当前spark是什么mode,如果盲目照搬stackoverflow和百度,你会发现无效!

该结论的依据来自[1]中的表格.

spark-defaults.conf中设置如下:

spark.driver.extraClassPath   = /home/appleyuchi/bigdata/apache-hive-3.0.0-bin/lib/mysql-connector-java-8.0.20.jar
spark.executor.extraClassPath = /home/appleyuchi/bigdata/apache-hive-3.0.0-bin/lib/mysql-connector-java-8.0.20.jar

spark.jars = /home/appleyuchi/bigdata/apache-hive-3.0.0-bin/lib/mysql-connector-java-8.0.20.jar

测试方法如下:

①pyspark --master yarn(然后在交互是模式中输入交互式代码)

②spark-submit --master yarn --deploy-mode cluster 源码.py

#----------------------------------------------------------附录-------------------------------------------------------------------------------------------------

源码.py

import pandas as pd
from pyspark.sql import SparkSession
from pyspark import SparkContext
from pyspark.sql import SQLContextdef map_extract(element):file_path, content = elementyear = file_path[-8:-4]return [(year, i) for i in content.split("\n") if i]spark = SparkSession\.builder\.appName("PythonTest")\.getOrCreate()res = spark.sparkContext.wholeTextFiles('hdfs://Desktop:9000/user/mercury/names',minPartitions=40)  \.map(map_extract) \.flatMap(lambda x: x) \.map(lambda x: (x[0], int(x[1].split(',')[2]))) \.reduceByKey(lambda x,y:x+y)df = res.toDF(["key","num"])  #把已有数据列改成和目标mysql表的列的名字相同
# print(dir(df))
df.printSchema()
print(df.show())
df.printSchema()df.write.format("jdbc").options(url="jdbc:mysql://Desktop:3306/leaf",driver="com.mysql.cj.jdbc.Driver",dbtable="spark",user="appleyuchi",password="appleyuchi").mode('append').save()

Reference:

[1]Spark Shell Add Multiple Drivers/Jars to Classpath using spark-defaults.conf
[2]Spark Configuration

[3]py4j.protocol.Py4JJavaError: An error occurred while calling o90.save(官方bug,目前没有解决)

pyspark报错java.lang.ClassNotFoundException: com.mysql.cj.jdbc.Driver相关推荐

  1. 报错:java.lang.ClassNotFoundException: com.mysql.cj.jdbc.Driver at java.base/jdk.internal.loader.Bui

    前言必读 读者手册(必读)_云边的快乐猫的博客-CSDN博客 一.遇到问题 : java.lang.ClassNotFoundException: com.mysql.cj.jdbc.Driver   ...

  2. java.lang.ClassNotFoundException: com.mysql.cj.jdbc.Driver 确认配置\版本都没问题,那么你就差最后一步

    java.lang.ClassNotFoundException: com.mysql.cj.jdbc.Driver url\driverClassName 都没问题且jar包导入正确符合对应mysq ...

  3. 数据库加载驱动问题(java.lang.ClassNotFoundException: com.mysql.cj.jdbc.Driver)

    java.lang.ClassNotFoundException: com.mysql.cj.jdbc.Driver 遇到此问题,首先检查IDEA外部库中是否有mysql数据库驱动.如下所示: 如果发 ...

  4. IDEA导入MySQL的jdbc驱动出现“java.lang.ClassNotFoundException: com.mysql.cj.jdbc.Driver”

    目录 一.一般的解决思路 1.JDBC下载链接 2.选择下载内容,并进行下载 3.将驱动导入java项目 二.依然导入驱动失败怎么办 当我们在idea中使用java操作mysql数据库时会出现: Ex ...

  5. 解决方法dataSource init error java.sql.SQLException: com.mysql.cj.jdbc.Driver

    使用mysql+druid时报错 这是报错信息折腾了好久. 七月 08, 2019 2:39:14 下午 com.alibaba.druid.pool.DruidDataSource error 严重 ...

  6. 解决error {dataSource-1} init error java.sql.SQLException: com.mysql.cj.jdbc.Driver

    七月 08, 2019 2:39:14 下午 com.alibaba.druid.pool.DruidDataSource error 严重:error {dataSource-1} init err ...

  7. 首次使用eclipes运行项目报错“Java.lang.ClassNotFoundException”

    首次使用eclipes运行项目报错"Java.lang.ClassNotFoundException" 运行项目时提示: 解决方法: 1.点击菜单-Project-Build Au ...

  8. 报错:dataSource init error java.sql.SQLException: com.mysql.cj.jdbc.Driver

    解决方法dataSource init error java.sql.SQLException: com.mysql.cj.jdbc.Driver

  9. hbase启动的时候报错java.lang.ClassNotFoundException: org.slf4j.LoggerFactory

    启动hbase的时候报错: master running as process 15806. Stop it first. Desktop: regionserver running as proce ...

最新文章

  1. 网络营销外包——网络营销外包专员浅析提升用户体验从哪入手?
  2. 基于矩阵分解的推荐算法,简单入门
  3. boost::multi_array模块实现index_base 修改工具的测试
  4. Loj#6434「PKUSC2018」主斗地(搜索)
  5. 汇编_指令_FLAGS
  6. php的declare,php 中的declare
  7. linux 定时执行搅拌,Linux上定时shell脚本
  8. 信息学奥赛一本通 2048:【例5.18】串排序
  9. fastjson 添加key value_FastJson迁移至Jackson
  10. html script 设置编码,HTML Script text用法及代码示例
  11. flume1.7 TailDirSource断点续传与文件更名后数据重复采集的bug修复
  12. CoinFLEX的基本情况以及与Bakkt
  13. 分页实体类:PageBean
  14. lora终端连接云服务器_一种LoRa服务器及其数据传输方法与流程
  15. ps入门第17天_模糊与锐化 案例:基础磨皮效果_ps修图_ps磨皮_ps高低频修图
  16. 关于23届大数据岗实习总结
  17. Win10删除微软拼音输入法
  18. 给仍在「 选品 」的跨境卖家提个醒!
  19. 自定义通用信号处理核心板CMB-2C6657-1K7
  20. http-equiv=“X-UA-Compatible“是什么意思?

热门文章

  1. 34、JS/AJAX
  2. 选择Asp for javascript,非.net。
  3. iOS Xcode4.2代码管理工具的使用
  4. Cheatsheet: 2011 12.01 ~ 12.12
  5. 数字图像处理吴娱课后答案_数字图像处理课后题答案
  6. (一)数据结构与算法简介
  7. Vue 实现ToDoList
  8. xp电脑自动锁定计算机,WindowsXP系统组策略的锁死的解决方法
  9. redis储存List类型数据
  10. echarts定时加载动画数据