park.sql.AnalysisException: path hdfs://* already exists

  df.write.format("csv").save("hdfs://s206/user/hive/warehouse/mobike.db/logs/year=2019/month=7/day=5/hour=7/minute=4")
  df.show()

错误代码

scala> users.select("name","favorite_color").write.format("json").save("file:///home/hadoop/tmp/parquet2json/")
org.apache.spark.sql.AnalysisException: path file:/home/hadoop/tmp/parquet2json already exists.;
  at org.apache.spark.sql.execution.datasources.InsertIntoHadoopFsRelationCommand.run(InsertIntoHadoopFsRelationCommand.scala:109)
  at org.apache.spark.sql.execution.command.DataWritingCommandExec.sideEffectResult$lzycompute(commands.scala:104)
........................................................



解决:
可以通过设置savemode来解决这个问题

df.write.format("csv").mode("overwrite").save("hdfs://s206/user/hive/warehouse/mobike.db/logs/year=2019/month=7/day=5/hour=7/minute=4")
      df.show()

在这里插入图片描述

猜你喜欢

转载自blog.csdn.net/weixin_43093501/article/details/94771696
今日推荐