Preface
When wrapping processing functions around RDDs and DataFrames in Spark, type conversions come up all the time. This post records some of the common ones.
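All the snippets below assume an existing sparkSession. A minimal setup might look like this (the appName and master values are placeholders for local testing):
import org.apache.spark.sql.SparkSession
val sparkSession = SparkSession.builder()
  .appName("type-conversions") // hypothetical app name
  .master("local[*]")          // local mode for testing
  .getOrCreate()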
Array => Row
import org.apache.spark.sql.{Row, RowFactory}
val arr = Array("aa/2/cc/10", "xx/3/nn/30", "xx/3/nn/20")
val row = Row.fromSeq(arr)
// Java-style alternative; the array must be expanded into varargs,
// otherwise the whole array becomes a single field:
// val row = RowFactory.create(arr: _*)
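A quick check on the resulting Row:
println(row.length)       // 3
println(row.getString(0)) // aa/2/cc/10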
Row => Array
val a: Array[Any] = row.toSeq.toArray
Sometimes you need a concrete element type T, such as String; in that case, convert each value first:
val a: Array[String] = row.toSeq.map(m => m.toString).toArray
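If every field already has the target runtime type, a cast avoids the toString round trip (a sketch; this throws ClassCastException if any field is not actually a String):
val b: Array[String] = row.toSeq.map(_.asInstanceOf[String]).toArray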
Tuple => Array
val tuple = ((20201022,5060180989186180L,"[12, 15)"),288556)
tuple.productIterator.toArray
The same approach works for any type T that extends Product, such as a case class.
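For example, with a hypothetical case class (case classes extend Product):
case class Point(x: Int, y: Int) // hypothetical example class
val fields: Array[Any] = Point(1, 2).productIterator.toArray // Array(1, 2)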
Array => RDD
val rdd = sparkSession.sparkContext.parallelize(Array(tuple))
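parallelize preserves the element type, so rdd here is RDD[((Int, Long, String), Int)]; collecting it back returns the original tuple:
rdd.collect().foreach(println) // ((20201022,5060180989186180,[12, 15)),288556)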
RDD => DataFrame
// define a case class
case class Person(name: String, age: Int)
One option is to pass an RDD[Row] together with an explicit schema:
import org.apache.spark.sql.types.{IntegerType, StringType, StructField, StructType}
val rdd = sparkSession.sparkContext.parallelize(Array(("tom", 1), ("luna", 2)))
  .map(row => Row(row._1, row._2))
// build the schema
val schema = StructType(Array(
  StructField("name", StringType, true),
  StructField("age", IntegerType, true)
))
val df = sparkSession.createDataFrame(rdd, schema)
Alternatively, import sparkSession.implicits._ and convert to a DataFrame directly via the implicit toDF():
import sparkSession.implicits._
val df = sparkSession.sparkContext.parallelize(Array(("tom", 1), ("luna", 2)))
  .map(row => Person(row._1, row._2)).toDF()
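Either way, a quick look at the result:
df.printSchema() // name: string, age: integer
df.show()        // two rows: (tom, 1) and (luna, 2)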
DataFrame => RDD
val rdd1 = df.rdd
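Note that df.rdd returns an RDD[Row]. To recover typed objects, map over the rows or go through a typed Dataset (a sketch reusing the Person class above; the Dataset path needs sparkSession.implicits._ in scope for the encoder):
val people = df.rdd.map(r => Person(r.getAs[String]("name"), r.getAs[Int]("age")))
// or, via a typed Dataset:
val people2 = df.as[Person].rdd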