Differences between order by, distribute by, cluster by, and sort by in Hive

Category: Other | 2018-12-04 02:54:47

Copyright notice: this is an original article by the blogger and may not be reproduced without the blogger's permission. https://blog.csdn.net/qq_18730505/article/details/82761198

id	name	old
1	张三	10
1	李四	15
3	王五	20
4	赵六	25

Assume a table with the three columns shown above: id, name, and old.

order by old

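order by sorts the whole result set globally, which Hive does by sending every row through a single reducer, so no data distribution is involved. The default direction is ascending; the query below specifies desc, so the 4 records come back in descending order of old.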

select * from table order by old desc

4	赵六	25
3	王五	20
1	李四	15
1	张三	10

distribute by

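In MapReduce, the map side distributes rows to the reducers according to the specified column, so that all rows with the same value in that column arrive at the same reducer. distribute by only controls distribution; it does not sort.

distribute by id sends rows with the same id to the same reducer (it takes no asc or desc). As for how this is implemented: the hash of the column value is taken modulo the number of reducers, and rows with the same remainder are grouped together.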
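The hash-then-modulo rule can be sketched in a few lines of Python. This is only an illustration, not Hive's actual partitioner: it assumes an integer key hashes to itself and that two reducers are configured, and the names `reducer_index` and `distribute_by` are made up for this example.

```python
# Sample rows from the table above: (id, name, old)
ROWS = [(1, "张三", 10), (1, "李四", 15), (3, "王五", 20), (4, "赵六", 25)]


def reducer_index(key: int, num_reducers: int) -> int:
    """Pick a reducer as hash(key) mod num_reducers.

    Assumption: an integer key hashes to itself; the sign bit is masked
    off so the index is never negative.
    """
    return (key & 0x7FFFFFFF) % num_reducers


def distribute_by(rows, key_fn, num_reducers):
    """Group rows into one list per reducer; nothing is sorted here."""
    reducers = [[] for _ in range(num_reducers)]
    for row in rows:
        reducers[reducer_index(key_fn(row), num_reducers)].append(row)
    return reducers


# Hypothetical setup: two reducers, distributing by the id column.
buckets = distribute_by(ROWS, key_fn=lambda r: r[0], num_reducers=2)
for i, bucket in enumerate(buckets):
    print(i, bucket)
```

Under these assumptions id 4 lands on one reducer and ids 1, 1, and 3 land on the other: both rows with id 1 always stay together, but unrelated ids can share a reducer too.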

sort by

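Usually used together with distribute by: the data is first distributed, then each reducer sorts its own rows by the specified column. The ordering holds within each reducer only, not across the whole result.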
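To see why a per-reducer sort differs from a global sort, here is a rough Python simulation of `distribute by id sort by old`. It assumes two reducers and a plain modulo on the integer id as the partitioning rule; `distribute_sort_by` is a hypothetical helper, not a Hive API.

```python
# Sample rows from the table above: (id, name, old)
ROWS = [(1, "张三", 10), (1, "李四", 15), (3, "王五", 20), (4, "赵六", 25)]


def distribute_sort_by(rows, dist_key, sort_key, num_reducers, descending=False):
    """Partition rows by dist_key, then sort each partition independently."""
    reducers = [[] for _ in range(num_reducers)]
    for row in rows:
        # Assumption: integer key modulo reducer count picks the reducer.
        reducers[dist_key(row) % num_reducers].append(row)
    return [sorted(bucket, key=sort_key, reverse=descending) for bucket in reducers]


per_reducer = distribute_sort_by(
    ROWS, dist_key=lambda r: r[0], sort_key=lambda r: r[2], num_reducers=2
)

# Reading the reducers' outputs one after another is not a global order.
combined = [row for bucket in per_reducer for row in bucket]
globally_sorted = sorted(ROWS, key=lambda r: r[2])

print([r[2] for r in combined])
print([r[2] for r in globally_sorted])
```

Each reducer's list is sorted by old, yet the concatenated output is not. A total order across the whole result is what order by provides.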

cluster by

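cluster by distributes rows to the reducers by the specified column and then sorts each reducer's rows by that same column. It is shorthand for distribute by plus sort by on the same column.

The sort is always ascending; cluster by does not accept asc or desc.

Example

select * from table distribute by id sort by id

and

select * from table cluster by id

have the same effect.

-- Result (assuming a single reducer; the order of the two rows with id 1 is not guaranteed)

1	张三	10
1	李四	15
3	王五	20
4	赵六	25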
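The relationship between cluster by and the distribute by plus sort by pair can be modelled as below. This is a sketch under simplifying assumptions (integer key modulo reducer count as the partitioning rule, ascending sort on the same key); the function names are illustrative and do not correspond to anything in Hive.

```python
# Rows from the table above, deliberately shuffled: (id, name, old)
ROWS = [(4, "赵六", 25), (1, "李四", 15), (3, "王五", 20), (1, "张三", 10)]


def distribute_by(rows, key, num_reducers):
    """Send each row to reducer key(row) % num_reducers (assumed rule)."""
    reducers = [[] for _ in range(num_reducers)]
    for row in rows:
        reducers[key(row) % num_reducers].append(row)
    return reducers


def sort_by(reducers, key):
    """Sort every reducer's rows on its own, ascending."""
    return [sorted(bucket, key=key) for bucket in reducers]


def cluster_by(rows, key, num_reducers):
    """Model cluster by as distribute by + ascending sort by on the same key."""
    return sort_by(distribute_by(rows, key, num_reducers), key)


def by_id(row):
    return row[0]


one_reducer = cluster_by(ROWS, by_id, num_reducers=1)
two_reducers = cluster_by(ROWS, by_id, num_reducers=2)
print([r[0] for r in one_reducer[0]])
print([[r[0] for r in bucket] for bucket in two_reducers])
```

With one reducer the ids come out in ascending order. With two reducers each reducer's ids are ascending, but the split of ids between reducers depends on the hash. Python's sort is stable, so tied ids keep their input order here; Hive makes no such promise.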

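Example

select * from table distribute by id sort by old asc

Hive rejects a query that combines cluster by with sort by, so distributing by one column and sorting by another has to be written as distribute by plus sort by. Assuming two reducers:

-- bucket 1

4	赵六	25

-- bucket 2

1	张三	10
1	李四	15
3	王五	20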
