What parameters can be set with SET in HiveSQL?
AI: In HiveQL, the SET command can be used to set the following parameters:
- hive.execution.engine: Specifies the execution engine Hive uses: mr (MapReduce), tez, or spark.
- hive.exec.dynamic.partition: Specifies whether to enable dynamic partitioning (true or false). The partition mode (strict or nonstrict) is set separately with hive.exec.dynamic.partition.mode.
- hive.optimize.sort.dynamic.partition: Specifies whether to sort rows by the dynamic partition columns, so that each reducer keeps only one record writer open at a time.
- hive.exec.parallel: Specifies whether independent stages of a query may run in parallel.
- hive.map.aggr: Specifies whether to enable map-side aggregation for GROUP BY queries.
- hive.exec.reducers.bytes.per.reducer: Specifies the amount of input data, in bytes, that each reducer should process. Hive uses it to estimate the number of reducers.
- hive.exec.compress.output: Specifies whether to compress the final output of a query.
- hive.exec.compress.intermediate: Specifies whether to compress the intermediate data passed between the stages of a query.
- hive.vectorized.execution.enabled: Specifies whether to enable vectorized execution.
- hive.vectorized.execution.reduce.enabled: Specifies whether to enable vectorized execution on the reduce side.
- hive.optimize.bucketmapjoin: Specifies whether to enable the bucket map join optimization.
- hive.auto.convert.join: Specifies whether to automatically convert a common join into a map join based on the size of the input files.
- hive.auto.convert.join.noconditionaltask: Specifies whether to convert a join directly into a map join, without generating a conditional task, when the small tables fit within the size limit below.
- hive.auto.convert.join.noconditionaltask.size: Specifies the maximum combined size, in bytes, of the small tables for that direct conversion.
- hive.cbo.enable: Specifies whether to enable the cost-based optimizer.
- hive.stats.autogather: Specifies whether to gather table statistics automatically during INSERT OVERWRITE.
- hive.stats.fetch.column.stats: Specifies whether to fetch column statistics from the metastore during query planning.
- hive.stats.fetch.partition.stats: Specifies whether to fetch partition statistics from the metastore during query planning.
- hive.stats.jdbc.timeout: Specifies the timeout, in seconds, of the JDBC connection used when collecting statistics.
- hive.tez.container.size: Specifies the memory size, in MB, of each Tez container.
- hive.tez.java.opts: Specifies the JVM options for Tez tasks.
- hive.tez.auto.reducer.parallelism: Specifies whether Tez may adjust the number of reducers automatically at runtime.
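As a rough illustration of how a few of these parameters are typically used together in one session, here is a minimal sketch. The table names sales_by_day and staging_sales are hypothetical, and hive.exec.dynamic.partition.mode is the companion setting that takes strict or nonstrict. Values set this way apply only to the current session, and some parameters may be restricted by the administrator on a given cluster.

```sql
-- Print the current value of a parameter (SET with a name and no value).
SET hive.execution.engine;

-- Session-level overrides; they do not change hive-site.xml.
SET hive.exec.dynamic.partition=true;
SET hive.exec.dynamic.partition.mode=nonstrict;
SET hive.exec.parallel=true;
SET hive.exec.compress.output=true;

-- Hypothetical tables, for illustration only: the dt partition value
-- is taken from the last column of the SELECT (dynamic partitioning).
INSERT OVERWRITE TABLE sales_by_day PARTITION (dt)
SELECT order_id, amount, dt
FROM staging_sales;
```

Running SET with only a parameter name is a convenient way to confirm that an override took effect before launching a long query.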
How do I use SET in HiveSQL to enable local mode?
AI: In HiveSQL, you can use the SET command to enable local mode. The specific command is:
SET hive.exec.mode.local.auto=true;
With this setting enabled, Hive decides for each query whether to run it in local mode or submit it to the cluster as MapReduce jobs. If the amount of input data is small, Hive automatically runs the query in local mode.
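What counts as "small" is controlled by two further parameters. The sketch below sets them explicitly; the values shown are the defaults as I understand them from the Hive configuration documentation, so treat them as an assumption and check them against your Hive version.

```sql
-- Let Hive choose local mode automatically for small inputs.
SET hive.exec.mode.local.auto=true;

-- Local mode is considered only when the total input size is below
-- this many bytes (assumed default: 134217728, i.e. 128 MB) ...
SET hive.exec.mode.local.auto.inputbytes.max=134217728;

-- ... and the number of input files is below this limit (assumed default: 4).
SET hive.exec.mode.local.auto.input.files.max=4;
```

Raising these thresholds makes Hive pick local mode more often, which can help on small test datasets but may overload the machine running the Hive client if the inputs grow.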