hive:tldr:09cc3
The command hive --hiveconf ${conf_name}=${conf_value}
is used to run the Apache Hive command-line interface (CLI) with a specific configuration set for a given property, where ${conf_name}
is the name of the configuration property, and =${conf_value}
is the value assigned to that property.
Hive is a data warehouse infrastructure built on top of Apache Hadoop for querying and analyzing large datasets stored in Hadoop Distributed File System (HDFS). It uses a SQL-like query language called HiveQL. The hive
command is used to execute Hive queries and commands from the command line.
With the --hiveconf
option, you can pass configuration properties to the Hive CLI. This allows you to customize the behavior of Hive by changing various settings or passing parameters to your Hive queries.
In the command, ${conf_name}
represents the name of a specific Hive configuration property, such as hive.exec.parallel
or hive.metastore.uris
. ${conf_value}
is the value assigned to that property, which can be a string, number, or boolean value.
For example, if you want to set the property hive.exec.parallel
to true
, you can use the command:
hive --hiveconf hive.exec.parallel=true
This command will start the Hive CLI with the hive.exec.parallel
property set to true
. The configuration change will affect the execution behavior of Hive queries, potentially enabling parallel execution.