Forrest logo
back to the hive tool

hive:tldr:09cc3

hive: Run a HiveQL with HiveConfig (e.g. `mapred.reduce.tasks=32`).
$ hive --hiveconf ${conf_name}=${conf_value}
try on your machine

The command hive --hiveconf ${conf_name}=${conf_value} is used to run the Apache Hive command-line interface (CLI) with a specific configuration set for a given property, where ${conf_name} is the name of the configuration property, and =${conf_value} is the value assigned to that property.

Hive is a data warehouse infrastructure built on top of Apache Hadoop for querying and analyzing large datasets stored in Hadoop Distributed File System (HDFS). It uses a SQL-like query language called HiveQL. The hive command is used to execute Hive queries and commands from the command line.

With the --hiveconf option, you can pass configuration properties to the Hive CLI. This allows you to customize the behavior of Hive by changing various settings or passing parameters to your Hive queries.

In the command, ${conf_name} represents the name of a specific Hive configuration property, such as hive.exec.parallel or hive.metastore.uris. ${conf_value} is the value assigned to that property, which can be a string, number, or boolean value.

For example, if you want to set the property hive.exec.parallel to true, you can use the command:

hive --hiveconf hive.exec.parallel=true

This command will start the Hive CLI with the hive.exec.parallel property set to true. The configuration change will affect the execution behavior of Hive queries, potentially enabling parallel execution.

This explanation was created by an AI. In most cases those are correct. But please always be careful and never run a command you are not sure if it is safe.
back to the hive tool