On this page you find all important commands for the CLI tool hive. If the command you are looking for is missing please ask our AI.

hive

Hive is a command line tool and data warehouse infrastructure built on top of Hadoop. It is designed to provide a high-level interface for querying and analyzing large datasets stored in distributed systems. Hive uses a language called HiveQL, which is similar to SQL, to interact with the data. It supports complex queries, joins, aggregations, and transformations, making it suitable for data analysis tasks. Hive provides a schema on read approach, allowing users to structure the data when querying instead of when loading it into the system. It offers a familiar SQL-like interface, enabling users with SQL knowledge to easily work with big data. Hive is highly scalable and can handle petabytes of data by leveraging the distributed computing power of Hadoop. It provides a cost-effective solution for big data analytics as it can run on commodity hardware. Hive supports various file formats such as CSV, JSON, Parquet, and ORC, allowing users to work with different data types. It integrates well with other Hadoop ecosystem tools like HBase, Spark, and Pig, expanding its capabilities for data processing and analysis.

List of commands for hive:

- hive
hive:tldr:09cc3 hive: Run a HiveQL with HiveConfig (e.g. `mapred.reduce.tasks=32`).

$ hive --hiveconf ${conf_name}=${conf_value}

try on your machine

explain this command
- hive
hive:tldr:10d00 hive: Run a HiveQL file with a variable substitution.

$ hive --define ${key}=${value} -f ${filename-sql}

try on your machine

explain this command
- hive
hive:tldr:8a4ac hive: Start a Hive interactive shell.

$ hive

try on your machine

explain this command
- hive
hive:tldr:da360 hive: Run HiveQL.

$ hive -e "${hiveql_query}"

try on your machine

explain this command

tool overview