Hive learning: basic statements

The previous chapter covered installing Hive and gave a basic introduction to it. Here we start using Hive. Before doing so, we first cover Hive's basic statements and explain what internal tables and external tables are. Hive basic statements: let's look at the most basic formats, because there are many kinds of fo ...

Added by FezEvils on Mon, 27 Dec 2021 07:35:09 +0200

CDH 6.2: the whole process of no-brainer setup and configuration (beginner's version)

The software download link is at the bottom. Thanks to: CSDN Daniel: Travel through IT; bilibili Daniel: amoscloud2013. 1. Preliminary preparation: five 8G virtual machines, named CDH1, cdh2, cdh3, CDH4 and cdh5. JDK is installed on all virtual machines. 2. Modify IP and host name: select CentOS 7 for cluster deployment. All three vir ...

Added by thefollower on Mon, 27 Dec 2021 05:46:50 +0200

hadoop_Hdfs07: HDFS HA cluster configuration & ZK cluster configuration & YARN HA configuration

(1) Cluster planning. Hosts: Hadoop102, Hadoop03, Hadoop04. Roles listed across the hosts: ZK ×3, JournalNode ×3, NameNode ×2, DataNode ×3, ResourceManager ×2, NodeManager ×3. (2) Configuring Zookeeper clusters ...

Added by DarkArchon on Sun, 26 Dec 2021 06:36:22 +0200

Are you still using group by? Experts are using grouping sets

Let's take a look at a problem from actual production: there is a user access record table, which records user access under different channels, operating systems and versions, as follows: create table test.test10 as select '10001' as uid,'xiaomi' as qid,'android' as os,'1.2.2' as ver union all select '10002' as uid,'xiaomi' as qid,'andro ...

Added by jpowermacg4 on Sat, 25 Dec 2021 18:29:07 +0200
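
As a rough illustration of the idea behind the grouping sets post above, the sketch below counts access records once per grouping set, which is what a UNION ALL of several GROUP BY queries would otherwise do. It is a minimal Python sketch, not the post's own code: the function name grouping_sets is hypothetical, and only the first data row comes from the excerpt (the other rows are made up because the original statement is truncated).

```python
from collections import Counter

# The first row comes from the excerpt; the other rows are hypothetical
# stand-ins, because the original statement is truncated.
rows = [
    {"uid": "10001", "qid": "xiaomi", "os": "android", "ver": "1.2.2"},
    {"uid": "h0001", "qid": "xiaomi", "os": "android", "ver": "1.2.3"},  # hypothetical
    {"uid": "h0002", "qid": "appstore", "os": "ios", "ver": "1.2.2"},  # hypothetical
]


def grouping_sets(rows, columns, sets):
    """Count rows once per grouping set, like a UNION ALL of GROUP BYs.

    Columns that are not part of the current grouping set are reported
    as None, mirroring the NULLs that SQL emits for them.
    """
    result = []
    for cols in sets:
        counts = Counter(
            tuple(row[c] if c in cols else None for c in columns)
            for row in rows
        )
        # Sort by repr so keys containing None can still be ordered.
        result.extend(sorted(counts.items(), key=repr))
    return result


# Roughly what one would ask Hive for with:
#   GROUP BY qid, os GROUPING SETS (qid, os, (qid, os), ())
result = grouping_sets(rows, ("qid", "os"), [("qid",), ("os",), ("qid", "os"), ()])

for key, count in result:
    print(key, count)
```

Each tuple in the output is one aggregated row: per channel, per operating system, per channel and operating system together, and a grand total. A single pass over the grouping sets replaces several separate GROUP BY queries.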

Exception analysis of Hive SQL inserting into dynamic partition

82.1 Problem: when an INSERT ... SELECT statement inserts data into a table in Parquet or ORC format with dynamic partitioning enabled, it cannot execute normally. Hive client: Task with the most failures(4): Diagnostic Messages for this Task: Error: GC overhead limit exceeded ... FAILED: Execution Error, return code 2 from org.apache.hadoo ...

Added by drkstr on Sat, 25 Dec 2021 04:41:15 +0200

Hadoop distributed platform construction

Building a Hadoop distributed platform on a Linux system. First, if the Linux network cannot connect, click "Edit" in the VMware main interface, and then select "virtual network editor". After entering, restore the default settings in the following two steps. Generally, the network works again after this reset. 1. Environmental ...

Added by Daveyz83 on Mon, 20 Dec 2021 15:04:22 +0200

HDFS slow node monitoring principle and source code analysis

Background: as HDFS clusters grow, it is inevitable that some "slow nodes" appear in the cluster, mainly manifested as slow network data transmission and slow disk reads and writes. Usually, these slow nodes are hard to detect. Only when the business job data reading and writing involves the ...

Added by Vince on Sun, 19 Dec 2021 18:19:36 +0200

Hadoop optimization: troubleshooting HDFS

This blog is mainly about troubleshooting Hadoop HDFS, including NameNode fault handling, cluster safe mode and disk repair. If anything is wrong, corrections are welcome. Thanks! NameNode fault handling 1. Scenario: the NameNode process dies and the stored data is lost. How to recover the NameNode? 2. Fault simulation (1) kill -9 NameNode proce ...

Added by Stressed on Sat, 18 Dec 2021 15:50:32 +0200

[CDH 6.3.X] Cloudera Manager 6.3.2, CDH 6.3.1 installation process

Software packages required during deployment. Link: https://pan.baidu.com/s/1UajMORVvQ_VSLOdVkJWYQQ Extraction code: e28y Link: https://pan.baidu.com/s/1dMj8JEaRIOaXP53W2kF_mQ Extraction code: xbyo Key points: set the host name in FQDN format; turn off the firewall; turn off IPv6; configure a local HTTP service; configure local storage; the mysql jdbc drive ...

Added by Jacquelyn L. Ja on Sat, 18 Dec 2021 06:00:54 +0200

Hadoop data compression, related algorithms, and a (MapReduce) code example

To see which compression algorithms Hadoop supports: [lqs@bdc112 hadoop-3.1.3]$ bin/hadoop checknative 2021-12-15 16:20:12,342 INFO bzip2.Bzip2Factory: Successfully loaded & initialized native-bzip2 library system-native 2021-12-15 16:20:12,345 INFO zlib.ZlibFactory: Successfully loaded & initialized native-zlib library 2021-12-15 16:20:12,3 ...

Added by thebighere on Wed, 15 Dec 2021 19:16:34 +0200