Hive learning: basic statements
The previous chapter covered installing Hive and gave a basic introduction to it. Here we start using Hive. Before doing so, we first cover Hive's basic statements and explain what internal tables and external tables are.
Hive basic statements
Let's take a look at the most basic formats, because there are many kinds of fo ...
Added by FezEvils on Mon, 27 Dec 2021 07:35:09 +0200
CDH 6.2: the whole process of no-brainer setup and configuration (beginner's version)
The software download link is at the bottom
Thanks to: CSDN expert Travel through IT; bilibili expert amoscloud2013
1. Preliminary preparation
Five virtual machines with 8 GB of memory each, named cdh1, cdh2, cdh3, cdh4 and cdh5. The JDK is installed on all of them.
2. Modify IP and host name
Select CentOS 7 for cluster deployment. All three vir ...
Added by thefollower on Mon, 27 Dec 2021 05:46:50 +0200
hadoop_Hdfs07: HDFS HA cluster configuration & ZK cluster configuration & YARN HA configuration
Note: notes
(1) Cluster planning
Hosts: Hadoop102, Hadoop03, Hadoop04
ZK: on all three hosts
JournalNode: on all three hosts
NameNode: on two of the hosts
DataNode: on all three hosts
ResourceManager: on two of the hosts
NodeManager: on all three hosts
(2) Configuring Zookeeper clusters
...
Added by DarkArchon on Sun, 26 Dec 2021 06:36:22 +0200
Are you still using group by? Experts are using grouping sets
Let's look at a problem from actual production:
There is a user access record table that records each user's visits across different channels, operating systems and versions, as follows:
create table test.test10 as
select '10001' as uid,'xiaomi' as qid,'android' as os,'1.2.2' as ver
union all
select '10002' as uid,'xiaomi' as qid,'andro ...
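To illustrate the title's point: in HiveQL, GROUPING SETS computes several groupings in one query that would otherwise need one GROUP BY per grouping, stitched together with UNION ALL. The sketch below is not necessarily the author's query, since the excerpt is cut off. It assumes the test.test10 table above with its qid and os columns; counting records per group and the alias record_cnt are assumptions made for illustration.

```sql
-- Record counts per channel, per operating system, and per (channel, os) pair,
-- computed in a single query instead of three GROUP BY queries.
-- A column that is not part of a given grouping set comes back as NULL.
SELECT qid,
       os,
       COUNT(*) AS record_cnt  -- record_cnt is an illustrative alias
FROM test.test10
GROUP BY qid, os
GROUPING SETS ((qid), (os), (qid, os));
```

Rows produced for the (qid) set have os set to NULL, and rows for the (os) set have qid set to NULL, which is how the three result groups can be told apart.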
Added by jpowermacg4 on Sat, 25 Dec 2021 18:29:07 +0200
Exception analysis of Hive SQL inserting into dynamic partition
82.1 Problem
When an INSERT ... SELECT statement inserts data into a Parquet or ORC table with dynamic partitioning enabled, it fails to execute. The Hive client reports:
Task with the most failures(4):
Diagnostic Messages for this Task:
Error: GC overhead limit exceeded
...
FAILED: Execution Error, return code 2 from org.apache.hadoo ...
Added by drkstr on Sat, 25 Dec 2021 04:41:15 +0200
Hadoop distributed platform construction
Building a Hadoop distributed platform on Linux
First, if the Linux network cannot connect, click "Edit" in the VMware main window and select "Virtual Network Editor". Once inside, restore the default settings in the following two steps; this usually restores connectivity. 1. Environmental ...
Added by Daveyz83 on Mon, 20 Dec 2021 15:04:22 +0200
HDFS slow node monitoring principle and source code analysis
Background
As HDFS clusters grow, some "slow nodes" inevitably appear in the cluster. They show up mainly as slow network data transfer and slow disk reads and writes. These slow nodes are usually hard to detect. Only when the business job data reading and writing involves the ...
Added by Vince on Sun, 19 Dec 2021 18:19:36 +0200
Troubleshooting hdfs for hadoop optimization
This post covers troubleshooting Hadoop HDFS, including NameNode fault handling, cluster safe mode and disk repair. If anything here is wrong, corrections are welcome. Thanks!
NameNode fault handling
1. Scenario: the NameNode process dies and its stored data is lost. How can the NameNode be recovered?
2. Fault simulation
(1) kill -9 NameNode proce ...
Added by Stressed on Sat, 18 Dec 2021 15:50:32 +0200
[CDH 6.3.X] Cloudera Manager 6.3.2, CDH 6.3.1 installation process
Software packages required during deployment
Link: https://pan.baidu.com/s/1UajMORVvQ_VSLOdVkJWYQQ Extraction code: e28y
Link: https://pan.baidu.com/s/1dMj8JEaRIOaXP53W2kF_mQ Extraction code: xbyo
Key points:
Set the host name in FQDN format
Turn off the firewall
Turn off IPv6
Configure a local HTTP service
Configure local storage
The mysql jdbc drive ...
Added by Jacquelyn L. Ja on Sat, 18 Dec 2021 06:00:54 +0200
hadoop data compression and related algorithms and (MapReduce) code example demonstration
Check which compression libraries Hadoop has loaded natively:
[lqs@bdc112 hadoop-3.1.3]$ bin/hadoop checknative
2021-12-15 16:20:12,342 INFO bzip2.Bzip2Factory: Successfully loaded & initialized native-bzip2 library system-native
2021-12-15 16:20:12,345 INFO zlib.ZlibFactory: Successfully loaded & initialized native-zlib library
2021-12-15 16:20:12,3 ...
Added by thebighere on Wed, 15 Dec 2021 19:16:34 +0200