Hadoop 3.3.1 compilation, installation and deployment tutorial

Preface: It is best to recompile the source code when building Hadoop, because some Hadoop features rely on JNI to connect Java class files with libraries generated from native code. To run native code on a Linux system, the native code must first be compiled into a .so file for the target CPU architecture. Different processor architectures n ...

Added by roxki on Fri, 14 Jan 2022 13:21:03 +0200

Hadoop master's road -- 06 -- Flume log collection

1. Flume introduction: The logo depicts the transport of wood (data) from one place (the data source) to another (the data destination) along a river channel (the channel). Working with documents 2. Installing and configuring Flume: 1. Download and upload to the server 2. Decompress 3. Configure: Copy fl ...

Added by future_man on Sun, 09 Jan 2022 10:17:28 +0200

[hadoop job] Call MapReduce to count the number of occurrences of each word in the file

1. Environment introduction: Install an Ubuntu virtual machine using VirtualBox. Install Hadoop and Eclipse 3.0 in Ubuntu 8 compiler. Download and install the Java environment (JDK) and complete the pseudo-distributed configuration of Hadoop. Import into Eclipse all the JAR packages the compiler requires. Start Had ...

Added by IRON FART on Tue, 04 Jan 2022 09:13:29 +0200

6 - click stream data analysis project - log collection to HDFS

Reference: https://blog.csdn.net/tianjun2012/article/details/62424486 The previous section covered the basics of logs, so they are not explained in detail here. Only the basic methods for generating and collecting logs are provided. ...

Added by ron8000 on Thu, 30 Dec 2021 07:23:26 +0200

CDH6.2. The whole process of brainless construction and configuration (Beginner's version)

The software download link is at the bottom. Thanks to: CSDN expert Travel through IT; bilibili expert amoscloud2013. 1. Preliminary preparation: Five 8G virtual machines, named CDH1, cdh2, cdh3, CDH4 and cdh5. The JDK is installed on all virtual machines. 2. Modify IP and host name: CentOS 7 is selected for cluster deployment. All three vir ...

Added by thefollower on Mon, 27 Dec 2021 05:46:50 +0200

hadoop_HDFS07: HDFS HA cluster configuration & ZK cluster configuration & YARN HA configuration

Note: notes (1) Cluster planning: hosts Hadoop102, Hadoop03 and Hadoop04, running ZK on all three, JournalNode on all three, NameNode on two, DataNode on all three, ResourceManager on two, and NodeManager on all three. (2) Configuring ZooKeeper clusters ...

Added by DarkArchon on Sun, 26 Dec 2021 06:36:22 +0200

HDFS slow node monitoring principle and source code analysis

Background: As HDFS clusters grow, it is inevitable that some "slow nodes" will appear in the cluster, showing up mainly as slow network data transfer and slow disk reads and writes. These slow nodes are usually hard to detect. Only when the business job data reading and writing involves the ...

Added by Vince on Sun, 19 Dec 2021 18:19:36 +0200

Troubleshooting HDFS for Hadoop optimization

This post is mainly about troubleshooting Hadoop HDFS, including NameNode fault handling, cluster safe mode and disk repair. Corrections are welcome. Thanks! NameNode fault handling 1. Scenario: The NameNode process dies and its stored data is lost. How can the NameNode be recovered? 2. Fault simulation (1) kill -9 NameNode proce ...

Added by Stressed on Sat, 18 Dec 2021 15:50:32 +0200

Flink Core Programming

1. Environment: When a Flink job is submitted for execution, it must first establish a connection with the Flink framework, that is, obtain the current Flink runtime environment. Tasks can be scheduled onto different TaskManagers only once this environment information is available. This environment object is relatively simp ...

Added by MadRhino on Wed, 24 Nov 2021 22:44:02 +0200

What exactly is the SecondaryNameNode for in HDFS

0. Preface: What does the HDFS Secondary NameNode do? This is a classic basic interview question, and the author has asked interviewees this many times (and, of course, has been asked it many times). As far as the author recalls, about half of the interviewees cannot answer correctly, and some even give the answer "is not NameNode's hobby". In order to save spa ...

Added by samusk on Thu, 09 Sep 2021 19:31:11 +0300