Data Exploration: Statistical Distributions
This article uses statistical simulation in Python to introduce four commonly used distributions: two discrete distributions (binomial and Poisson) and two continuous distributions (exponential and normal). Finally, it checks the distribution of height and weight data.
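As a rough illustration of what simulating these four distributions can look like, here is a minimal sketch that uses only Python's standard library. The parameters (n=10, p=0.3, rate 4.0, rate 0.5, mean 170 and standard deviation 10), the helper names and the sample size are assumptions made for this example and do not come from the article.

```python
import random
import statistics


def binomial_sample(n, p, rng):
    """Number of successes in n independent trials with success probability p."""
    return sum(rng.random() < p for _ in range(n))


def poisson_sample(lam, rng):
    """Count how many exponential inter-arrival times fit into one unit of time."""
    count = 0
    elapsed = rng.expovariate(lam)
    while elapsed < 1.0:
        count += 1
        elapsed += rng.expovariate(lam)
    return count


def simulate(size=10_000, seed=0):
    """Draw `size` values from each distribution (all parameters are illustrative)."""
    rng = random.Random(seed)
    return {
        "binomial": [binomial_sample(10, 0.3, rng) for _ in range(size)],
        "poisson": [poisson_sample(4.0, rng) for _ in range(size)],
        "exponential": [rng.expovariate(0.5) for _ in range(size)],
        "normal": [rng.gauss(170.0, 10.0) for _ in range(size)],
    }


samples = simulate()
for name, values in samples.items():
    mean = statistics.mean(values)
    var = statistics.variance(values)
    print(f"{name:12s} mean={mean:.3f} var={var:.3f}")
```

With these parameters the sample means should land close to the theoretical means n*p = 3, lambda = 4, 1/rate = 2 and mu = 170, which is a quick sanity check on the simulation.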
# I ...
Added by adren on Mon, 08 Jul 2019 01:45:02 +0300
Big Data Resource Monitoring: IDC Machine Room Cluster Metric Collection
Background: The company built its own IDC machine room and runs a big data cluster on it. The cluster's resources need to be monitored. The cluster is a CDH cluster, and metric collection is divided into two parts:
(1) Collection of HDFS and YARN metrics
(2) Collection of the IDC machines' own metrics
Note: Some may wonder why the CM interface a ...
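The excerpt does not show how the metrics are actually gathered. As one possible sketch for the YARN side, the code below assumes the ResourceManager REST endpoint `/ws/v1/cluster/metrics` is reachable and summarizes a few fields of its `clusterMetrics` object. The helper names, the `rm_url` parameter and the sample values are hypothetical and exist only so the example runs without a cluster.

```python
import json
from urllib.request import urlopen


def fetch_cluster_metrics(rm_url):
    """GET <rm_url>/ws/v1/cluster/metrics and return its 'clusterMetrics' object.

    rm_url is an assumed base URL of the ResourceManager web service.
    """
    with urlopen(f"{rm_url}/ws/v1/cluster/metrics", timeout=10) as resp:
        return json.load(resp)["clusterMetrics"]


def summarize(metrics):
    """Reduce the raw metrics to a few values worth monitoring."""
    total_mb = metrics["totalMB"]
    allocated_mb = metrics["allocatedMB"]
    used_pct = round(100.0 * allocated_mb / total_mb, 1) if total_mb else 0.0
    return {
        "active_nodes": metrics["activeNodes"],
        "apps_running": metrics["appsRunning"],
        "memory_used_pct": used_pct,
    }


# Made-up sample values, used here instead of a live call to fetch_cluster_metrics().
sample = {"totalMB": 8192, "allocatedMB": 2048, "activeNodes": 3, "appsRunning": 2}
summary = summarize(sample)
print(summary)
```

Keeping the HTTP call separate from the summarizing step makes it possible to test the summary logic without a running cluster.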
Added by tomprogers on Sun, 30 Jun 2019 20:02:08 +0300
RDD Lineage: Source Code Details
I. RDD Dependencies
RDD dependencies fall into two categories: wide dependencies and narrow dependencies. They can be understood as follows:
(1) Narrow dependency: each partition of the parent RDD is used by at most one partition of the child RDD.
(2) Wide dependency: each partition of the parent RDD may be used by multiple partitions of the child RDD.
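The two definitions can be illustrated with a toy model in plain Python. This is not Spark code: partitions are modeled as lists of records, and the helper names and the toy partitioner are assumptions made for this sketch.

```python
from collections import defaultdict


def child_partitions_used_by(parent_partitions, assign):
    """Map each parent partition index to the set of child partitions that read from it."""
    usage = defaultdict(set)
    for parent_idx, records in enumerate(parent_partitions):
        for record in records:
            usage[parent_idx].add(assign(parent_idx, record))
    return dict(usage)


def classify(usage):
    """Narrow if every parent partition feeds at most one child partition, otherwise wide."""
    if all(len(children) <= 1 for children in usage.values()):
        return "narrow"
    return "wide"


# Two parent partitions holding (key, value) records.
parent = [[("a", 1), ("b", 2)], [("a", 3), ("c", 4)]]
num_child_partitions = 2

# map-like transformation: child partition i reads only parent partition i.
map_usage = child_partitions_used_by(parent, lambda idx, rec: idx)


def by_key(idx, rec):
    """Deterministic toy partitioner that routes a record by its key."""
    return sum(map(ord, rec[0])) % num_child_partitions


# groupByKey-like transformation: records are routed by key (a shuffle).
shuffle_usage = child_partitions_used_by(parent, by_key)

print(classify(map_usage), map_usage)
print(classify(shuffle_usage), shuffle_usage)
```

In the shuffle case, parent partition 0 holds keys that are routed to different child partitions, so it is used by more than one child partition, which is what makes the dependency wide.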
Narrow ...
Added by infomamun on Wed, 26 Jun 2019 23:38:31 +0300
What If Apps Can't Be Downloaded Directly in WeChat? Using the Elephant Jump Interface to Open the Default Browser for the Download
What if WeChat's built-in browser can't download apps? What if WeChat always intercepts app download links? Many users have asked me these questions recently in forums and on message boards. In fact, the problem is not difficult to solve: it can be avoided entirely by using the WeChat jump API. So in response to this pr ...
Added by danielle on Mon, 27 May 2019 21:50:33 +0300
01 Python: Reading Files and Packages
Read files using open
.read()
With open, you can read files without importing any packages.
'r' means read mode.
If the file and the notebook are in the same folder, you only need to write the file name instead of the full path.
file.read() reads the entire contents of the file.
file = open('data.txt','r')
print(file. ...
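As a self-contained sketch of the same idea, the example below first writes a small sample file and then reads it back. The file name data.txt follows the excerpt; its contents, the temporary directory and the use of a with-block are assumptions added for this illustration.

```python
import os
import tempfile

# Create a small sample file so the example runs anywhere
# (the contents are made up for illustration).
workdir = tempfile.mkdtemp()
path = os.path.join(workdir, "data.txt")
with open(path, "w", encoding="utf-8") as f:
    f.write("first line\nsecond line\n")

# 'r' opens the file for reading; the with-block closes it automatically.
with open(path, "r", encoding="utf-8") as file:
    content = file.read()  # the whole file as a single string

with open(path, "r", encoding="utf-8") as file:
    lines = file.readlines()  # a list of lines, newline characters kept

print(content)
print(lines)
```

Using a with-block is a design choice here: it guarantees the file is closed even if reading raises an error, which a bare open call does not.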
Added by johnbrayn on Sun, 19 May 2019 11:04:43 +0300
Big Data Tutorial (14.2) Website Data Analysis
The previous article introduced the business background of the website clickstream data analysis project; this post continues with further background on website analysis.
I. Overall technical process and architecture
1.1. Data Processing Flow
This project is a pure data analysis project, and its overall process is basically b ...
Added by gonsman on Wed, 15 May 2019 19:12:18 +0300
Hadoop Installation and Configuration in Ubuntu
Tencent Cloud, Ubuntu 16.04.1 LTS, 64-bit
Linux operations
Modify the password of root
sudo passwd root
Log out the current user
logout
Close the firewall
ufw disable
Uninstall iptables components
apt-get remove iptables
Install vim (for text editing)
apt-get install vim
Change the console font
sudo dpkg-reconfigure console-setu ...
Added by EXiT on Tue, 14 May 2019 16:44:32 +0300