Statistical Distribution of Data Exploration

In this paper, Python statistical simulation method is used to introduce four commonly used statistical distributions, including discrete distribution: binomial distribution and Poisson distribution, and continuous distribution: exponential distribution and normal distribution. Finally, the distribution of height and weight data is checked. # I ...

Added by adren on Mon, 08 Jul 2019 01:45:02 +0300

Large Data Resource Monitoring-IDC Room Cluster Indicator Acquisition

Background:IDC computer room built by the company, based on IDC computer room to build a large data cluster; need to monitor cluster resources, the cluster uses CDH cluster, collection is mainly divided into two parts: Collection of indexes related to HDFS and YARNIDC machine's own index collection Note: Some may wonder why the CM interface a ...

Added by tomprogers on Sun, 30 Jun 2019 20:02:08 +0300

RDD blood relationship source code details!

I. Dependency of RDD RDD dependencies fall into two categories: wide dependencies and narrow dependencies. We can think of it as follows: (1) Narrow dependencies: The partition of each parent RDD is used by at most one partition of the child RDD. (2) Wide dependency: Each parent RDD partition is used by multiple child RDD partitions. Narrow ...

Added by infomamun on Wed, 26 Jun 2019 23:38:31 +0300

What if app s can't be downloaded directly from micro-mail? Using Elephant Jump Interface to Open Default Browser to Download

What if Wechat's built-in browser can't download apps? What if Wechat always intercepts app download links? This is a question that many users consulted me recently in forums and post bars. In fact, this problem is not difficult to solve. We can avoid the above problems perfectly by using the Wechat jump API interface.So in response to this pr ...

Added by danielle on Mon, 27 May 2019 21:50:33 +0300

01 python reads files and packages

Read files using open .read() Using open, you can read files without requiring any packages r means read. If the file and notebook are in the same folder, you just need to write the name of the file instead of filling in the full path. file.read means to read everything in this file file = open('data.txt','r') print(file. ...

Added by johnbrayn on Sun, 19 May 2019 11:04:43 +0300

Big Data Tutorial (14.2) Website Data Analysis

The previous article introduced the business background of the website click stream data analysis project; this blogger will continue to share the relevant knowledge of website analysis. I. Overall technical process and architecture 1.1. Data Processing Flow This project is a pure data analysis project, and its overall process is basically b ...

Added by gonsman on Wed, 15 May 2019 19:12:18 +0300

Hadoop Installation and Configuration in Ubuntu

Tencent Yun ubuntu 16.04.1 LTS 64 bits Linux operation Modify the password of root sudo passwd root Log off users logout Close the firewall ufw disable Uninstall iptables components apt-get remove iptables Download vim components (for text editing) apt-get install vim Word change sudo dpkg-reconfigure console-setu ...

Added by EXiT on Tue, 14 May 2019 16:44:32 +0300