[big data framework and practice] - Chapter 1 spark basic course

Section 1 Introduction to spark 1. What is spark? 1.apache spark is a unified computing engine and a set of class libraries. Using spark to process data is 100 times faster than the traditional way. 2. It is not that spark is 100 times faster than python on a single computer, but that spark is mainly used for parallel data processing on c ...

Added by hotcigar on Wed, 05 Jan 2022 08:29:00 +0200

Easy to understand and explain gradient descent method

Original author: King James, master of data science, King's College London Original address:[ Portal] preface For entry into machine learning, you must understand the gradient descent method. Although the gradient descent method is not directly used in machine learning, understanding the thinking of gradient descent method is the basi ...

Added by osiris1603 on Wed, 05 Jan 2022 08:26:59 +0200

Machine learning practice: handwritten numeral recognition-2

Machine learning practice: handwritten numeral recognition 1. Experimental description In this experiment, we will use the k-nearest neighbor algorithm to recognize handwritten digits, build the k-nearest neighbor algorithm function through the python command line, input the data provided by the experiment, classify the data, and view the cl ...

Added by Pavel_Nedved on Wed, 05 Jan 2022 07:21:31 +0200

Graduation project Week1

The picture comes from the Internet. If there is infringement, contact to delete it. This study is divided into two parts. The first part is the video learning part, and the second part is the code learning part. Part1 video learning Video 1 Introduction one ️⃣ Turing test Judge whether the box is a person or a machine outside the black ...

Added by gabeg on Wed, 05 Jan 2022 06:09:59 +0200

[scikit learn] K-nearest neighbor (KNN)

KNN KNN(K-Nearest Neighbor) is one of the simplest machine learning algorithms, which can be used for classification and regression. It is a supervised learning algorithm. The idea is that if most of the K most similar (i.e. the nearest) samples of a sample in the feature space belong to a certain category, the sample also belongs to this cate ...

Added by CyberShot on Wed, 05 Jan 2022 05:56:09 +0200

Machine learning Chapter 4 linear regression algorithm advanced 4.1 (learning notes of big talk Python machine learning)

Chapter 4 advanced linear regression algorithm    the solution of multivariable linear regression algorithm is far away from univariate linear regression algorithm, and overcomes the limitation of univariate linear regression algorithm with only one characteristic variable in practical application, so it is widely used.    ...

Added by genix2011 on Tue, 04 Jan 2022 17:00:26 +0200

Artificial intelligence handwritten numeral recognition based on neural network ANN model

1, Experimental purpose Build a multi-layer neural network model to recognize MNIST handwritten digits, and try to train the model with different super parameters, so that the final recognition accuracy of the model can reach more than 95%. 2, Algorithm steps 1. Parsing and loading data (1) Open and read the downloaded MNIST dataset from the ...

Added by PRodgers4284 on Tue, 04 Jan 2022 16:28:44 +0200

Numpy easy start

summary Numpy is a basic package for high-performance scientific computing and data analysis. It provides the function of matrix operation and is widely used in the field of deep learning and data analysis.. use Create array vector = np.asarray([1,2,3,4]); Create zero array a = np.zeros(10) # Create vector a1 = np.zeros(shape=(5,3)) # C ...

Added by Kia on Tue, 04 Jan 2022 13:15:42 +0200

Introduction and implementation of multi task learning model ESMM

Introduction: This paper introduces the paper "entire space multi task model: an e" effective approach for estimating post click conversion rate "published by Alibaba team in SIGIR '2018. Based on the idea of multi task learning (MTL), this paper proposes a CVR prediction model called ESMM, which effectively solves the two key pr ...

Added by Jurge on Tue, 04 Jan 2022 11:23:29 +0200

Matplotlib Quick Start Guide for data visualization

Charts are the basis of the data exploration process. They enable us to better understand our data - for example, help identify outliers or data processing to be done, or provide new ideas and ways to build machine learning models. Charting is an important part of any data science report.Python has many visualization libraries for making static ...

Added by jwilh on Tue, 04 Jan 2022 11:13:54 +0200