Scheduling Spark tasks in parallel with Python

Background: Translate PySpark code that implements a piece of business logic into Spark SQL, then use the Spark SQL to backfill the historical data for the past six months (run day by day). Core points: 1) translate PySpark to Spark SQL; 2) using Spark SQL, backfill the historical data of the past half year (run day by day). Implementation: 1) First, PySpark is tra ...

Added by crimsonmoon on Fri, 11 Feb 2022 03:30:23 +0200
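
For the day-by-day backfill described in the entry above, a minimal sketch might look like the following. It only builds one Spark SQL statement per day and fans the statements out to a thread pool. The table names `target_table` and `source_table`, the partition column `dt`, and the helper `run_statement` are hypothetical; a real job would replace the placeholder body with an actual submission such as a call on a Spark session.

```python
from concurrent.futures import ThreadPoolExecutor
from datetime import date, timedelta

# Hypothetical statement template: table and column names are assumptions.
SQL_TEMPLATE = (
    "INSERT OVERWRITE TABLE target_table PARTITION (dt = '{dt}') "
    "SELECT * FROM source_table WHERE dt = '{dt}'"
)


def daily_partitions(end_day, days):
    """Return `days` consecutive dates ending at end_day (inclusive), oldest first."""
    start = end_day - timedelta(days=days - 1)
    return [start + timedelta(days=i) for i in range(days)]


def build_statements(end_day, days):
    """Build one SQL statement per day in the backfill window."""
    return [
        SQL_TEMPLATE.format(dt=day.isoformat())
        for day in daily_partitions(end_day, days)
    ]


def run_statement(sql):
    """Placeholder runner (assumption): a real job would submit `sql` to Spark here."""
    return sql


def run_all(statements, max_workers=4):
    """Run the statements concurrently; results come back in input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(run_statement, statements))


if __name__ == "__main__":
    # Three days are used here for brevity; six months would be roughly 180.
    for finished in run_all(build_statements(date(2022, 2, 10), 3)):
        print(finished)
```

Generating the statements separately from running them keeps the date arithmetic easy to check before any job is submitted.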

Python script directory traversal

Basic use of the urllib3 library. urllib3 is a powerful, well-organized HTTP client library for Python, and much of the Python ecosystem already uses it. urllib3 provides many important features that are not available in the Python standard library: thread safety, connection pooling, client-side SSL/TLS verification, file segment code upl ...

Added by beinerts on Fri, 11 Feb 2022 03:05:17 +0200

Machine learning - Case Study 1: happiness prediction

Case 1 (happiness prediction). Background: Happiness involves philosophy, psychology, sociology, economics and other disciplines, and it is closely tied to everyone's life. Everyone has their own standard for measuring happiness. If we can find the commonalities that affect happiness and find the policy factors tha ...

Added by fuji on Fri, 11 Feb 2022 01:25:45 +0200

Python basics 6 -- Classes 3 (encapsulation, inheritance and polymorphism of classes)

Contents: 1. Encapsulation 2. Class inheritance 3. Polymorphism. Python programming centers on object-oriented programming, whose core concepts are encapsulation, inheritance and polymorphism. 1. Encapsulation. First, a brief introduction to encapsulation. My personal understanding is to seal the d ...

Added by sKunKbad on Fri, 11 Feb 2022 00:56:23 +0200
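
The three concepts named in the entry above can be illustrated with a small sketch. The class names `Animal`, `Dog` and `Cat` and the function `describe` are hypothetical and are not taken from the original post.

```python
class Animal:
    """Base class: encapsulates the name behind a read-only property."""

    def __init__(self, name):
        # Leading underscore marks the attribute as internal by convention.
        self._name = name

    @property
    def name(self):
        return self._name

    def speak(self):
        raise NotImplementedError("subclasses must implement speak()")


class Dog(Animal):
    """Inheritance: reuses __init__ and name from Animal."""

    def speak(self):
        return f"{self.name} says woof"


class Cat(Animal):
    def speak(self):
        return f"{self.name} says meow"


def describe(animals):
    """Polymorphism: the same call works on any Animal subclass."""
    return [animal.speak() for animal in animals]


if __name__ == "__main__":
    for line in describe([Dog("Rex"), Cat("Tom")]):
        print(line)
```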

Ensemble learning 6 -- Blending and Stacking

GitHub address: DataScicence. Ensemble learning 5: XGBoost principles and parameter tuning. Ensemble learning 4: the forward stagewise algorithm, GBDT principles and a case study. Ensemble learning 3: Boosting principles and case studies. Ensemble learning 2: bagging principles and case analysis. Ensemble learning 1: votin ...

Added by rwcurry on Fri, 11 Feb 2022 00:36:19 +0200

Python Redis tutorial

1. Install the redis package for Python: pip install redis. 2. Connect Python to Redis. The redis package provides two classes: Redis and StrictRedis. StrictRedis implements most of the official commands; Redis is a subclass of StrictRedis kept for backward compatibility with older versions (this describes redis-py 2.x; since 3.0 the two names refer to the same class). Values retrieved from Redis are bytes by default. We can set deco ...

Added by landung on Thu, 10 Feb 2022 23:37:20 +0200

A 10,000-word detailed guide to the urllib library

What is the urllib library? urllib is Python's built-in HTTP request library and needs no separate installation. It consists of four main modules: urllib.request (request module), urllib.error (exception handling module), urllib.parse (URL parsing module), and urllib.robotparser (robots.txt parsing module). urlli ...

Added by dmayo2 on Thu, 10 Feb 2022 23:35:27 +0200
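
As a small illustration of one of the modules listed in the entry above, the sketch below uses `urllib.parse` to split a URL and to build a query string. The URL and the query values are made up for the example, and no network request is made.

```python
from urllib.parse import parse_qs, urlencode, urlparse

# Example URL (an assumption, used only for illustration).
url = "https://example.com/search?q=python&page=2"

# Split the URL into its components.
parts = urlparse(url)

# Parse the query string into a dict of lists.
query = parse_qs(parts.query)

# Build a query string from a dict; spaces are encoded as '+'.
encoded = urlencode({"q": "spark sql", "page": 1})

if __name__ == "__main__":
    print(parts.scheme, parts.netloc, parts.path)
    print(query)
    print(encoded)
```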

[Notes] Morvan Python | TensorFlow tutorial - advanced content (Chapter 5)

5.1 Classification learning. A classification problem has a qualitative output: it predicts a discrete variable. A regression problem has a quantitative output: it predicts a continuous variable. """ Please note, this code is only for python 3+. If you are using python 2+, please modify the code accordingly. """ f ...

Added by sdyates2001 on Thu, 10 Feb 2022 23:21:41 +0200

Python -- gevent explained in detail -- practical use in a project

Brief description of three communication models: (1) Polling: the client sends requests to the server periodically and continuously at high frequency: client request - server response - disconnect. The request rate (QPS) is high, so the hardware requirements for both client and server are relatively high. (2) Long polling: Th ...

Added by xluminex on Thu, 10 Feb 2022 22:37:49 +0200
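
The plain polling model described in the entry above (request, response, disconnect, repeat) can be sketched without any networking. `FakeServer` and `poll` are hypothetical names, and the in-process "server" simply becomes ready after a fixed number of requests; this is not gevent code and is only meant to show the request pattern.

```python
import time


class FakeServer:
    """Stand-in for a remote server (assumption): has data after N requests."""

    def __init__(self, ready_after):
        self.calls = 0
        self.ready_after = ready_after

    def handle_request(self):
        # Each call models one request / response / disconnect cycle.
        self.calls += 1
        if self.calls >= self.ready_after:
            return "data"
        return None


def poll(server, interval=0.01, max_attempts=10):
    """Ask the server repeatedly until it returns data or attempts run out."""
    for attempt in range(1, max_attempts + 1):
        response = server.handle_request()
        if response is not None:
            return response, attempt
        time.sleep(interval)  # wait before the next request
    return None, max_attempts


if __name__ == "__main__":
    print(poll(FakeServer(ready_after=3)))
```

Every empty response still costs a full request, which is why the request rate grows quickly as the polling interval shrinks.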

Python crawler basics notes - requests 2

requests: crawl the Sogou search results page for a specified term (a simple web page collector). UA: User-Agent (the identity of the request carrier). UA spoofing: a portal site's server checks the carrier identity of each incoming request. If it detects that the carrier identity of the request is a browser, it ...

Added by iBuddy on Thu, 10 Feb 2022 21:01:24 +0200
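
The UA spoofing idea in the entry above amounts to sending a browser-like User-Agent header with the request. The sketch below only constructs such a request and does not send it. It uses the standard library's `urllib.request` rather than the `requests` package the post is about, so that it runs without extra installs. The base URL, the `query` parameter name, and the User-Agent string are assumptions chosen for illustration.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical values: placeholder endpoint and a browser-like UA string.
BASE_URL = "https://example.com/search"
BROWSER_UA = "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36"


def build_search_request(term):
    """Build (but do not send) a GET request that presents a browser User-Agent."""
    url = f"{BASE_URL}?{urlencode({'query': term})}"
    return Request(url, headers={"User-Agent": BROWSER_UA})


if __name__ == "__main__":
    request = build_search_request("python")
    print(request.full_url)
    # urllib stores header names capitalized as "User-agent".
    print(request.get_header("User-agent"))
```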