Table API for Flink learning (python version)

โ›„ After learning some basic Datastream APIs yesterday, let's continue to learn the Table API and SQL on the upper layer of Datastream. Both APIs deal with relational data, which can reduce the development threshold of flink. Students who are interested in the contents of previous periods can refer to the following contents ๐Ÿ‘‡: hadoop topics: ...

Added by nankoweap on Fri, 18 Feb 2022 17:15:19 +0200

Flinkcdc Hudi: full introduction to MySQL data entering the lake in real time III: explore and realize the high availability of FlinkCDC mysql master-slave database synchronization

Preface: Hudi series: Flinkcdc Hudi: introduction to MySQL data entering the lake in real time 1: preliminary test Flinkcdc Hudi: full introduction to MySQL data entering the lake in real time II: exceptions and solutions encountered during the integration of Hudi and Spark 1, Background In the production environment, mysql is generally eq ...

Added by rulian on Fri, 18 Feb 2022 06:00:15 +0200

DataStream API for Flink learning (python version)

๐Ÿ’ฆ Today, let's learn about the more basic DataStream API in flink, which is used to process stream data. This article is mainly in the form of pyflink to explain to the small partners who are interested in the content of previous periods ๐Ÿ‘‡: hadoop topics: hadoop series articles.spark topics: spark series.Flynk topics: Flink series. ๐Ÿ’› The A ...

Added by urgido on Thu, 17 Feb 2022 19:55:58 +0200

[Flink] Flink's transform operator

Data preparation The data used below are all the following data sensor_1,1547718199,35.8 sensor_6,1547718201,15.4 sensor_7,1547718202,6.7 sensor_10,1547718205,38.1 sensor_1,1547728199,25.8 sensor_6,1547712201,35.4 sensor_7,1547718102,16.7 sensor_10,1547712205,28.1 Basic conversion operators (Map, FlatMap, Filter) Map map is to tran ...

Added by php1 on Fri, 11 Feb 2022 01:42:33 +0200

Java integrates Flink to stream data obtained from Kafka

Last example https://blog.csdn.net/xxkalychen/article/details/117149540?spm=1001.2014.3001.5502 Setting Flink's data source to Socket is just to provide streaming data for testing. This is not generally used in production. The standard model is to obtain streaming data from message queues. Flink provides the encapsulation of connecting with Kaf ...

Added by moty66 on Wed, 09 Feb 2022 02:01:06 +0200

Flink de duplication scheme

Flink heavy De duplication calculation should be a common indicator calculation in data analysis business, such as the number of users visiting the website in a day, the number of users clicking on advertisements, etc. offline calculation is a full and one-time calculation process, and the de duplication results can usually be obtained by dist ...

Added by cowboy_x on Mon, 07 Feb 2022 05:46:01 +0200

Flink real-time data warehouse of big data project (DWM layer)

Design ideas In the past, we split the data into independent Kafka topics through diversion and other processing methods. Next, when processing the data, we should consider processing the index items used in real-time calculation. Timeliness is the pursuit of real-time data warehouse. Therefore, in some scenarios, it is not necessary to have a ...

Added by SteveMellor on Thu, 03 Feb 2022 21:34:05 +0200

Flink (55): Streaming File Sink of Flink advanced features

catalogue 0. Links to related articles 1. Introduction to streaming file writing 1.1. Scene description 1.2. Bucket, SubTask and PartFile 2. Case display 2.1. demand 2.2. Development steps 2.3. Implementation code 3. Detailed explanation of streaming file writing configuration 3.1. PartFile 3.1.1. PartFile lifecycle 3.1.2. Gene ...

Added by Xpheyel on Wed, 02 Feb 2022 23:14:14 +0200

Flink (53): end-to-end exactly once, the advanced feature of Flink

catalogue 0. Links to related articles 1. Data processing language of stream processing 1.1. At most once 1.2. At least once 1.3. Exactly once 1.4. End to end exactly once 1.5. Accurate once & effective once 1.6. How does stream computing system support consistency semantics 2. Implementation of end to end exactly once 2.1. So ...

Added by ansarka on Wed, 02 Feb 2022 23:02:31 +0200

Flink custom window trigger

Flink built-in trigger and custom window trigger flink: 1.10.1 Github: https://github.com/shirukai/flink-examples-trigger.git Some time ago, colleagues encountered a demand in development, which requires counting according to the 10 minute window of event time. At the same time, when new data is needed, the results are output every min ...

Added by twistisking on Wed, 02 Feb 2022 01:17:05 +0200