Flink (55): Streaming File Sink, an advanced Flink feature

Contents: 0. Links to related articles 1. Introduction to streaming file writing 1.1. Scenario description 1.2. Bucket, SubTask and PartFile 2. Example 2.1. Requirements 2.2. Development steps 2.3. Implementation code 3. Streaming file writing configuration in detail 3.1. PartFile 3.1.1. PartFile lifecycle 3.1.2. Gene ...

Added by Xpheyel on Wed, 02 Feb 2022 23:14:14 +0200

Flink (53): End-to-end exactly-once, an advanced Flink feature

Contents: 0. Links to related articles 1. Data processing semantics in stream processing 1.1. At most once 1.2. At least once 1.3. Exactly once 1.4. End-to-end exactly once 1.5. Exactly once & effectively once 1.6. How stream computing systems support consistency semantics 2. Implementation of end-to-end exactly once 2.1. So ...

Added by ansarka on Wed, 02 Feb 2022 23:02:31 +0200

Phoenix in practice: a summary of common basic Phoenix SQL syntax

For the detailed syntax, see the grammar reference on the official website. Here are just some commonly used statements. Common Phoenix Shell commands: !table --view table information; !describe tablename --view the table's field information; !history --view the history of executed SQL; !dbinfo; !index tb; --view the indexes of tb; !quit; --exit the Phoenix shell; hel ...

Added by cgrenda on Wed, 02 Feb 2022 08:59:25 +0200

MobileNet in practice: MobileNetV2 image classification task with TensorFlow 2.x (large dataset)

Abstract: This example uses part of the plant seedlings dataset as its data. The dataset has 12 categories. Today we will implement an image classification task in TensorFlow 2.x together; the classification m ...

Added by arbelo on Wed, 02 Feb 2022 08:40:23 +0200

Big data series: a complete collection of 23 Scala fundamentals (the most complete ever, worth bookmarking)

Official account: Data and Intelligence, Big Data Club. This series of articles on big data is organized in three parts: technical ability, business foundation and analytical thinking. You will gain: ❖ more self-confidence, so you can handle interviews with ease and land an internship or offer; ❖ a command of the basic knowledge of big d ...

Added by Wolphie on Wed, 02 Feb 2022 03:48:28 +0200

A scheme for multi-source merging and downstream synchronization with Flink CDC and Kafka

1. Foreword: This article addresses the problem that Flink SQL with Flink CDC cannot merge multiple sources across several databases and tables, and shows how to keep the downstream Kafka in sync after such a merge. At present, Flink SQL can only run Flink CDC jobs on a single table, whic ...

Added by jefkin on Tue, 01 Feb 2022 23:22:40 +0200

Java basic syntax 2

Java basic syntax. 1 Type conversion: In Java, some data types can be converted to one another. There are two cases: automatic type conversion and forced type conversion. 1.1 Implicit conversion (understand): Assign a value or variable with a smaller data range to a variable with a larger data range. This conversion mo ...

Added by poison6feet on Tue, 01 Feb 2022 20:53:55 +0200

Introduction to Superset deployment, installation and use

Superset overview: Apache Superset is an open source, modern and lightweight BI analysis tool. It connects to a variety of data sources, offers a rich set of chart types, supports custom dashboards, and has a friendly, easy-to-use interface. Superset application scen ...

Added by EPCtech on Tue, 01 Feb 2022 16:46:34 +0200

ES in practice: word frequency statistics using IK word segmentation

Introduction: Use the IK word segmenter to segment text and generate a word cloud. This article mainly describes how to compute word frequency statistics for an article through IK word segmentation. The main goal is to implement the word cloud feature shown in the figure below, and you can find the ...

Added by JKinBlack on Tue, 01 Feb 2022 09:11:17 +0200

Big data technology: Hadoop (introduction) overview, runtime environment setup and run modes

1 Hadoop overview 1.1 What is Hadoop (1) Hadoop is a distributed system infrastructure developed by the Apache Foundation. (2) It mainly solves the problems of storing massive data and of analyzing and computing on massive data. (3) In a broad sense, Hadoop usually refers to a wider concept, the Hadoop ecosystem. 1.2 Hadoop advantages (1) High ...

Added by cuongvt on Tue, 01 Feb 2022 04:04:45 +0200