Graph database Neo4j
Traditional databases struggle with complex multi-hop relationship queries. A database was needed that supports relationship computation over massive, complex, and flexibly structured data, and so the graph database came into being.
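To make "multi-hop" concrete, here is a minimal sketch in plain Python (not Neo4j or Cypher). The toy graph `EDGES` and the helper `within_hops` are hypothetical names made up for illustration; the sketch only shows what it means to follow relationships several edges away from a starting vertex.

```python
from collections import deque

# Hypothetical toy graph: each vertex maps to the vertices its outgoing edges reach.
EDGES = {
    "alice": ["bob", "carol"],
    "bob": ["dave"],
    "carol": ["dave"],
    "dave": ["erin"],
    "erin": [],
}


def within_hops(graph, start, max_hops):
    """Return {vertex: hop_count} for vertices reachable from start in at most max_hops edges."""
    hops = {start: 0}
    queue = deque([start])
    while queue:
        vertex = queue.popleft()
        if hops[vertex] == max_hops:
            continue  # do not expand beyond the hop limit
        for neighbour in graph.get(vertex, []):
            if neighbour not in hops:  # breadth-first order: first visit has the fewest hops
                hops[neighbour] = hops[vertex] + 1
                queue.append(neighbour)
    return hops


print(within_hops(EDGES, "alice", 2))
```

In a relational store, each extra hop in a query like this typically costs another self-join on an edge table, which is the difficulty the paragraph above refers to.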
Related concepts
Brief introduction
A graph database consists of vertices and edges; it is mainly used for adding, del ...
Added by Tindo on Sun, 20 Feb 2022 07:12:16 +0200
2021 big data Spark: Structured Streaming analysis of Internet of Things device data
Contents
Data analysis of Internet of Things devices
Preparing device monitoring data
Create Topic
Simulated data
SQL style
DSL style
Data analysis of Internet of Things devices
In the era of the Internet of Things, large numbers of sensors collect and produce data across many fields every day. It makes the Internet of t ...
Added by FUNKAM35 on Sun, 20 Feb 2022 01:16:33 +0200
Introduction to the core concepts of Elasticsearch: simple use of search
Simple use of search
In the previous chapter, we introduced the basic use of documents; if you are interested, see "Introduction to the core concepts of Elasticsearch (III): adding, deleting, modifying and querying documents". In this chapter, we make simple use of search.
1. Let's delete all the indexes in ...
Added by abitlikehomer on Sat, 19 Feb 2022 03:28:22 +0200
impala series: Service API--select
Basic syntax
[WITH name AS (select_expression) [, ...] ]
SELECT
[ALL | DISTINCT]
[STRAIGHT_JOIN]
expression [, expression ...]
FROM table_reference [, table_reference ...]
[[FULL | [LEFT | RIGHT] INNER | [LEFT | RIGHT] OUTER | [LEFT | RIGHT] SEMI | [LEFT | RIGHT] ANTI | CROSS]
JOIN table_reference
[ON join_equality_clauses | USING ( ...
Added by CrashRoX on Sat, 19 Feb 2022 00:44:52 +0200
Big data tool Hive (basic)
1. Definition of Hive
Hive is a data warehouse tool built on Hadoop. It maps structured data files to tables and lets you read, write, and manage those files with SQL-like statements. This Hive SQL is abbreviated as HQL. Hive's execution engine can be MapReduce (MR), Spark, or Tez.
Essence
The essence of Hive is to convert HQL into MapReduce task ...
Added by gwood_25 on Fri, 18 Feb 2022 20:55:12 +0200
Data cleaning using poi
1. Data cleaning
1.1. What is data cleaning
Data cleaning is the process of re-examining and verifying data in order to remove duplicate information, correct existing errors, and ensure data consistency.
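The three goals named above (remove duplicates, correct errors, ensure consistency) can be sketched in a few lines. This is a minimal illustration in plain Python rather than POI; the record fields `name` and `email` and the helper `clean_records` are assumptions made up for the example, not part of the original article.

```python
def clean_records(records):
    """Normalise each record, then drop records that duplicate an earlier one."""
    seen = set()
    cleaned = []
    for record in records:
        # Correct formatting errors: collapse stray whitespace in the name.
        name = " ".join(record["name"].split())
        # Enforce consistency: emails are stored trimmed and lower-cased.
        email = record["email"].strip().lower()
        # Remove duplicates: compare on the normalised values, ignoring name case.
        key = (name.lower(), email)
        if key in seen:
            continue
        seen.add(key)
        cleaned.append({"name": name, "email": email})
    return cleaned


rows = [
    {"name": " Ann  Lee ", "email": "Ann@Example.com"},
    {"name": "Ann Lee", "email": "ann@example.com "},
    {"name": "Bo Chen", "email": "bo@example.com"},
]
print(clean_records(rows))
```

Normalising before comparing matters: the first two rows only count as duplicates once whitespace and letter case have been made consistent.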
As the name suggests, data cleaning means "washing away" the "dirty" parts; it refers to the last procedure ...
Added by Chamza on Fri, 18 Feb 2022 15:30:10 +0200
Big data Flink in 2021: simulating Double 11 real-time big-screen statistics with Flink
Contents
Flink simulation of Double 11 real-time big-screen statistics
Requirements
Data
Coding steps:
1.env
2.source
3.transformation
4. Use the results of the above aggregation to realize the business requirements:
5.execute
Code implementation
Effect
Flink simulation of Double 11 real-time big-screen statistics
Requirements
In the r ...
Added by Spitfire on Fri, 18 Feb 2022 13:57:10 +0200
Pay? No way! 20 lines of Python code to build a permanently free PDF editing tool
PDF (Portable Document Format) is a file format we come into contact with often; many documents are in PDF format. Because the format is stable, it preserves the original colors and layout when a document is printed, shared, or transmitted.
PDF ...
Added by aniesh82 on Fri, 18 Feb 2022 04:26:34 +0200
Integrated Java and Scala development environment using Maven
Git address: https://gitee.com/jyq_18792721831/sparkmaven.git
Create Project
Let's start by creating a generic Maven project
Create a project, followed by a hello module
It is also a normal Maven module
Add the Scala dependency
We don't write code in the parent project; the parent project only manages the child projects, so th ...
Added by ahundiak on Fri, 18 Feb 2022 00:15:40 +0200
Building a Hadoop cluster ecosystem
Hadoop cluster construction (continuously updated)
The relevant resource files that are not used in this paper, the extraction code eeee
1. Preparations to be completed before starting construction
A Linux server has been set up
It can access the public network (ping www.baidu.com), and you can ping it
Xshell connection (can be omitted)
Server version infor ...
Added by ijug.net on Thu, 17 Feb 2022 10:50:58 +0200