Figure database Neo4j

Traditional databases are difficult to deal with complex multi hop relational operations. A relational computing database that supports massive, complex and flexible structure is needed, and graph database came into being. Related concepts brief introduction The graph database consists of vertices and edges; It is mainly used for adding, del ...

Added by Tindo on Sun, 20 Feb 2022 07:12:16 +0200

2021 big data Spark: Structured Streaming Internet of things device data analysis

catalogue Data analysis of Internet of things devices Equipment monitoring data preparation Create Topic Analog data SQL style DSL style Data analysis of Internet of things devices In the era of Internet of things, a large number of sensors are collecting and producing data involving various fields every day. It makes the Internet of t ...

Added by FUNKAM35 on Sun, 20 Feb 2022 01:16:33 +0200

Introduction to the core concept of elasticSearch: simple use of search

Simple use of search In the previous chapter, we introduced the basic use of the document. If you are interested, you can refer to it Introduction to the core concept of elasticSearch (III): addition, deletion, modification and query of documents In this chapter, we will make a simple use of search. 1, Let's delete all the indexes in ...

Added by abitlikehomer on Sat, 19 Feb 2022 03:28:22 +0200

impala series: Service API--select

Basic grammar [WITH name AS (select_expression) [, ...] ] SELECT [ALL | DISTINCT] [STRAIGHT_JOIN] expression [, expression ...] FROM table_reference [, table_reference ...] [[FULL | [LEFT | RIGHT] INNER | [LEFT | RIGHT] OUTER | [LEFT | RIGHT] SEMI | [LEFT | RIGHT] ANTI | CROSS] JOIN table_reference [ON join_equality_clauses | USING ( ...

Added by CrashRoX on Sat, 19 Feb 2022 00:44:52 +0200

Big data tool Hive (basic)

1, Definition of HIVE Hive is a data warehouse tool based on Hadoop, which can map structured data files into a data table, and can read, write and manage data files in a way similar to SQL. This Hive SQL is abbreviated as HQL. Hive's execution engines can be MR, Spark and Tez. essence The essence of Hive is to convert HQL into MapReduce task ...

Added by gwood_25 on Fri, 18 Feb 2022 20:55:12 +0200

Data cleaning using poi

1. Data cleaning 1.1. What is data cleansing Data cleaning – the process of re examining and verifying data to remove duplicate information, correct existing errors, and provide data consistency. From the name of data cleaning, we can see that it is to "wash away" the "dirty", which refers to the last procedure ...

Added by Chamza on Fri, 18 Feb 2022 15:30:10 +0200

Big data Flink in 2021: Flink simulated double 11 real-time big screen statistics

  catalogue Flink simulation double 11 real-time big screen statistics demand data Coding steps: 1.env 2.source 3.transformation 4. Use the results of the above aggregation to realize the business requirements: 5.execute Code implementation Effect Flink simulation double 11 real-time big screen statistics demand     In the r ...

Added by Spitfire on Fri, 18 Feb 2022 13:57:10 +0200

pay? It's impossible! 20 lines of Python code to implement a permanent free PDF editing tool

Pdf (Portable Document Format), Chinese name "Portable Document Format", is a file format we often come into contact with. Documents, documents Many are in PDF format. With the advantage of stable format, it enables us to optimally maintain the original color and format in the process of printing, sharing and transmission.   PDF ...

Added by aniesh82 on Fri, 18 Feb 2022 04:26:34 +0200

Integrated java and scala development environment using maven

Git address: https://gitee.com/jyq_18792721831/sparkmaven.git Create Project Let's start by creating a generic maven project Create a project followed by a hello module Is also a normal maven module Increase scala dependency We don't write code in the parent project, the parent project is just for managing the child project, so th ...

Added by ahundiak on Fri, 18 Feb 2022 00:15:40 +0200

Hadoop cluster ecological construction

Hadoop cluster construction (continuous update) The relevant resource files that are not used in this paper, the extraction code eeee 1: Preparations to be completed before starting construction Built Linux serverYou can access the public network (ping www.baidu.com), and you can ping itXshell connection (can be omitted)Server version infor ...

Added by ijug.net on Thu, 17 Feb 2022 10:50:58 +0200