Doris is an interactive SQL data warehouse based on MPP, which is mainly used to solve the problems of reports and multidimensional analysis
Frontend (FE): metadata and scheduling node of Doris system. In the import process, it is mainly responsible for the scheduling of import tasks. Backend (BE): th ...
Added by johnh2009 on Sat, 29 Jan 2022 13:27:52 +0200
DataX is an open source version of Alibaba cloud DataWorks data integration. It is an offline data synchronization tool / platform widely used in Alibaba group. DataX implements efficient data synchronization between various heterogeneous data sources, including MySQL, Oracle, SqlServer, Postgre, HDFS, Hive, ADS, HBase, TableStore(OTS), MaxC ...
Added by Salis on Wed, 15 Dec 2021 05:59:35 +0200
insert overwrite implementation of Doris partition bucket table
(1) Real time project If it is real-time, we should pay attention to timeliness. Directly select the unique key model table and select the non repeatable columns as unique keys
A simple example of creating a single partition table
CREATE TABLE order_tab
orderid BIGINT, ...
Added by snipesh0tz on Tue, 14 Dec 2021 10:25:54 +0200
Spark Doris Connector is a new feature introduced by Doris in version 0.12. Users can use this function to directly read and write the data stored in Doris through spark, and support SQL, Dataframe, RDD and other methods.
From the perspective of Doris, introducing its data into Spark can use a series of rich ecological products of Spark, bro ...
Added by jackyhuphp on Wed, 13 Oct 2021 00:15:42 +0300
In some usage scenarios, the user will partition the table by day and perform routine tasks regularly every day. At this time, the user needs to manually manage the partition. Otherwise, the data import may fail because the user does not create a partition, which brings additional maintenance costs to the user.
Through the ...
Added by lost305 on Tue, 28 Sep 2021 08:33:08 +0300