本文共 1762 字,大约阅读时间需要 5 分钟。
Kudu是一款分布式键值存储系统,可通过多种方式进行操作。
kudu-spark包将Spark与Kudu结合,编写Spark应用程序进行操作。/etc/default/impala中,添加参数-kudu_master_hosts指定Kudu主节点地址。kudu_master_hosts设置为slave217:7051。CREATE EXTERNAL TABLE my_mapping_tableSTORED AS KUDUTBLPROPERTIES ('kudu.table_name' = 'my_kudu_table'); CREATE TABLE testinkudu ( name string, age int, primary key (name))PARTITION BY HASH (name)PARTITIONS 8STORED AS KUDU [AS SELECT * FROM OTHER_TABLE];
create table testinkudu ( name string, age int, primary key (name))partition by hash (name)partitions 8stored as kudu;
create table testinkudu2 ( classes int, name string, age int, primary key (classes, name))partition by range (classes) ( partition value = 1, partition value = 2, partition value =3, partition value =4)stored as kudu;
insert into testinkudu values ('a', 12); insert into testinkudu values ('a', 12), ('b', 13), ('c', 14); insert into testinkudu select * from other_table;
-UPSERT语法支持,若主键已存在则执行UPDATE,否则执行INSERT。
-支持标准SQL删除语法,可按主键或全表删除。
本文为Kudu数据存储管理的基础指南,涵盖创建、管理及数据操作等内容,帮助用户高效使用Kudu进行数据存储和处理。
转载地址:http://smxlz.baihongyu.com/