1. Create the table: a partitioned, bucketed table, partitioned by date and then bucketed on id into 4 buckets

create table t1(id int)
partitioned by (statis_date string)
clustered by (id)
into 4 buckets;
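To confirm that the bucketing definition was registered, the table metadata can be inspected. This is a minimal check, assuming the table t1 above was created in the current database:

```sql
-- Show the full table metadata, including the storage section.
describe formatted t1;
-- In the output, look for "Num Buckets" (expected to be 4 here)
-- and "Bucket Columns" (expected to be [id]).
```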

2. Enable enforced bucketing

-- Needed on Hive 1.x. Hive 2.0 removed this property and always enforces bucketing.
set hive.enforce.bucketing=true;

3. Run the insert statement, loading ids 1 through 8

insert into t1 partition(statis_date='20211101')
select * from (
  select 1 id union all
  select 2 id union all
  select 3 id union all
  select 4 id union all
  select 5 id union all
  select 6 id union all
  select 7 id union all
  select 8 id ) tmp
cluster by id;

4. Result

The partition now contains 4 files, one per bucket, which matches the 4-bucket setting.

File 0 contains the ids 4 and 8, meaning that 4 and 8 hash to the same bucket.
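The mapping from ids to files can also be checked from inside Hive. The sketch below assumes Hive's original bucketing scheme, where the bucket is the non-negative hash of the column modulo the bucket count and an int hashes to its own value. Newer Hive versions may use a different hash function for bucketing, in which case the computed column will not match the files. INPUT__FILE__NAME is Hive's virtual column holding the file a row was read from; the column aliases are illustrative names only.

```sql
-- For each row, show the file it is stored in and the bucket predicted
-- by the assumed legacy rule: pmod(hash(id), 4).
select id,
       pmod(hash(id), 4) as predicted_bucket,  -- assumption: legacy bucketing hash
       INPUT__FILE__NAME as source_file        -- file the row was read from
from t1
where statis_date = '20211101'
order by predicted_bucket, id;
```

Under that assumption, ids 4 and 8 both get predicted_bucket 0, which agrees with the contents of file 0.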

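5. Table sampling

-- Syntax:
select columns from table tablesample(bucket x out of y on column);
-- x: the bucket to start sampling from (buckets are numbered from 1)
-- y: the step; one bucket is taken out of every y buckets. y should be a multiple or a factor of the table's bucket count

For example, the query below starts at bucket 1 and takes every second bucket, so it returns all the data in buckets 1 and 3: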

select id,statis_date from t1 tablesample(bucket 1 out of 2 on id);

Here the ids 4 and 8 come from bucket 1 (file 0 in the previous section), and the ids 2 and 6 come from bucket 3.
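Changing x and y selects different buckets. The variations below are a sketch against the same 4-bucket table. The comments assume the file layout described above (ids 4 and 8 in bucket 1, ids 2 and 6 in bucket 3), which leaves ids 1, 3, 5 and 7 for buckets 2 and 4.

```sql
-- Start at bucket 2 and take every 2nd bucket: buckets 2 and 4,
-- i.e. the ids not returned by "bucket 1 out of 2".
select id, statis_date from t1 tablesample(bucket 2 out of 2 on id);

-- y equal to the bucket count: exactly one bucket (bucket 1) is read.
select id, statis_date from t1 tablesample(bucket 1 out of 4 on id);
```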

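6. Uses

(1) Sampling queries

(2) Map-side join: when two tables are bucketed by the same rule on the same column, a join on that column can be done on the map side, which improves efficiency.

(3) Controlling the number of files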
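As an illustration of the map-side join case, the sketch below joins t1 with a second table bucketed the same way. The table t2 and its name column are hypothetical and not part of the original walkthrough. hive.optimize.bucketmapjoin is the Hive setting that enables bucket map joins; whether the optimizer actually chooses one depends on the Hive version and its other join settings.

```sql
-- Hypothetical second table, bucketed on the same column
-- into the same number of buckets as t1.
create table t2(id int, name string)
clustered by (id)
into 4 buckets;

-- Allow Hive to join matching buckets on the map side.
set hive.optimize.bucketmapjoin=true;

-- The MAPJOIN hint names the table whose buckets are loaded into memory.
select /*+ MAPJOIN(b) */ a.id, b.name
from t1 a
join t2 b on a.id = b.id
where a.statis_date = '20211101';
```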

Hive Bucketed Tables in Practice
