PostgreSQL 11 New Features: Hash Partitioning

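Author: 不剪发的Tony老师
Graduated from Beihang University, with more than ten years of experience in database administration and development; currently works on database architecture design at a global financial company. Contracted lecturer at CSDN Academy and GitChat column author. The author's CSDN blog is available at: https://tonydong.blog.csdn.net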

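PostgreSQL 10 introduced declarative partitioning, but with limited functionality. PostgreSQL 11 brings many partitioning enhancements, the first of which is HASH partitioning. Hash partitioning determines the partition a row belongs to from the hash value of its partition key.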

CREATE TABLE htable (c1 bigint, c2 VARCHAR(10)) PARTITION BY HASH(c1);

 
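When creating partitions of a hash-partitioned table, use the FOR VALUES WITH clause to specify how rows are assigned to each partition: MODULUS specifies the divisor, and REMAINDER specifies the remainder left after the hash value is divided by that divisor.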

CREATE TABLE htable_p0 PARTITION OF htable FOR VALUES WITH (MODULUS 4, REMAINDER 0);
CREATE TABLE htable_p1 PARTITION OF htable FOR VALUES WITH (MODULUS 4, REMAINDER 1);
CREATE TABLE htable_p2 PARTITION OF htable FOR VALUES WITH (MODULUS 4, REMAINDER 2);
CREATE TABLE htable_p3 PARTITION OF htable FOR VALUES WITH (MODULUS 4, REMAINDER 3);
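
To make the MODULUS/REMAINDER rule concrete, the following sketch asks the server which remainder a given key falls into. It assumes the htable definition above and relies on the built-in function satisfies_hash_partition, which PostgreSQL uses internally in hash partition constraints; treat its use here as an illustration rather than a supported interface. The key value 200 is chosen arbitrarily.

```sql
-- For each possible remainder 0..3, test whether key 200 satisfies
-- "hash(c1) mod 4 = remainder" for the hash-partitioned table htable.
-- Exactly one row is expected to report true.
SELECT r AS remainder,
       satisfies_hash_partition('htable'::regclass, 4, r, 200::bigint) AS matches
FROM generate_series(0, 3) AS r;
```

The remainder reported as matching identifies the partition that would receive a row with c1 = 200.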

The REMAINDER value must be less than the MODULUS value. If fewer partitions are created than the MODULUS value, some rows cannot be inserted, because no partition exists to store them.
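
The missing-partition case can be reproduced with a small sketch. The table name htable_gap and its partition name are hypothetical and used only for this demonstration; the exact error text may differ between versions.

```sql
-- Hypothetical demo table: only one of the four possible remainders
-- gets a partition.
CREATE TABLE htable_gap (c1 bigint, c2 varchar(10)) PARTITION BY HASH (c1);
CREATE TABLE htable_gap_p0 PARTITION OF htable_gap
    FOR VALUES WITH (MODULUS 4, REMAINDER 0);

-- A remainder equal to or larger than the modulus is rejected when the
-- partition is defined, so the following statement is left commented out:
-- CREATE TABLE htable_gap_bad PARTITION OF htable_gap
--     FOR VALUES WITH (MODULUS 4, REMAINDER 4);

-- Keys that hash to remainders 1, 2, or 3 have nowhere to go, so this
-- INSERT is expected to fail with an error along the lines of
-- "no partition of relation "htable_gap" found for row".
INSERT INTO htable_gap
SELECT val, 'val:' || val FROM generate_series(1, 1000) AS val;

-- Clean up the demo table.
DROP TABLE htable_gap;
```

Because the INSERT is a single statement, the failure rolls back every row, including those that would have fit into htable_gap_p0.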

\d+ htable
                                          Table "public.htable"
 Column |         Type          | Collation | Nullable | Default | Storage  | Stats target | Description
--------+-----------------------+-----------+----------+---------+----------+--------------+-------------
 c1     | bigint                |           |          |         | plain    |              |
 c2     | character varying(10) |           |          |         | extended |              |
Partition key: HASH (c1)
Partitions: htable_p0 FOR VALUES WITH (modulus 4, remainder 0),
            htable_p1 FOR VALUES WITH (modulus 4, remainder 1),
            htable_p2 FOR VALUES WITH (modulus 4, remainder 2),
            htable_p3 FOR VALUES WITH (modulus 4, remainder 3)

Generate some data for htable and check whether it is distributed evenly across the partitions:

INSERT INTO htable SELECT val,'val:'||val FROM generate_series(1,100000) val;
INSERT 0 100000

SELECT COUNT(*) FROM htable_p0;
 count
-------
 25126
(1 row)

SELECT COUNT(*) FROM htable_p1;
 count
-------
 24978
(1 row)

SELECT COUNT(*) FROM htable_p2;
 count
-------
 24971
(1 row)

SELECT COUNT(*) FROM htable_p3;
 count
-------
 24925
(1 row)

Each partition holds roughly one quarter (about 25,000) of the rows.
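
As an alternative to counting each partition separately, the per-partition distribution can be obtained in one query through the tableoid system column, which records the partition that physically stores each row. This is a minimal sketch against the same htable; the column aliases are illustrative.

```sql
-- One row per partition, with the number of rows stored in it.
SELECT tableoid::regclass AS partition_name,
       count(*)           AS row_count
FROM htable
GROUP BY tableoid
ORDER BY tableoid;
```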

Hash partitioning also supports partition pruning:

show enable_partition_pruning;
 enable_partition_pruning
--------------------------
 on

EXPLAIN SELECT * FROM htable where c1 = 200;
                            QUERY PLAN                            
------------------------------------------------------------------
 Append  (cost=0.00..470.57 rows=1 width=17)
   ->  Seq Scan on htable_p3  (cost=0.00..470.56 rows=1 width=17)
         Filter: (c1 = 200)
(3 rows)
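
Two follow-up checks help show what pruning does and does not cover. This sketch assumes the same htable and session; plan output is omitted because costs and layout vary by installation. For hash partitioning, pruning applies to equality conditions on the partition key, so a range predicate is expected to scan every partition.

```sql
-- Range predicate on the hash key: the hash value of c1 says nothing
-- about ordering, so all four partitions are expected in the plan.
EXPLAIN SELECT * FROM htable WHERE c1 < 200;

-- Turn pruning off for the current session to compare with the plan above:
-- the equality query is then expected to list all four partitions as well.
SET enable_partition_pruning = off;
EXPLAIN SELECT * FROM htable WHERE c1 = 200;

-- Restore the default setting.
RESET enable_partition_pruning;
```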

Official documentation: Table Partitioning