ADBDEV-6442: Refactor diskquota local_table_stats_map #34

RekGRpth · 2024-10-10T10:21:12Z

Refactor diskquota local_table_stats_map

During initialization, diskquota used a non-optimal structure for the local
hashmap local_table_stats_map. In a hashmap, there is quite a significant
overhead for each entry. Therefore, a large number of small entries led to
increased RAM consumption during cluster startup. Change the specified
structure, making the table oid as the key, and an array of sizes by segments
as the value. This significantly reduces the amount of memory consumed, because
now there will be SEGCOUNT times fewer records. Also fix a small bug with
duplicate oid tables in the active_oids string array in the dispatch_rejectmap
function.

Tests are not provided, but you can estimate the hashmap size using the hash_estimate_size function, for example, like this:

diff --git a/src/gp_activetable.c b/src/gp_activetable.c
index 0888c4d..5b7654d 100644
--- a/src/gp_activetable.c
+++ b/src/gp_activetable.c
@@ -954,6 +954,9 @@ load_table_size(HTAB *local_table_stats_map)
 	Portal                    portal;
 	char                     *sql = "select tableid, size, segid from diskquota.table_size";
 
+	elog(WARNING, "DiskQuotaActiveTableEntry = %li", hash_estimate_size(1000*1000*1000, sizeof(DiskQuotaActiveTableEntry)));
+	elog(WARNING, "ActiveTableEntryCombined = %li", hash_estimate_size(1000*1000, offsetof(ActiveTableEntryCombined, tablesize) + (1000 + 1) * sizeof(Size)));
+
 	if ((plan = SPI_prepare(sql, 0, NULL)) == NULL)
 		ereport(ERROR, (errmsg("[diskquota] SPI_prepare(\"%s\") failed", sql)));
 	if ((portal = SPI_cursor_open(NULL, plan, NULL, NULL, true)) == NULL)

that gives

2024-10-10 14:15:23.186300 +05,,,p684028,th1126361536,,,,0,con9,,seg-1,,,,sx1,"WARNING","01000","DiskQuotaActiveTableEntry = 40623489136",,,,,,,0,,"gp_activetable.c",957,
2024-10-10 14:15:23.186315 +05,,,p684028,th1126361536,,,,0,con9,,seg-1,,,,sx1,"WARNING","01000","ActiveTableEntryCombined = 8040421488",,,,,,,0,,"gp_activetable.c",958,

That is, the memory consumption for 1,000,000 tables on a 1000-segment cluster dropped from 38 gigabytes to 7.5 gigabytes.

It is easier to view the changes with the "Hide whitespace" option enabled.

src/gp_activetable.c

src/quotamodel.c

src/gp_activetable.c

src/gp_activetable.h

src/gp_activetable.c

RekGRpth added 8 commits October 10, 2024 09:58

Refactor diskquota local_table_stats_map

1dd79be

fix

4f444f6

format

ae19d34

optimize

c398337

fix

e70ff22

fix

7e6c0c3

rm

81e9f79

format

2351a5a

RekGRpth marked this pull request as ready for review October 10, 2024 10:21

RekGRpth mentioned this pull request Oct 10, 2024

ADBDEV-6442: Refactor diskquota local_table_stats_map #32

Closed

silent-observer reviewed Oct 10, 2024

View reviewed changes

src/gp_activetable.c Outdated Show resolved Hide resolved

src/quotamodel.c Outdated Show resolved Hide resolved

src/gp_activetable.c Outdated Show resolved Hide resolved

RekGRpth added 3 commits October 10, 2024 18:19

optimize

4effc9c

simplify

e1f2e18

format

e9a3b45

silent-observer previously approved these changes Oct 10, 2024

View reviewed changes

andr-sokolov reviewed Oct 16, 2024

View reviewed changes

src/gp_activetable.h Outdated Show resolved Hide resolved

andr-sokolov reviewed Oct 16, 2024

View reviewed changes

src/gp_activetable.c Outdated Show resolved Hide resolved

size and hash

ea55c53

RekGRpth dismissed silent-observer’s stale review via ea55c53 October 16, 2024 09:27

RekGRpth added 2 commits October 16, 2024 14:31

comment

2f3c9a3

format

321cdff

andr-sokolov reviewed Oct 16, 2024

View reviewed changes

src/gp_activetable.c Show resolved Hide resolved

andr-sokolov reviewed Oct 16, 2024

View reviewed changes

src/gp_activetable.c Outdated Show resolved Hide resolved

RekGRpth added 2 commits October 16, 2024 15:49

optimize

cf5bdf6

optimize

d8d6aef

andr-sokolov approved these changes Oct 16, 2024

View reviewed changes

silent-observer approved these changes Oct 16, 2024

View reviewed changes

RekGRpth merged commit 24546b2 into gpdb Oct 17, 2024
2 checks passed

RekGRpth deleted the ADBDEV-6442 branch October 17, 2024 03:16

RekGRpth mentioned this pull request Dec 20, 2024

ADBDEV-6443: Refactor diskquota local hashmap with active tables #46

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ADBDEV-6442: Refactor diskquota local_table_stats_map #34

ADBDEV-6442: Refactor diskquota local_table_stats_map #34

RekGRpth commented Oct 10, 2024 •

edited

Loading

ADBDEV-6442: Refactor diskquota local_table_stats_map #34

ADBDEV-6442: Refactor diskquota local_table_stats_map #34

Conversation

RekGRpth commented Oct 10, 2024 • edited Loading

RekGRpth commented Oct 10, 2024 •

edited

Loading