database instance names contain redundant data #16

wydengyre · 2024-09-13T23:40:24Z

Database instance names all look like the following:

3aa05350d41d8cd98f00b204e9800998ecf8427e
f1c841c1d41d8cd98f00b204e9800998ecf8427e
e5dde3f7d41d8cd98f00b204e9800998ecf8427e

Note that only the first eight characters change. The remaining are all identical: d41d8cd98f00b204e9800998ecf8427e

This is because the bytes are generated with this function:

pgtestdb/internal/withdb/withdb.go

Line 63 in 063644d

func randomID(prefix string) (string, error) {

The function doesn't do what it's meant to. hash.Sum(bytes) is not generating a hash of the 4 random bytes, but rather generating a constant hash of empty data and appending it to the four random bytes.

Besides this bug in the code, though, the general idea of taking a hash of the 4 random bytes is odd. It adds no entropy and only serves to make the id longer and unwieldy.

I'd suggest replacing this with a call to stdlib uuid.

The text was updated successfully, but these errors were encountered:

peterldowns · 2024-09-25T18:04:31Z

The function doesn't do what it's meant to. hash.Sum(bytes) is not generating a hash of the 4 random bytes, but rather generating a constant hash of empty data and appending it to the four random bytes.

Good catch! You're right.

Besides this bug in the code, though, the general idea of taking a hash of the 4 random bytes is odd. It adds no entropy and only serves to make the id longer and unwieldy.

Wish I could remember what I was thinking — I agree with you.

I'd suggest replacing this with a call to stdlib uuid.

I'm going to fix it by just hexifying the bytes and removing the hashing part, makes for a nice short string with the same amount of entropy. PR coming shortly.

As reported in #16, the previous implementation of `randomID` used a hashing construction (a) inappropriately, (b) incorrectly. As a result, the IDs that were being generated had a random prefix and then a consistent suffix, like this: ``` 3aa05350d41d8cd98f00b204e9800998ecf8427e f1c841c1d41d8cd98f00b204e9800998ecf8427e e5dde3f7d41d8cd98f00b204e9800998ecf8427e ``` This PR fixes the function to essentially just return the randomly generated prefixes, and omit the suffix entirely. The fixed-suffix was always a mistake and never intended, I just wasn't paying close attention beyond "are the instance database names colliding". I tested this change by running some tests that used `pgtestdb` and intentionally failed, then checking the connection strings that were printed as part of the failing test's logs. Here's an example: ``` testdbconf: postgres://pgtdbuser:pgtdbpass@localhost:5433/testdb_tpl_c2b33a22a68750f84b5af76e9774be0e_inst_a4a3ce6c?sslmode=disabl ``` The key part is `inst_a4a3ce6c`, which is now (a) shorter than before, (b) just as random.

peterldowns mentioned this issue Sep 25, 2024

fix: randomID returns shorter IDs with same amount of entropy #17

Merged

peterldowns linked a pull request Sep 25, 2024 that will close this issue

fix: randomID returns shorter IDs with same amount of entropy #17

Merged

peterldowns closed this as completed in #17 Sep 25, 2024

peterldowns added the bug something isn't working label Sep 25, 2024

peterldowns self-assigned this Sep 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

database instance names contain redundant data #16

database instance names contain redundant data #16

wydengyre commented Sep 13, 2024 •

edited

Loading

peterldowns commented Sep 25, 2024

database instance names contain redundant data #16

database instance names contain redundant data #16

Comments

wydengyre commented Sep 13, 2024 • edited Loading

peterldowns commented Sep 25, 2024

wydengyre commented Sep 13, 2024 •

edited

Loading