Should I use hash or btree for a foreign key index in postgresql 9.3?

Question

Which index will perform better for foreign keys of type integer in postgresql 9.3?

I would assume a hash index, because foreign key comparisson are always made with =

Or does a btree compare as fast as a hash when used for JOINS on foreign keys?

Because in postgresql primary keys use btree's that would suggest they are also better for foreign keys.

Evan Carroll · Accepted Answer · 2018-10-26 23:24:08Z

10

Caution Hash index operations are not presently WAL-logged, so hash indexes might need to be rebuilt with REINDEX after a database crash if there were unwritten changes. Also, changes to hash indexes are not replicated over streaming or file-based replication after the initial base backup, so they give wrong answers to queries that subsequently use them. For these reasons, hash index use is presently discouraged.

There is also no proof that an hash index has any performance benefits over a btree.

edited Oct 26, 2018 at 23:24

Evan Carroll

1

answered Oct 10, 2014 at 11:38

Frank Heikens

129k26 gold badges157 silver badges153 bronze badges

Sign up to request clarification or add additional context in comments.

5 Comments

sudo Over a year ago

Very strange. You'd think a hash index would be much faster, but numerous benchmarks have shown that it's only slightly faster at best. Hash is much faster than tree-based for typical in-memory data structures (e.g. Java's HashMap vs TreeMap). Any reason it's slow in Postgres, or is it just because Postgres's hash index implementation is not well-maintained?

Kevin Parker Over a year ago

This is no longer relevant in postgresql 10+ postgresql.org/docs/10/sql-createindex.html

OrangeDog Over a year ago

@sudo big trees have much better data locality, while hash tables are randomly distributed. The cache performance makes a big difference especially when reading from disk.

sudo Over a year ago

IIRC even in Postgres 10+ switching my indexes to hash didn't show any improvement. I forget what the situation was. Data locality... I don't understand why that'd help during equality checks unless one side is sorted.

OrangeDog Over a year ago

@sudo if one side is a btree index scan, then yes it will be sorted

OrangeDog · Accepted Answer · 2022-07-15 09:17:10Z

3

It depends.

If you're not doing any queries using the foreign key, then you don't need any index. The referential integrity is enforced using the index of the referenced primary key.

The question of which index type to use (if any) is thus the same for any column(s). If your queries would benefit from an index for a = comparison, and you're using PostgreSQL 10 or newer, then a HASH index is a reasonable choice. If the same column is involved in any ordering operations (ORDER BY, <, >=, etc.) then you may as well use a BTREE.

If you are concerned about the relative performance, then you'll need to test them yourself, using your own data distribution and query load. A tree index may still perform better than a hash, due to data access locality (sequential vs. random).

edited Jul 15, 2022 at 9:17

answered Jun 10, 2020 at 15:46

OrangeDog

39.3k18 gold badges141 silver badges227 bronze badges

Collectives™ on Stack Overflow

Should I use hash or btree for a foreign key index in postgresql 9.3?

2 Answers 2

5 Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

5 Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related