How to make my postgresql database use a case insensitive collation?

Question

In several SO posts OP asked for an efficient way to search text columns in a case insensitive manner.

As much as I could understand the most efficient way is to have a database with a case insensitive collation. In my case I am creating the database from scratch, so I have the perfect control on the DB collation. The only problem is that I have no idea how to define it and could not find any example of it.

Please, show me how to create a database with case insensitive collation.

I am using postgresql 9.2.4.

EDIT 1

The CITEXT extension is a good solution. However, it has some limitations, as explained in the documentation. I will certainly use it, if no better way exists.

I would like to emphasize, that I wish ALL the string operations to be case insensitive. Using CITEXT for every TEXT field is one way. However, using a case insensitive collation would be the best, if at all possible.

Now https://stackoverflow.com/users/562459/mike-sherrill-catcall says that PostgreSQL uses whatever collations the underlying system exposes. I do not mind making the OS expose a case insensitive collation. The only problem I have no idea how to do it.

PostgreSQL uses whatever collations the underlying operating system exposes. The system table "pg_collation" is populated by initdb. Use select * from pg_collation; to see which collations it found. — Mike Sherrill 'Cat Recall'
– Mike Sherrill 'Cat Recall', Commented Sep 15, 2013 at 2:57
@mark: That's why I posted it as a comment, not as an answer. If you run that query, and you find no case-insensitive collations, that's probably your answer. — Mike Sherrill 'Cat Recall'
– Mike Sherrill 'Cat Recall', Commented Sep 15, 2013 at 14:03
Possible duplicate of stackoverflow.com/q/17422054/157957 and stackoverflow.com/q/1929590/157957 — IMSoP
– IMSoP, Commented Sep 15, 2013 at 22:14

user8870331 · Accepted Answer · 2019-11-29 08:54:24Z

44

A lot has changed since this question. Native support for case-insensitive collation has been added in PostgreSQL v12. This basically deprecates the citext extension, as mentioned in the other answers.

In PostgreSQL v12, one can do:

    CREATE COLLATION case_insensitive (
      provider = icu,
      locale = 'und-u-ks-level2',
      deterministic = false
    );

    CREATE TABLE names(
      first_name text,
      last_name text
    );

    insert into names values
      ('Anton','Egger'),
      ('Berta','egger'),
      ('Conrad','Egger');

    select * from names
      order by
        last_name collate case_insensitive,
        first_name collate case_insensitive;

See https://www.postgresql.org/docs/current/collation.html for more information.

answered Nov 29, 2019 at 8:54

user8870331

Sign up to request clarification or add additional context in comments.

10 Comments

user330315 Over a year ago

Note that this depends on the operating system and the ICU version that comes with it.

Stefan Anghel Over a year ago

Note that as of PostgreSQL v12, non deterministic collations DO NOT support LIKE and LIKE

user275801 Over a year ago

What does "und-u-ks-level2" mean?

AndrewL Over a year ago

@user275801 See the Unicode Collation Settings table for more info on this syntax.

0xced Over a year ago

Overview of ICU collation settings by Peter Eisentraut perfectly explains what all the parts of und-u-ks-level2 mean.

|

RobDil · Accepted Answer · 2014-08-13 09:30:50Z

10

For my purpose the ILIKE keyword did the job.

From the postgres docs:

The key word ILIKE can be used instead of LIKE to make the match case-insensitive according to the active locale. This is not in the SQL standard but is a PostgreSQL extension.

edited Aug 13, 2014 at 9:30

answered Feb 17, 2014 at 13:05

RobDil

4,57448 silver badges58 bronze badges

2 Comments

Craig Ringer Over a year ago

Unless you escape the pattern, this will produce wrong results for strings containing _ or % if you attempt to use it like =.

GeorgiG Over a year ago

Problem is, frameworks like TypeORM has no support for ILIKE in their find conditions.

Craig Ringer · Accepted Answer · 2014-08-22 01:43:25Z

10

There are no case insensitive collations, but there is the citext extension:

http://www.postgresql.org/docs/current/static/citext.html

edited Aug 22, 2014 at 1:43

Craig Ringer

329k84 gold badges742 silver badges820 bronze badges

answered Sep 15, 2013 at 21:48

Denis de Bernardy

79.1k14 gold badges138 silver badges158 bronze badges

1 Comment

user8870331 Over a year ago

See my other answer (stackoverflow.com/a/59101567/8870331) for PostgreSQL v12 and beyond.

Štefan Bartoš · Accepted Answer · 2016-08-31 04:56:54Z

2

This is not changing collation, but maybe somebody help this type of query, where I was use function lower:

SELECT id, full_name, email FROM nurses WHERE(lower(full_name) LIKE '%bar%' OR lower(email) LIKE '%bar%')

answered Aug 31, 2016 at 4:56

Štefan Bartoš

5716 silver badges8 bronze badges

Comments

glyphobet · Accepted Answer · 2013-11-30 19:31:47Z

-4

I believe you need to specify your collation as a command line option to initdb when you create the database cluster. Something like

initdb --lc-collate=en_US.UTF-8

It also seems that using PostgreSQL 9.3 on Ubuntu and Mac OS X, initdb automatically creates the database cluster using a case-insensitive collation that is default in the current OS locale, in my case, en_US.UTF-8.

Could you be using an older version of PostgreSQL that does not default to the host locale? Or could it be that you are on an operating system that does not provide any case-insensitive collations for PostgreSQL to choose from?

answered Nov 30, 2013 at 19:31

glyphobet

1,56411 silver badges17 bronze badges

4 Comments

mark Over a year ago

I am using now PostgreSQL 9.3 on Windows 7 and 8. I have no idea whether they provide a case insensitive collation for PostgreSQL. I know that SQL Server can be configured with such a collation.

glyphobet Over a year ago

I can't help with Windows... but it sounds like that's the place to start. Find out what case-insensitive collations Windows provides, and see if you can tell PostgreSQL to use one of them when it creates the cluster.

Berend de Boer Over a year ago

From my experiments with PostgreSQL 9.6, the --lc-collate=en_US.UTF-8 option does not produce a case-insensitive collation.

Laurenz Albe Over a year ago

This answer is wrong. The collation is not case insensitive.

Collectives™ on Stack Overflow

How to make my postgresql database use a case insensitive collation?

5 Answers 5

10 Comments

2 Comments

1 Comment

Comments

4 Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

10 Comments

2 Comments

1 Comment

Comments

4 Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related