Postgres FM

Nikolay and Michael discuss a common question — why Postgres isn't using an index, and what you can do about it!
 
Here are some links to things they mentioned:

~~~

What did you like or not like? What should we discuss next time? Let us know via a YouTube comment, on social media, or by commenting on our Google doc!

~~~

Postgres FM is brought to you by:
With special thanks to:

Creators & Guests

Host
Michael Christofides
Founder of pgMustard
Host
Nikolay Samokhvalov
Founder of Postgres AI

What is Postgres FM?

A weekly podcast about all things PostgreSQL

Nikolay: Hello, hello, this is
PostgresFM episode, I don't remember

the number, and today... I'm Nikolay
and this is Michael.

Hi, Michael.

Michael: Hello, Nikolay.

I think 85.

Nikolay: Well, honestly, it's
already at this point, it

doesn't matter.

Michael: When we get to 100, it
will matter.

Nikolay: Yeah, remind me, wake
me up.

Yeah, at 100.

So today I'm in the passenger seat
and Michael is going to talk

about, I will join of course, but
from the passenger seat, about

why the planner behaves not as
expected, not choosing index I

have prepared and how to troubleshoot
that, right?

Michael: Yeah, yeah, this was my
suggestion.

So it's something I see a bunch
and I got reminded again yesterday.

I happened to be on LinkedIn
of all social networks.

It's surprisingly one of the slightly
nicer social networks to

be on these days, which I would
have been shocked by a few years

ago.

But I saw a post from Brent Ozar,
who I remember from my Microsoft

SQL Server days, and has been recently
getting into Postgres

things, which is cool.

I think it would be a great addition
to the community.

And he posted what I thought was
going to be a bug post, and

he cheekily said, why isn't Postgres
using my functional index?

And it turned out it was actually
a Stack Exchange question that

he had posted, and he had included
links to an Aurora database,

which, not Postgres, but Postgres
compatible, which is fun.

And he was genuinely asking why
isn't it using my index?

And I'd noticed it was only posted
like sometimes on LinkedIn,

you see posts from like 3 weeks
ago, but it was only posted about

20, 25 minutes ago or something.

I thought, 'Oh, you know, there's
no answers to this.

I, this is something I generally
know about.

I've written a whole blog post
on.'

So maybe I can help here.

I had a look.

And it just reminded me that this
is a remarkably common issue

for beginners for a bunch of reasons,
but even for some quite

expert database users, sometimes
get confused as to why they've

got an index that Postgres should
be using or they think they

do.

And for some reason, Postgres either
can't use it, and they're

expecting it to be able to, or
doesn't think it will be faster

and is choosing a different plan.

So, yeah, I've seen this a bunch
of times.

I feel like there's probably about
10 different reasons it could

be, which is I think slightly,
like, people don't realize quite

how many reasons there could be
for this.

So I thought it might make an interesting
discussion.

Nikolay: Yeah, well, yeah, I remember
this struggle.

Now I know how to troubleshoot
it, but I remember the struggle

I had many times in the past.

It's not easy sometimes to understand
why a planner behaves in

one way, not as you expected.

And sometimes you just, it upsets
you a lot and you start thinking,

oh, it's stupid.

I saw also people started blaming
Postgres a lot after such events.

They think like, oh, planner is
like, I want control.

Also, this is where you probably
want to start wanting this hinting

which Postgres lacks, right?

Well, it has it, but it's not like
standard.

Michael: Yeah, I think we probably
will get to that.

You said you now know how to solve
it out of interest.

What's the first thing you'll do?

Nikolay: Well, first thing I'll
do, I'll try to understand, okay,

This is the moment, usually I don't
pay attention a lot, but

this is the exact moment when I
start paying attention to cost

metrics in the explain analyze
buffers plan.

So the idea is I try to understand,
is it really a huge number

or like it's relatively small or
tiny?

And then I think, okay, based on
my experience, and here actually

it helps if you have some experience,
right?

Based on my experience, if my index
was used, what would be this

cost, right?

And then, like, basically I already
explained my algorithm.

So, okay, I see some cost.

I wanted to say actual cost, but
the word actual we must reserve

for execution, not for planning,
right?

But this is the cost you see in
the root of this tree of query

execution.

Actually, we don't need to analyze
here, right?

We just say, explain and it's enough.

Michael: For the total cost that
you're talking about, I guess

you're talking about, they're called
startup cost and total cost.

I guess you're talking about the
total cost on the root node.

Nikolay: In the root, yeah, yeah.

We just, like, we have the plan,
we see our index we hoped would

be used, it's not present there,
and we see different index or

maybe sequential scan, and we think,
oh, what's happening here?

And so I see the total cost, probably
also check the successive

method node where data access is
happening, and the cost there

as well.

It's also important.

And then I think, oh, if my index
was used, what would be the

cost?

And since I'm experienced, I'm
sometimes already understanding

what's happening.

But if I have doubts, I just start
playing with these rough knobs.

enable seq scan, enable index scan,
enable blah blah, to put

penalty on specific access methods,
right?

Or so, to specific steps.

Michael: Yeah, scan types.

Nikolay: Yeah, Right, scan types.

And in this case, very often I
start seeing what I expected and

I see, oh, like, this would be
the cost.

And I see the difference.

And sometimes it's very small.

So we were very close to choose
my plan, but it was not chosen

just because...

And then super often, very often,
it's a new system and I ask

people, what random page cost
do you have?

Because sequential scan, our index
is not used.

Oh, and the random page cost is
4.

Recently, some Amazon guys, they
got consultation with me and

this is exactly what happened.

I asked what are the random page
costs?

They said 4.

Remember, Crunchy Bridge, after
listening to us, changed it to

1.1, and they also published good
benchmarks, and it's still

in my to-do to revisit those benchmarks
because I had a tendency

to set random page cost to 1, but
they said 1.1 is better, actually.

According to our benchmarks, and
they had some methodology interesting,

but it's slightly off topic.

So, for those listeners who don't
know what random page cost

is, is how the planner thinks about
random access to data.

And by default, it expects you
using magnetic disks, rotational

disks.

And this is not normal in 2024,
definitely.

And I don't know if Aurora and
RDS still have 4.

That cluster may be created long
ago, but if they still do have

this, This is another question
to them.

We had the question about CPU and
green color in performance

insights last time.

But today we have a new question.

We need to check.

I haven't created clusters for
long.

Yeah.

Michael: Last time I checked, most
of them were.

There were a couple of good exceptions
like ScaleGrid and now

Crunchy Bridge but there aren't
many that have tuned it last

time I checked which was admittedly
quite a while ago.

Nikolay: This is super strange
and also Postgres community.

It's time to revisit this and we
need this for mostly for new

clusters.

You shouldn't think too much about
old clusters, they already

have postgresql.conf in place and
everything there.

So, change it.

Michael: And I would say, why are
we optimizing for the few people

that are still running magnetic
disks instead of the vast majority

who are running on SSDs.

It's time to change, I think, for
the default.

Nikolay: I might be, again, taking
too much time for a passenger,

but let me just, I just feel this
that maybe some people don't

understand what we are talking
about.

I will be very short.

So there is a seqpage cost and
random page cost.

Seqpage cost means sequential,
the cost of sequential data access.

And it's 1.

If you change it, you change scale.

I have cases when people change
it.

But it's 1, it's our baseline.

It's 1.

And random page cost by default
in Postgres is 4.

It expects that random data access
is 4 times more expensive

than sequential page access, which
is not so if you, for example,

have all data fit in memory, cached,
or if you have, if it doesn't

fit, but if you have non-rotational
disks, SSD, NVMe SSD, or

something like that, modern disks.

And most of new databases have
modern disks, of course.

And Aurora, I think they have very
good storage, so obviously

it's not rotational.

I think maybe I'm wrong, actually,
right?

Should not be rotational.

It's very slow.

I mean, throughput is terrible,
right?

Okay.

So, and why does it matter here?

Because if you have 1 and 4 default
settings, 1 sequential and

4 random access, it's 4 times more
expensive.

The planner quite often thinks,
oh, sequential scan is not that

bad.

I would prefer sequential scan
over index access if I need to

fetch a lot of rows, because it
seems to be cheaper for me.

Once you shift it to normal 1.1,
1.1 is normal these days.

This is common understanding.

And Crunchy Bridge have good benchmarks.

Let's attach the link.

So in this case, you tell the planner,
actually, sequential access,

random access, they're almost the
same.

And index scan starts winning.

Yeah.

Michael: Yeah.

Or at least it's only a 10% penalty
instead of a 300% penalty,

which is a huge difference once
you start returning a bunch of

a lot of rows for a single.

Yeah.

Which happens quite often.

There's quite a lot.

Like, anyway, I'm very happy for
you to do a bunch of talking

here.

It's a pretty common topic.

I like that we've gone this deep
so quickly, but I do want to

go, I think it's worth going back
and saying, this is a great

avenue for like, like you mentioned
looking at the costs, but

I think there's an easy, like even
for non-experts, you can,

you can use that to your advantage,
right?

You can, if you have access, I
did, I didn't, when I was looking

into Brent's problem.

It was a read only user, so I couldn't
do enable set scan off,

which is the easiest way if you're
getting a sequential scan,

flipping it to, well, basically
telling the planner that sequential

access is tons more expensive than
it really is, and therefore,

if it has any ability to use the
index, no matter how expensive

it should be, it will then use
it.

If it can't, though, enableSeqScanOff
doesn't actually prevent

the planner using seq scans.

It just penalizes it.

It makes it look more expensive.

So if you cannot use the index,
which is a bunch of these cases

are Postgres can't use the index
for some reason.

Or you've forgotten that you're
in an environment that doesn't

have the index.

Like that is a surprisingly common

Nikolay: and valid index.

Michael: Yeah.

Well, Yeah.

I haven't seen that 1 as often.

Like I haven't seen it come up
as often.

But quite often people are in the
middle of debugging something

and someone's dropped that index
or they're on staging and it

doesn't have it when they went
on production, they do have it.

So it is worth double checking
the index exists, but by using

enable seq scan off, you can quickly
check, is it a costing issue?

Now it's trickier with if you're
in the case you mentioned where

it's using 1 index and not the
1 you're expecting, you can't

use that.

You can disable different scan
types, but you can't tell it not

to use a specific index.

I have seen a trick for that though,
which I wasn't familiar

with before, but Haki Benita shared
on a blog post.

He does a begin transaction, drop
index, the 1 that it was using,

explain the query, and then roll
back, which means you can try

and see if that index didn't exist,
would it pick my index?

Which is quite a nice trick as
well.

So trying to find out, are you
in the case where Postgres can't

use the index or you're in the
case where it's choosing not to?

The enable parameter is a really
nice way of finding that out

quite quickly normally.

So yeah, cool that you start there
as well, even if you're eyeballing

the costs.

The one thing I would add on that
is, I think explain analyze can

be helpful.

So if you've run that already,
or if your query actually runs

and doesn't time out, then the
explain analyze part is helpful

because you get the rows returned,
and I think when the cost estimates

are off, sometimes it's random page cost, but quite

often it's about the number of
rows Postgres expects to be returned

and the number of rows that are
actually returned.

So, in fact, maybe you don't
need to analyze because if

you know the query, you know the
data, you know the query you're

running, if you're expecting only
a few rows to be returned and

you see hundreds of thousands in
the estimate, you've got a pretty

good clue right there that you're
in a case where it's overestimating

the cost of that.

Nikolay: Or underestimating maybe.

But yeah, underestimating wouldn't
lead to the index not being used,

but anyway, it can lead to wrong
plans.

And yeah, that's a good point.

So, this is the reason number 1,
let's say.

First reason is like costs are
slightly different.

But this is already talking
about the most difficult

case, probably.

There are easier cases.

I just excluded them from my mind
because they are easy.

If the cost is off, and indeed, we
check the planned rows and actual

rows, if the mismatch is huge,
either it's outdated stats, probably

somebody is blocking the autovacuum
or something we need to check,

or maybe lack of stats.

Sometimes we don't have stats at
all.

It happens.

For example, if you created a functional
index but you didn't

run analyze at all yet.

Michael: That was the case yesterday.

Nikolay: In this case, explain
analyze buffers also is good.

I agree with you.

But sometimes you have
a query which lasts hours.

So, in this case, we need to
downgrade to just a regular explain.

Michael: Well, this is one of those,
like, I'm obviously a big fan

of explaining those buffers like
you, but this is one of those

cases where I don't think you can
know, like, maybe you can never

know, but I don't think you can
know from a single execution

plan which case you're in, we often
need at least 2.

We need the before and the after
we've changed one of these parameters.

The first one is not going to tell
us that much.

It tells us it's not using the
index, but it doesn't tell us

why it's not using the index.

We could get some clues, like row
estimates being off or like

the cost number, but they're only
they're only clues like we've

only by changing something and
running it again that we can see

the difference of when it
is using the index.

Nikolay: That's why database branching
and experimentation with

iterations matters so much because
if you already calculated the

statistics, but then you start
having questions, what if I did

something different?

You want to reset and go a different
route, different path, right?

In this case, being able to iterate,
like reset in a few seconds,

run it again, check this idea,
check that idea.

That's why branching and fast cloning
matters so much, right?

Exactly.

This is, yeah, because otherwise it's a one-way
ticket.

ticket.

So you already calculated statistics
and that's it.

Michael: Yeah, true.

Actually, you can't go back.

You can solve your problem, but
you can't go back.

Nikolay: And sometimes you, you,
okay, you solve the problem,

but you cannot explain in detail
what happened.

But this doesn't build confidence
in your team if you cannot

explain, right?

And to explain, you need to go
back.

Yeah.

And understand better.

Michael: Yeah.

I like that you're using explain
in a different context now.

Nikolay: Okay.

But it's related, right?

It's related.

Michael: It is.

It really is.

And then you can analyze what you're
going to do next.

Nikolay: Right.

Analyze what is overused.

Michael: Yeah.

Well, that's exactly what happened
yesterday as well.

I didn't well, there's an interesting
thing here.

I jumped to conclusions a little
bit.

I did spot the it was it was probably
an estimation error, but

I thought it might also be a data
type casting issue, which is

the kind of thing that we didn't
cover that in depth, actually.

We covered it in my

Nikolay: Tom's
mismatched query, right?

Let's call it mismatched query,
maybe.

Michael: Tim, yeah.

So, basically, if your query can't
be answered by the index you

have.

So, like, the typical example of
this is a function.

So, for example, if you're running
lower on an expression

on a field, but you have only the
field index.

And you might think logically that
it could use the index.

If, for example, in your...

Date.

Nikolay: Huh?

From timestamp to date, your reduction
from timestamp to date,

for example, it's very common.

You convert timestamp to date and
expect that an index on timestamp

will work.

It won't.

Right?

Because it's for timestamp, not
for your expression.

Michael: So, yeah.

So, in some cases, Postgres handles
some of these data type conversions,

like text to varchar, which I wasn't
sure about yesterday.

But in some cases, it doesn't.

So, it depends on whether it supports
that.

But in general, that's the kind
of thing that can prevent the

use of an index that you're expecting
to be used, or like the

operator not being supported.

There's other cases where the index
type you, so let's say you've

used a B-tree because that's what
most of us are using most of

the time.

One example is on a text field using
ILIKE.

So like again, a case insensitive
search operator is not going

to use.

The B-tree index only supports
certain operators.

And if you're using an operator
that it doesn't support, like

a greater than on a hash index,
for example, or anything other

than equality on a hash index,
it won't use the index, kind of

obviously.

So I think there's a few cases
that are quite simple for why

it can't use the index and then
there's a bunch that are a bit

more complicated along the lines
you were talking about which

is what I've just classified as
Postgres doesn't think it will

be faster.

So yeah, we've said expensive and
cheap a few times, but those

costs, while they're in an arbitrary
unit, the idea is for them

to estimate how fast the query
will be, or how slow it will be.

The higher the cost, the slower
it would be.

That's the idea.

It's not the only measure that
Postgres could have tried to optimize

for.

It could have tried to optimize
for IO.

And that's correlated.

But it's not the same thing.

It optimizes for speed.

Which is, yeah, interesting and
kind of leads us to what could

have gone wrong.

So I actually think we didn't cover
a whole case, though, like

a whole simple case of it doesn't
think it would be faster, and

It's correct.

That's a case that catches people
out a bunch.

Like you're trying to force it
to use an index, but it's doing

the faster route already.

Nikolay: For example, if you read
95% of your table, just a sequential

scan might be much faster on the
fly filtering out those 5% that

are not needed, than walking on
the tree, the B-tree, right? It's

it will be probably...

Even if you balance random page
cost and index scan cost,

it still might be better to use
a sequential scan in reality.

So,

Michael: Yeah, for sure.

And because of how sequential scans
work, because of how efficient

they are, and the fact they don't
need to read data from multiple

places.

And if your data fits really tightly
on like very few pages it

could be a lot faster and not even
not even just at percentages

as high as 95% to be honest. Like for
for small tables, for some with

like not very wide rows, I've seen
it be as low as 30% or so,

that's still faster as a sequential
scan.

So I suspect you've come up with
a contrived example of it being

even lower.

Nikolay: Right, so in this case,
I have various, like simple,

very basic questions.

And I think we should, like, in
many cases, we just should start

from this question when we consider
the performance of a query.

We should think how much underlying
data do we have, and next,

very high level, how many rows
return?

Then it can be like, we can unfold
this and say, okay, if it's

just one row, was it an aggregate?

So we need to analyze a lot of
rows for real, or just we return...

Last week I had a case, people
complained about very slow, like

very bad behavior and they used
explain with buffers, they saw

like some gigabytes of data and
so on.

And they knew, ORM or GraphQL was
involved, I don't remember,

but they knew that they need only
like not a lot of rows.

And then we just saw that, like,
oh, it's bad, it's bad, it's

bad.

And the question, how many rows
are we actually returning?

And they also like, we created
all the indexes.

The indexes are here, but somehow
it's still sequential scans.

But how many rows?

And we thought like, oh, actually
it was like 10,000 rows or

so, maybe 100,000 or 25,000 rows.

I don't know, a lot of rows returned.

Do you really need it or did you just
forget the limit?

So the limit is applied on the client side
maybe, right?

It's terrible.

And then I said immediately, like,
I'm a big fan of using, Again,

maybe off-topic, but I'm a big
fan of using, in larger projects,

not in tiny projects when you care
about everything in your pet.

So queries for me, like you know
this concept, pets versus cattle,

right?

Michael: Yeah.

Nikolay: Yeah, so for virtual machines
or for real machines,

for big fleet of infrastructure.

In the case of workload, if the workload
is complex, I'm a big fan

of dealing with queries like with
cattle as well.

So when I saw this, it was a new
client.

When I saw this, that they had
this problem, returning too many

rows unexpectedly, I immediately
said, stop here.

Like, it's just 1 example of the
problem.

And let's return to a high-level,
top-down analysis using pg_stat_statements

because it has rows,
rows metric, and let's see the

whole picture, how many other queries
behave similarly and in

turn too many rows.

But this is exactly when it happens,
an index is not used.

Because you request for too many
rows here, right?

And selectivity is not good.

It's very weak.

Michael: Yeah, and Postgres is
going to do exactly what you asked

it to do.

If you want all of the rows, it's
not going to say, let me give

you the first 25 and see if that's
enough.

It's going to say, nope, here you
go.

Here's all of them.

Nikolay: ROMAN BATURINIKIS Unless
it's a special case, which

is slightly uncertain, right?

It's something new for us.

Michael: Right, good point.

I haven't actually considered the
new index types.

Nikolay: Yeah.

Anyway.

No more certainty in the SQL world.

Michael: Yeah.

But yeah, there's another case
as well.

A big one is if you're selecting
a high proportion of the table,

chances are it's actually faster
to do a sequential scan.

Possibly a different index might
serve your query better.

I haven't seen that one as often.

But another case I've seen really
often is small tables.

So even if you're only selecting
one row in a table that's like

fewer than 100, which might not
be that common in production,

I've seen quite a few like reference
tables though that are not

very many rows, or more commonly
developer databases on people's

local machines where they just
have a tiny bit of data for like

a new feature.

Even if you have the perfect index,
Postgres won't choose to

use it when you don't have much
data just because it's so...

Because all the data is on one page
and it can just simply look

it up very, very quickly, very,
very easily.

So small tables are the other exception
that I see more often

because of dev boxes.

Nikolay: You know exactly how easy
it can be impatient here trying

to interrupt you, right?

This is my favorite topic.

Probably I will not add anything
here.

And listeners who follow us for
long already know what I would

say here, right?

Because you should always try to
deal with full-size databases.

Michael: Yeah, or at least like
a

Nikolay: bigger,

Michael: large datasets.

If you're going to have to, yeah.

But it trips people up.

Nikolay: Unless you're a big fan
of the hypothetical approach, hyper-pg

and indexed partitioning.

So it also would work, but in a
limited way.

Michael: Would it though?

Because if you don't have any data,
I don't think...

Nikolay: Ah, no, sorry.

It was only discussed, not implemented.

And there was another project,
which like, let's export statistics

from production, import it to a lower
environment, and pretend

we have a lot of data.

Michael: That's a cool idea.

Nikolay: Yes, it's a cool idea.

Maybe already...

Hyper-pg has been developed, so
definitely partitioning was covered,

but about statistics maybe not,
but there was another project

developed in Japan, I guess, and
it's not super popular, but

the idea is cool, I think it's really
good, but I expect some limitations

of this idea obviously if you can
afford testing on full-size

databases this is the best way
yeah you just need to make it

cheap and then here I stop yeah

Michael: but yeah this is a so
this is the more interesting case.

So right, like once you're past
those two of it being, you know,

actually faster, if it's wrong,
if Postgres is not estimating

correctly, That's when we get into
the more interesting parts

of, like, you mentioned stale statistics
or not even having statistics

that are relevant or needed.

So analyze is your friend here
on the tables in question.

We've also got a couple of other
tools.

We can increase the sample that
Postgres will do, if you've got

an oddly distributed-

Nikolay: Default statistics target.

Yeah, some people-

Michael: Not even-

Nikolay: For one column.

Michael: Yeah, exactly.

You can do it globally, but per
column...

Nikolay: But 100, default 100 is
also quite a lot.

It might not be enough if you have
high insert rate and constantly

working on the edge of new data
and the distribution of values

in new data is very different compared
to the archived data.

So there are things there, definitely.

Michael: That's one of the defaults
I'm happiest with, actually.

I think it's a pretty good sweet
spot because increasing it,

if you increase it globally, you
increase the time for analyze

to run globally.

And I think that has knock-on effects
for things like your downtime

for doing certain types of major
upgrades and things.

So I

Nikolay: can see...

Major upgrades

Michael: should be

Nikolay: zero downtime.

And if you've jumped from 100 to
1,000, yes, it will probably

increase analyze time maybe two or
three times, but not 10 times.

It's not linear.

Michael: Sure, but I haven't
seen it cause huge problems

globally.

But yeah, if you've got a column
that's like not a distribution,

well in a skewed distribution,
increasing it for that column

can really help.

And then the final one I've got on
my list was the multi-columns.

Yeah.

Nikolay: Which can be only four columns
inside one table yeah it cannot

be unfortunately for two tables, two
different columns in two tables.

It would be interesting to have
maybe as well. Sometimes one table

fully depends on another, right?

Michael: Yeah, I

Nikolay: don't know.

We

Michael: do have a whole episode
on hints, but I think this is

where the hints discussion really
comes into play.

It's like, what are your options
when we're out of tools within

Postgres to give the planner as
much information as we can about

the distribution and the stats.

Nikolay: Right.

Bugs also happen.

So bugs might happen.

I mean, planner bugs or some not
developed things, not yet developed

things, some easy stuff which is
not yet there, Postgres planner

is not super powerful.

It's quite powerful, but compared
to SQL server, for example,

the code base is much smaller and
the amount of, the number of

engineering hours invested into SQL
Server Planner are much bigger

than into Postgres.

So of course it's evolving, but
still some things might be not

yet developed and sometimes people
see quite simple things.

To me, sometimes you think, oh,
it's so obvious why it's not

here.

And as an example, it's very unrelated,
but it still poses.

Last week, statement_timeout
was committed by Alexander Korotkov.

Many thanks to him.

And statement_timeout was my
idea.

And when I came to this
idea, I was thinking, am I stupid?

Like, am I missing something?

Like whole world still didn't raise
this.

I searched, I don't see discussion.

Why statement_timeout is not
present in PostgreSQL still, like

so many dozens of years of development.

And then a few folks validated
it's a good idea actually, and

then it got support.

So if you spend time with Postgres
enough, with the Query Planner, you

might indeed see cases which are
not yet well-developed.

Or just simply bugs.

Sometimes just bugs happen as well.

Michael: Yeah.

I've spoken to several customers
that have long histories with

DB2, Oracle, and they're surprised
by some things, but they also

talk about some issues with, like,
especially on the Oracle side,

about kind of patchwork solutions.

So sometimes more code in these
optimizers is not necessarily

better because it's like patched
fix on patch fix on patch fix

and it leads to quite like nasty
weird behaviors.

So I do admire the elegance of
the simplicity, but equally, I

think you're right people I know
that that know these things

well, or have used SQL Server or
DB2 in the past, definitely

say nice things about those planners.

Nikolay: Well, my opinion is also
based not on my experience.

My last experience outside Postgres
was very long ago.

I mean, real experience when you
do develop something for long.

It was more than 15 years ago.

So I'm just translating other folks'
opinions, but they seem

reasonable, these opinions.

And they are experienced guys who
actually came to Postgres and

say, you know, like, let's just
admit the planner can be much,

much better.

We need to continue developing
things, improving things.

And in this case, let's advertise
a recent blog post from David

Rowley.

So I've noticed many items, it
was his contribution.

Michael: Yeah, but also many from
other people too.

Nikolay: Yeah, so like the post
is what's new in the planner

in PostgreSQL 16?

Let's unfold it and so on.

It's great.

It's also in my to-do list to inspect
deeper and maybe do some

tests.

So it's a good post.

Michael: And the thing I love most
about it was inspiring people

to upgrade.

**Postgres** 16 came out a few months
ago now.

We've had some minor, at least
1, I think 2 now minor patches.

Nikolay: February, I have no idea.

Michael: I already know at least
1 company that has upgraded

as a direct result of that blog
post coming out, just to see

if some of their queries improved
as a result.

Cool.

Nikolay: Cool.

Michael: Good.

So yeah, thank you to David.

Nikolay: This is important to share,
by the way, because other

people think like, maybe it's still
too early and so on.

And you know, like I recently was
approached and just the question,

do you know anything bad about
**Postgres** 16?

I said no, actually, it's not,
it's not **Postgres** 9.3.

No, 9.3 was terrible.

I remember.

Michael: I think I've got a new
answer for you that I think is

more on brand.

I think you should say yes, but
they're also true in 15, 14,

13, 12, 11 and 10.

Nikolay: Okay.

Yeah.

Michael: Anyway, I know what you
mean.

Did you mean like you don't have
anything bad in 16 that's new

in 16?

Nikolay: Nobody bumped into some
problems like complaining that

it's bad.

There are a few companies upgraded,
I mean, maybe not huge installations,

but still.

It has a good reputation so far.

And of course, the risks are still
there, and tomorrow we might

see some problems, but these risks
already go down, down, and

down over time, right?

And I mean, it's important to share,
so we upgraded, everything

is good, in our case.

It doesn't mean in any case it
will be good, but it builds up

the level of confidence over time,
and more people will be upgrading.

Nice one.

Okay, good.

Did

Michael: you have anything else
you wanted to add?

Nikolay: I always have a lot of
things, but I think we're out

of time.

Let's not consume a lot of time.

Of course, it's good if you run
more or walk your dog more, listening

to us longer, but still, there
are other things to do.

So thank you so much and thank
you for support.

We still like, I didn't follow
really well, but we see comments

on YouTube, for example, it's good.

Please share your experience as
well and other people also see

it and we can highlight some comments
not only about Red Bull,

right?

Or beer.

Michael: Free advert.

I'm teasing.

Yeah, I think over a thousand people
have listened to the last

few episodes, which is pretty cool.

Yeah.

So thank you all.

Nikolay: Good.

Okay.

Bye bye.