How to reduce query running time

Asked 2 years, 11 months ago

Modified 2 years, 11 months ago

Viewed 65 times

I have table1, which contains around 1 million rows, and table2, which contains around 3 million rows.

I received the code below in answer to my previous post (link). It updates the table as expected, but it takes more than an hour to update just 100 rows.

do $$ 
declare 
  rec record;
  current_joins text;
  current_result int;
begin
  for rec in select joins from table1 loop
     select rec.joins into current_joins;
     execute format('select count(*) 
                     from (
                        select 1
                        from table2 
                        group by %1$s 
                        having count(*)>=1 
                     ) as some_alias;',
                     current_joins) 
              into current_result;
     update table1 set result=current_result where joins=current_joins;
  end loop;
end $$;

I would appreciate it if somebody could suggest alternative code that meets the above requirements.
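
One possible reduction, sketched under the assumption that many rows of table1 share the same joins value (the question does not confirm this): loop over the distinct joins values, so each grouping of table2 is computed once rather than once per table1 row. The update already matches every row with the same joins value, so the result column ends up the same. The having count(*)>=1 filter is left out because every group produced by group by contains at least one row. The alias grouped is an arbitrary name.

```sql
do $$
declare
  rec record;
  current_result int;
begin
  -- Assumption: joins values repeat across table1 rows, so visiting each
  -- distinct value once avoids recomputing the same count over table2.
  for rec in select distinct joins from table1 loop
    -- %s splices the column list in as-is, exactly like the original %1$s.
    execute format(
      'select count(*) from (select 1 from table2 group by %s) as grouped',
      rec.joins)
    into current_result;

    -- One update covers every table1 row that shares this column list.
    update table1 set result = current_result where joins = rec.joins;
  end loop;
end $$;
```

If every joins value turns out to be unique, this does not help: the cost stays at one pass over table2 per table1 row.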

The table structure can be seen in this fiddle (https://dbfiddle.uk/pP_9mKy3), and the previous post can be seen at the following link

edited Dec 31, 2022 at 19:36 by Ken White

asked Dec 31, 2022 at 19:27 by Gulya

Why is the having count(*)>=1 needed? It seems kinda pointless to me. (And the result appears to be the same without it.)

– Bergi, Jan 1, 2023 at 1:25

Instead of selecting count(*) from a group-by subquery, I would just use select count(distinct %1$s) from table2. (Admittedly it was reported to be slow, no idea whether that's been fixed). With this approach, you can also do all the different count()s in a single query (as different columns) - not sure whether that might speed it up?

– Bergi, Jan 1, 2023 at 1:32
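
A minimal sketch of the single-query idea from the comment above, assuming table2 has columns named col_a, col_b and col_c (hypothetical names; the real column lists come from table1.joins). Each count(distinct ...) becomes one output column, and a multi-column list is wrapped in a row constructor. One difference to be aware of: count(distinct col_a) ignores NULLs, whereas group by col_a puts them in a group of their own, so the numbers can differ by one when a column contains NULLs.

```sql
-- col_a, col_b, col_c are hypothetical stand-ins for the columns named in table1.joins.
select count(distinct col_a)          as cnt_a,    -- replaces: group by col_a
       count(distinct (col_a, col_b)) as cnt_a_b,  -- replaces: group by col_a, col_b
       count(distinct (col_b, col_c)) as cnt_b_c   -- replaces: group by col_b, col_c
from table2;
```

Whether this is actually faster than separate group by queries would need to be measured on the real data.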

What are the query plans for these queries? Do you have any indices on the relevant columns to speed this up?

– Bergi, Jan 1, 2023 at 1:33

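
To answer the query-plan question, one of the generated statements can be run by hand under explain. The sketch below assumes a joins value of col_a, col_b (hypothetical column names) and an index name chosen only for illustration. Note that explain (analyze) really executes the query, so it takes as long as the query itself.

```sql
-- col_a and col_b are hypothetical; substitute one column list from table1.joins.
explain (analyze, buffers)
select count(*)
from (select 1 from table2 group by col_a, col_b) as grouped;

-- Hypothetical index on the same column list. Whether the planner uses it
-- (for example through an index-only scan) depends on the data and on vacuum state.
create index table2_col_a_col_b_idx on table2 (col_a, col_b);
```

Running the explain statement again after creating the index shows whether the plan changed.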
@Bergi I tried select count(distinct %1$s) from table2, but it still takes hours to complete just a few rows. What if we concatenate the columns of table2 listed in table1.joins into a separate table and then count them? I suspect that would produce a huge amount of data and take significant time to build, though.

– Gulya, Jan 1, 2023 at 15:30
