Effectively sorting when your data is distributed across different microservices

Question

In a microservices environment, how do we effectively sort data which is part of two different microservices.

For example, lets say we have two microservices

Users Service (Has all information related to users)
Orders Service (Has all information related to orders placed by the user with user id as reference)

Now on Admin UI lets say we have a page where we are displaying orders placed between specific period of time and I need a sorting feature on user display name.

My question is how to achieve this effectively, below are two possible options which can be applied

Fetch all data from both services and then do an in-memory merge (not efficient)
Duplicate the data on the other service like adding User Display Name on Orders Service (then we have redundant data and we need to take care of consistency).

Is there any other way to achieve this, would like to know how a situation like this is solved in real world applications

Note: The scenario provided over here is more like hypothetical just want to understand how to solve when things are like this

it sounds like the same concept of joining tables in SQL. check this about MySQL sorting dba.stackexchange.com/a/117320 — Yuri G.
– Yuri G., Commented May 7, 2024 at 7:33
@YuriG. In this case the databases are distributed, and we don't want to a distributed join as it will have huge performance impact — Baala Srinivas K
– Baala Srinivas K, Commented May 7, 2024 at 8:42

Christophe Quintard · Accepted Answer · 2024-05-07 13:20:40Z

2

The way I solve this problem is by implementing a "order+user" microservice which is a materialized view of the "order" and "user" microservices. That means the view stores all the order with the associated user into its own table. The view listens for changes (dotted lines in the diagram) into the other microservices to keep itself up-to-date (you never directly modify the content of a view).

It implies a duplication of data, and it's not the simplest thing to implement, but it has the following advantages :

You do not make a JOIN between the database of the "order" and "user" microservices, which would lead to a database monolith.
You keep the "order" service clean. Imagine you want to search orders by information from the user, the products, ... That would lead to an incredible messy "order" service.
You will have a super fast search, because the view has denormalized the data !

edited May 7, 2024 at 13:20

answered May 7, 2024 at 8:36

Christophe Quintard

2,8912 gold badges28 silver badges34 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

Baala Srinivas K Over a year ago

Thank you for your response, but does this not take us back in path of monolith, because the reason why we have two different services is to scale part of my application (order microservice) now in this case if we introduce a new microservice then the new service also may need to scale for order service requests and some part of user service requests as well

Christophe Quintard Over a year ago

Requests made to the view are not forwarded to the other services, because it is a materialized view, which means the view stores aggregates of an order and the associate user into its own database. So you will be able to scale the "order" service and the "order+user" view service independantly, according to the number of requests to be processed by each service.

Baala Srinivas K Over a year ago

If I understand your approach right, what you mean is we will have a new microservice which will have data to serve combined query requests both order service and user service will update this data in background asynchronously, when we have a need to fetch the combined data we would route our request new service (order+user). Is this understanding right?

Guru Stron · Accepted Answer · 2024-05-08 23:07:25Z

then we have redundant data and we need to take care of consistency

The thing is that data is not actually redundant here. Duplication of data is a very common approach in microservice architecture - specifically for the reason of performance. The main rule of thumb is that only one service should be the owner of data and others store the copy but do not manage it directly (and ideally asynchronously maintain its consistency by using some kind message queue/service bus following the eventual consistent approach).

What (micro)service will maintain the replica is up for you to decide. It can be even a separate service like AdminOrderSearchService or OrderSearchService which will use something like Elastic to actually perform search. If you don't anticipate major changes in the order search logic in this particular case I would consider to store the replica of the needed user data in the order service (it maybe quite useful in other use-cases too, so you will not need extra queries between the services).

Highly recommend to check out:

microservices.io and especially the Command-side replica pattern.
Avoiding Microservice Megadisasters

Collectives™ on Stack Overflow

Effectively sorting when your data is distributed across different microservices

2 Answers 2

3 Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

3 Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related