
My C# server application starts a new thread every time it needs to insert data into or remove data from the database. The problem is that since the threads execute in an arbitrary order, there is no guarantee that a delete command runs after the insertion of the same object when both events occur at almost the same time.

E.g.: The server receives a command to insert multiple objects, which takes about 5 seconds. After 1 second of execution the server receives a command to delete those same objects from the database. Since the removal could begin before all objects are completely stored, the outcome is undefined.

How can the order of execution of certain threads be managed?

  • Look for System.Threading.Mutex or System.Threading.ManualResetEvent Commented Mar 13, 2013 at 11:56
  • Threading.Mutex is a good idea for serializing threads in a particular call. You might also look at message queuing. Commented Mar 13, 2013 at 11:59
  • I don't think using a mutex would work, since there could be an insert, a delete, and another insert operation on the same data. After the first insert finishes, the mutex would not ensure that the delete command executes before the next insert, or am I wrong? Message queuing is an option, but I was hoping for a better solution that avoids the performance hit. Commented Mar 13, 2013 at 12:49

4 Answers


You can use database transactions for this and specify different isolation levels for different operations.

For example, you can use the highest isolation level for writes/updates/deletes but a lower level for reads. You can also fine-tune this to lock only specific rows rather than whole tables. The specific terminology depends on the database and data access library you use.
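For illustration, here is a minimal sketch using ADO.NET against SQL Server; the Objects table, its Id column, and the connection-string handling are assumptions for the example, not something from the question:

using System.Collections.Generic;
using System.Data;
using System.Data.SqlClient;

public class ObjectStore
{
    public void InsertObjects(string connectionString, IEnumerable<int> ids)
    {
        using (var connection = new SqlConnection(connectionString))
        {
            connection.Open();

            // Serializable is the strictest isolation level: the rows this
            // transaction touches stay locked until Commit, so a concurrent
            // delete of the same rows has to wait instead of interleaving.
            using (var transaction = connection.BeginTransaction(IsolationLevel.Serializable))
            {
                foreach (var id in ids)
                {
                    // Hypothetical table/column names for the sketch.
                    using (var command = new SqlCommand(
                        "INSERT INTO Objects (Id) VALUES (@id)", connection, transaction))
                    {
                        command.Parameters.AddWithValue("@id", id);
                        command.ExecuteNonQuery();
                    }
                }
                transaction.Commit();
            }
        }
    }
}

Keep in mind that holding a Serializable transaction open for a multi-second bulk insert is expensive; in practice you would weigh this against committing in smaller batches.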

I would advise against relying on any ordering; parallel and ordered just don't go well together. For example:

  • You may need to scale servers horizontally; once you add a second server and a load balancer, a mutex-based solution will no longer work

  • In a large, distributed system a message queue won't work either: by the time one thread has completed a scan and decided it is good to go, another thread may have written a message that should have prevented the operation. Moreover, under high load, scanning the same queue multiple times is inefficient.


2 Comments

What do you mean by an operation that should prevent operation execution? An insert followed by a delete? With transactions this might also be the case if they are not in the same transaction. I also don't think there is any inefficiency here if the queue is not scanned until emptied: say it starts with 500 items; only those 500 items are dequeued, and even if more items arrive in the meantime, they have to wait for the next iteration.
With transactions the updated rows will be locked on the database side. Situation 1: an insert comes first with predefined ids = {1,2,3} and while it is being executed a delete comes for ids = {4,5,6}. In this case the two transactions may operate concurrently without locking and both succeed. Situation 2: an insert comes first with predefined ids = {1,2,3} and while it is being executed a delete comes for ids = {2,3}. In this case the two transactions will not operate concurrently: the insert will lock the three rows, and the second transaction will either wait for the insert to finish or fail with an error.

If you know that the insert is always received before the delete, and the problem is just that you don't want the insertion to be interrupted, then you can simply use a lock around your insertion code.

private static readonly object m_Lock = new object();

// Both methods synchronize on the same lock object, so a Remove cannot
// run while an Insert is still in progress (and vice versa).
public void Insert()
{
   lock (m_Lock)
   {
      InsertRecords();
   }
}

public void Remove()
{
   lock (m_Lock)
   {
      RemoveRecords();
   }
}

This way you can be sure that the remove won't happen during the insert.

P.S. Seems strange though that you need to insert and then delete right away.

4 Comments

I think that locking on every insert might cause a performance hit for a large number of requests.
@omerschleifer: This really depends on the load. If the server gets many requests per second, this can of course be a problem, as all requests will be executed one by one.
This would require a unique method for each object that can be inserted/deleted. Otherwise, locking insertion/deletion for any object would decrease performance, since there could be multiple insertions/deletions of different objects. Since this is a distributed application where multiple clients can work on the same data, the pictured scenario can happen at any time.
Never lock around I/O (including DB operations), since you have no control over its runtime

I think the simplest way is to queue all incoming requests to insert objects in one collection, and all incoming requests to delete objects in a second collection.

The server should have a basic loop (a code sketch follows below) that does:

a. Check whether there are incoming insert requests; if so, perform all inserts.

b. Check whether there are incoming delete requests; if so, perform all delete requests.

c. Sleep for X milliseconds.

Now, if you receive a delete request for an object that does not exist, you have two options:

a. Ignore the request and discard it.

b. Ignore the request for this round and keep it in the collection for the next N rounds before finally discarding it (assuming it is simply a bad request rather than a race condition).
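A minimal sketch of the loop described above, assuming objects are identified by an int id; PerformInsert and PerformDelete are placeholders for the actual database calls:

using System.Collections.Concurrent;
using System.Threading;

public class RequestLoop
{
    // Receiving threads only enqueue; the Run loop below is the single
    // thread that touches the database.
    private readonly ConcurrentQueue<int> m_Inserts = new ConcurrentQueue<int>();
    private readonly ConcurrentQueue<int> m_Deletes = new ConcurrentQueue<int>();

    public void EnqueueInsert(int id) { m_Inserts.Enqueue(id); }
    public void EnqueueDelete(int id) { m_Deletes.Enqueue(id); }

    public void Run()
    {
        while (true)
        {
            int id;

            // a. perform all inserts that have arrived so far
            while (m_Inserts.TryDequeue(out id))
                PerformInsert(id);

            // b. then perform all pending deletes
            while (m_Deletes.TryDequeue(out id))
                PerformDelete(id);

            // c. sleep before the next round
            Thread.Sleep(100);
        }
    }

    private void PerformInsert(int id) { /* database insert goes here */ }
    private void PerformDelete(int id) { /* database delete goes here */ }
}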

3 Comments

This does not work unless strict restrictions are placed on the incoming operations. There's nothing to stop insertion of semantically the same record twice in your insert set, and an intervening delete of that record being delayed.
As I understood the question, it was about synchronizing requests that might come out of order, and since they might come from different clients, I offered a solution that is scalable with the clients. I did not address data integrity; in that you are right. It was not my understanding that duplicate inserts are the issue here.
I agree that the question as offered does not allow for a precise answer.

Use a queue (with a single servicing thread) to enforce the ordering. You can also use the Task Parallel Library to manage tasks with dependencies on other tasks, though that is very difficult with arbitrary DB operations.
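For example, here is a minimal sketch of the single-servicing-thread approach using BlockingCollection; the DbOperationQueue name and the Action-based operations are assumptions for illustration:

using System;
using System.Collections.Concurrent;
using System.Threading;

public class DbOperationQueue
{
    // BlockingCollection wraps a ConcurrentQueue by default, so operations
    // come out in FIFO order; the single consumer thread then guarantees
    // that an insert enqueued before a delete also executes before it.
    private readonly BlockingCollection<Action> m_Operations =
        new BlockingCollection<Action>();

    public DbOperationQueue()
    {
        var worker = new Thread(() =>
        {
            foreach (var operation in m_Operations.GetConsumingEnumerable())
                operation();
        });
        worker.IsBackground = true;
        worker.Start();
    }

    // Request handler threads only enqueue; they never touch the database.
    public void Enqueue(Action operation)
    {
        m_Operations.Add(operation);
    }
}

A producer then simply enqueues work in arrival order, e.g. queue.Enqueue(() => InsertRecords(objects)); followed by queue.Enqueue(() => RemoveRecords(objects));, and the single consumer executes them in that order.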

I think you need to rethink how you manage the incoming operations, and whether their inter-dependencies are predictable enough that you can safely use multiple threads this way. You may need to add some "depends on" information to incoming operations to achieve that goal.
