What is the 'best' way to do distributed transactions across multiple databases using Spring and Hibernate [closed]

Question

Closed. This question is opinion-based. It is not currently accepting answers.

Want to improve this question? Because this question may lead to opinionated discussion, debate, and answers, it has been closed. You may edit the question if you feel you can improve it so that it requires answers that include facts and citations or a detailed explanation of the proposed solution. If edited, the question will be reviewed and might be reopened.

Closed last year.

The community reviewed whether to reopen this question last year and left it closed:

Original close reason(s) were not resolved

Improve this question

I have an application - more like a utility - that sits in a corner and updates two different databases periodically.

It is a little standalone app that has been built with a Spring Application Context. The context has two Hibernate Session Factories configured in it, in turn using Commons DBCP data sources configured in Spring.

Currently there is no transaction management, but I would like to add some. The update to one database depends on a successful update to the other.

The app does not sit in a Java EE container - it is bootstrapped by a static launcher class called from a shell script. The launcher class instantiates the Application Context and then invokes a method on one of its beans.

What is the 'best' way to put transactionality around the database updates?

I will leave the definition of 'best' to you, but I think it should be some function of 'easy to set up', 'easy to configure', 'inexpensive', and 'easy to package and redistribute'. Naturally FOSS would be good.

Aaron Digulla · Accepted Answer · 2024-08-05 19:53:42Z

47

The best way to distribute transactions over more than one database is: Don't.

Some people will point you to XA but XA (or Two Phase Commit) is a lie (or marketese).

Imagine: After the first phase have told the XA manager that it can send the final commit, the network connection to one of the databases fails. Now what? Timeout? That would leave the other database corrupt. Rollback? Two problems: You can't roll back a commit and how do you know what happened to the second database? Maybe the network connection failed after it successfully committed the data and only the "success" message was lost?

The best way is to copy the data to an "import" table. Use a scheme which allows you to abort the copy and continue it at any time (for example, ignore data which you already have or order the select by ID and request only records > MAX(ID) of your copy). Protect this with a transaction. This is not a problem since you're only reading data from the source, so when the transaction fails for any reason, you can abort, rollback and try again later. Therefore, this is a plain old single source transaction.

After you have copied the data, process it locally by reading it from the import table. Again, you will need some mechanism to determine which data you have already seen. But this time, you're working only on data that is inside a single database.

edited Aug 5, 2024 at 19:53

answered May 20, 2009 at 11:38

Aaron Digulla

330k111 gold badges626 silver badges840 bronze badges

Sign up to request clarification or add additional context in comments.

14 Comments

Falcon Over a year ago

Distributed Transactions must statisfy all 4 ACID properties. What's your problem? The scenario you described cannot happen, as the managers are communicating with each other and do only commit when all participating nodes have exchanged a "GO".

Aaron Digulla Over a year ago

@Falcon: So what happens if the network fails between PREPARE and COMMIT? Or one of the server dies? "cannot happen" can't happen in reality.

K.Nicholas Over a year ago

No, they are not instructed to roll back because in this scenario, some of the nodes have already committed. What happens is when the crashed node becomes available, the transaction coordinator tells it to commit again. Because the node responded positively in the "prepare" phase, it is required to be able to "commit", even when it comes back from a crash.

K.Nicholas Over a year ago

I find support for the standard too widespread to be swayed by the anecdotal evidence of one person. Thank you for your comments.

Aaron Digulla Over a year ago

@Nicholas I find support widespread only by companies who make money by fixing the problems this standard creates. The "consumers" (= people who have to suffer such solutions) usually try this once and then look for better solutions. That said, my answer is logically sound. My approach is much more simple than XA and I can prove that it will always work. XA is more of a promise, not a fact.

|

nhahtdh · Accepted Answer · 2014-11-20 07:47:04Z

8

Setup a transaction manager in your context. Spring docs have examples, and it is very simple. Then when you want to execute a transaction:

try { 
    TransactionTemplate tt = new TransactionTemplate(txManager);

    tt.execute(new TransactionCallbackWithoutResult(){
    protected void doInTransactionWithoutResult(
            TransactionStatus status) {
        updateDb1();
        updateDb2();
    }
} catch (TransactionException ex) {
    // handle 
}

For more examples, and information perhaps look at this: XA transactions using Spring

edited Nov 20, 2014 at 7:47

nhahtdh

56.9k15 gold badges131 silver badges164 bronze badges

answered Sep 24, 2008 at 17:20

Curious Developer

1 Comment

Chriki Over a year ago

This example doesn’t really answer the question or it even answers it wrongly: the OP mentioned that he had two Hibernate session factories configured which would require two separate transaction managers. The example in the answer only uses one transaction manager which is not specified any closer. Using a single Hibernate transaction manager consequently would never rollback one of the two DBs on errors. Using for example a ChainedTransactionManager (as noted by @Pani Dhakshnamurthy) might help but that is not mentioned in this answer.

skaffman · Accepted Answer · 2008-09-28 12:11:20Z

6

When you say "two different databases", do you mean different database servers, or two different schemas within the same DB server?

If the former, then if you want full transactionality, then you need the XA transaction API, which provides full two-phase commit. But more importantly, you also need a transaction coordinator/monitor which manages transaction propagation between the different database systems. This is part of JavaEE spec, and a pretty rarefied part of it at that. The TX coordinator itself is a complex piece of software. Your application software (via Spring, if you so wish) talks to the coordinator.

If, however, you just mean two databases within the same DB server, then vanilla JDBC transactions should work just fine, just perform your operations against both databases within a single transaction.

answered Sep 28, 2008 at 12:11

skaffman

405k96 gold badges825 silver badges775 bronze badges

1 Comment

Phil Over a year ago

Great Answer. That's the decisive difference!

maximdim · Accepted Answer · 2008-12-28 17:15:07Z

3

In this case you would need a Transaction Monitor (server supporting XA protocol) and make sure your databases supports XA also. Most (all?) J2EE servers comes with Transaction Monitor built in. If your code is running not in J2EE server then there are bunch of standalone alternatives - Atomicos, Bitronix, etc.

answered Dec 28, 2008 at 17:15

maximdim

8,2073 gold badges36 silver badges48 bronze badges

Comments

Pani Dhakshnamurthy · Accepted Answer · 2014-06-12 00:08:28Z

3

You could try Spring ChainedTransactionManager - http://docs.spring.io/spring-data/commons/docs/1.6.2.RELEASE/api/org/springframework/data/transaction/ChainedTransactionManager.html that supports distributed db transaction. This could be a better alternative to XA

answered Jun 12, 2014 at 0:08

Pani Dhakshnamurthy

2333 silver badges8 bronze badges

Comments

kylebebak · Accepted Answer · 2023-07-13 23:45:07Z

For those suggesting concerns with two-phase commit can be waved away because it's widely used in practice, I suggest looking at this: https://en.wikipedia.org/wiki/Two-phase_commit_protocol. There's a link at the bottom of the 2PC article to an article on three-phase commit(!)

Some excerpts from the article on 3PC:

In computer networking and databases, the three-phase commit protocol (3PC)[1] is a distributed algorithm which lets all nodes in a distributed system agree to commit a transaction. It is a more failure-resilient refinement of the two-phase commit protocol (2PC).

Three-phase commit assumes a network with bounded delay and nodes with bounded response times; In most practical systems with unbounded network delay and process pauses, it cannot guarantee atomicity.

To summarize:

3PC is more failure-resistant than 2PC
Not even 3PC guarantees atomicity
Draw your own conclusions about 2PC

Audrey · Accepted Answer · 2024-08-20 19:42:03Z

Assuming that you have a web server with two databases at different locations. The web server starts a translation.

In these scenarios, database clustering gives us solutions but generally, we have two types of databases in clustering.

A mast node

And some slave nodes.

The only node that has permission to write is the master node which is a single source of truth and the slave nodes will only read data from the master node. So the solution will answer this problem.

In case any bad things happen to the master node, the fourth principle of ACID means durability. The database has been lost and we can agree that nothing will be recorded on it. After a while one of the slave nodes will get promoted and stand as a new master node and do the job. But we should know that the transaction in the dead database has been lost and the last state of the database is before that transaction gets started. Also, there are some other solutions to record all events on the database somewhere for these kinds of scenarios which might be helpful.

Collectives™ on Stack Overflow

What is the 'best' way to do distributed transactions across multiple databases using Spring and Hibernate [closed]

7 Answers 7

14 Comments

1 Comment

1 Comment

Comments

Comments

Comments

Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

7 Answers 7

14 Comments

1 Comment

1 Comment

Comments

Comments

Comments

Comments

Linked

Related