I would like to experiment with Stack Overflow's data dump in Oracle. The dump is provided as XML, and the files are huge (a single XML file is about 3 GB). I would like to import this data into my Oracle database. I know someone else has managed to do this by working with the XML directly. Any ideas or suggestions for making this easy?

  • How do you want it in your Oracle database? A table with an XMLTYPE column? In relational format? Commented Mar 23, 2011 at 9:58
  • I want it in a relational format Commented Mar 23, 2011 at 13:56
  • And what's wrong with the method shown by Vincent Malgrat in that other thread? Commented Mar 23, 2011 at 13:59
  • I don't know how to put the xml into the directory... Commented Mar 23, 2011 at 15:58
  • Place the XML file somewhere on your server, note the OS directory you put it in, and define an Oracle directory object that points to that OS directory. Commented Mar 24, 2011 at 8:07

1 Answer

Check out the Groovy SQL and XML libraries. You should be able to get up and running quickly, even with minimal Java/Groovy experience.

http://docs.codehaus.org/display/GROOVY/Tutorial+6+-+Groovy+SQL

Groovy XML

You'll need to install Groovy and get the ojdbc14.jar JDBC driver from Oracle. Put your code in a file and run:

groovy -cp ojdbc14.jar myscript.groovy
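
With a file this size, whatever you write should stream the XML rather than load it all into memory. Below is a rough sketch of that idea in plain Java, using StAX and JDBC batching from the JDK; the same calls can be made from a Groovy script. It rests on assumptions that are not from the question: that each record in the dump is a `<row .../>` element carrying its fields as attributes (`Id`, `PostTypeId`, `Score`, `Title`), and that you have created a table called `so_posts` with matching columns. The class name, table name, and batch size are made up for illustration.

```java
import java.io.FileInputStream;
import java.io.InputStream;
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.PreparedStatement;
import java.sql.SQLException;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.Consumer;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamReader;

class DumpLoader {

    // Streams the XML and calls handler once per <row .../> element,
    // passing that element's attributes. Only one row is held in memory at a time.
    static long eachRow(InputStream in, Consumer<Map<String, String>> handler) throws Exception {
        XMLStreamReader reader = XMLInputFactory.newInstance().createXMLStreamReader(in);
        long count = 0;
        try {
            while (reader.hasNext()) {
                if (reader.next() == XMLStreamConstants.START_ELEMENT
                        && "row".equals(reader.getLocalName())) {
                    Map<String, String> attrs = new LinkedHashMap<>();
                    for (int i = 0; i < reader.getAttributeCount(); i++) {
                        attrs.put(reader.getAttributeLocalName(i), reader.getAttributeValue(i));
                    }
                    handler.accept(attrs);
                    count++;
                }
            }
        } finally {
            reader.close();
        }
        return count;
    }

    // Arguments: <xml file> <jdbc url> <user> <password>
    public static void main(String[] args) throws Exception {
        // Older drivers such as ojdbc14 are not auto-registered, so load the class explicitly.
        Class.forName("oracle.jdbc.OracleDriver");
        // Hypothetical target table:
        //   CREATE TABLE so_posts (id NUMBER, post_type_id NUMBER, score NUMBER, title VARCHAR2(4000))
        String insert = "INSERT INTO so_posts (id, post_type_id, score, title) VALUES (?, ?, ?, ?)";
        try (InputStream in = new FileInputStream(args[0]);
             Connection conn = DriverManager.getConnection(args[1], args[2], args[3]);
             PreparedStatement ps = conn.prepareStatement(insert)) {
            conn.setAutoCommit(false);
            int[] pending = {0};
            long total = eachRow(in, row -> {
                try {
                    ps.setLong(1, Long.parseLong(row.get("Id")));
                    ps.setInt(2, Integer.parseInt(row.get("PostTypeId")));
                    ps.setInt(3, Integer.parseInt(row.get("Score")));
                    ps.setString(4, row.get("Title")); // absent on some rows, stored as NULL
                    ps.addBatch();
                    // Flush every 1000 rows (an arbitrary batch size) to limit round trips.
                    if (++pending[0] % 1000 == 0) {
                        ps.executeBatch();
                        conn.commit();
                    }
                } catch (SQLException e) {
                    throw new RuntimeException(e);
                }
            });
            ps.executeBatch(); // send whatever is left in the last partial batch
            conn.commit();
            System.out.println("Inserted " + total + " rows");
        }
    }
}
```

If you go this route, compile it with `javac DumpLoader.java` and run it with the driver on the classpath, for example `java -cp ojdbc14.jar:. DumpLoader posts.xml <jdbc url> <user> <password>`, where the file name and connection details are placeholders for your own. Long text fields such as post bodies would probably need a CLOB column and are left out here.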