I want to extract the content (text only) from a .ppt file. How can I do it?
(For a .txt file I can simply open it and read it. What is the equivalent for getting the text out of a .ppt file?)
I know that win32com can do this on Windows, but I am working on Linux. Is there a way to do it there?

  • What do you mean by content? Text only, or diagrams and multimedia as well? Commented Nov 26, 2012 at 8:48
  • Have you tried running the Unix strings command on the .ppt file? Commented Nov 26, 2012 at 8:55
  • @mouviciel I just tried it, but the output is not clean. Some of the text is what I want, but a lot of it is not. Commented Nov 26, 2012 at 8:59
  • @thorstenmüller Thanks, catdoc is a good one. Commented Nov 26, 2012 at 9:48

1 Answer

I found this discussion over on Super User:

Command line tool in Linux to Extract Text From Word, Excel, Powerpoint?

There are several reasonable answers listed there, including using LibreOffice (which also handles .doc, .docx, .pptx, and similar formats) and the Apache Tika project (which appears to be the 5,000 lb gorilla in this solution space).
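
As a rough illustration of the LibreOffice route, here is a minimal Python sketch. It is not taken from the linked discussion, and it rests on a few assumptions: LibreOffice is installed and reachable as soffice or libreoffice on the PATH, and the legacy .ppt is first converted to .pptx. A .pptx file is a ZIP archive of XML, so its text can be read with the standard library alone. The function names convert_ppt_to_pptx and extract_pptx_text are made up for this example.

```python
import re
import shutil
import subprocess
import xml.etree.ElementTree as ET
import zipfile
from pathlib import Path

# DrawingML namespace; slide text lives in <a:t> runs inside <a:p> paragraphs.
A_NS = "{http://schemas.openxmlformats.org/drawingml/2006/main}"
SLIDE_RE = re.compile(r"ppt/slides/slide(\d+)\.xml")


def convert_ppt_to_pptx(ppt_path, out_dir):
    """Convert a legacy .ppt to .pptx using headless LibreOffice.

    Assumption: LibreOffice is installed; this raises if it is not found.
    """
    soffice = shutil.which("soffice") or shutil.which("libreoffice")
    if soffice is None:
        raise RuntimeError("LibreOffice (soffice) was not found on PATH")
    subprocess.run(
        [soffice, "--headless", "--convert-to", "pptx",
         "--outdir", str(out_dir), str(ppt_path)],
        check=True,
    )
    return Path(out_dir) / (Path(ppt_path).stem + ".pptx")


def extract_pptx_text(pptx_path):
    """Return one string per slide, ordered by slide file number.

    File-number order usually matches presentation order, but that is
    not guaranteed; the real order is defined in ppt/presentation.xml.
    """
    slides = []
    with zipfile.ZipFile(pptx_path) as zf:
        numbered = []
        for name in zf.namelist():
            match = SLIDE_RE.fullmatch(name)
            if match:
                numbered.append((int(match.group(1)), name))
        for _, name in sorted(numbered):
            root = ET.fromstring(zf.read(name))
            lines = []
            for para in root.iter(A_NS + "p"):
                # Join the runs of one paragraph into a single line.
                text = "".join(t.text or "" for t in para.iter(A_NS + "t"))
                if text.strip():
                    lines.append(text)
            slides.append("\n".join(lines))
    return slides


if __name__ == "__main__":
    import sys
    import tempfile

    with tempfile.TemporaryDirectory() as tmp:
        pptx = convert_ppt_to_pptx(sys.argv[1], tmp)
        for number, text in enumerate(extract_pptx_text(pptx), start=1):
            print(f"--- slide {number} ---")
            print(text)
```

Saved as, say, ppt_text.py, it could be run as python ppt_text.py slides.ppt. Speaker notes and text embedded in charts or images are not covered by this sketch; for those, or for batch work across many formats, Apache Tika is probably the better fit.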
