Extract Data from PDF and populate in Excel using Python

Question

I need bit of advice ............. I'm working on a program in Python, the program would read data from a PDF and I'm supposed to populate the same information in a excel sheet Right now I'm using PyPDF 2 to extract the data and I would be using Panda to store the data in a data frame and then that data frame would be populated in to excel sheet Is my path of action efficient and if there's a better way or a flaw in my plan please let me know about it.

Welcome to Stack Overflow! Please edit your question to show the code you have so far. You should include at least an outline (but preferably a Minimal, Complete, and Verifiable example) of the code that you are having problems with, then we can try to help. You should also read How to Ask. — import random
– import random, Commented Mar 7, 2018 at 23:05

ASH · Accepted Answer · 2018-03-10 21:31:21Z

1

I think it should be something like this.

import PyPDF2
import openpyxl

pdfFileObj = open('C:/Users/Excel/Desktop/TABLES.pdf', 'rb')
pdfReader = PyPDF2.PdfFileReader(pdfFileObj)
pdfReader.numPages

pageObj = pdfReader.getPage(0)
mytext = pageObj.extractText()


wb = openpyxl.load_workbook('C:/Users/Excel/Desktop/excel.xlsx')
sheet = wb.active
sheet.title = 'MyPDF'
sheet['A1'] = mytext

wb.save('C:/Users/Excel/Desktop/excel.xlsx')
print('DONE!!')

See the link below for more details.

http://automatetheboringstuff.com/chapter12/

edited Mar 10, 2018 at 21:31

answered Mar 10, 2018 at 13:43

ASH

20.5k28 gold badges117 silver badges247 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

Korzak Over a year ago

Does not work. Just puts the whole dataset into the first cell of the Excel file instead of rendering it as a table(s)

Collectives™ on Stack Overflow

Extract Data from PDF and populate in Excel using Python

1 Answer 1

1 Comment

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

1 Comment

Your Answer

Sign up or log in

Post as a guest

Related