Just when I think I am finally getting it, such a newb.
I am trying to get a list of numbers from a column from a table that is an PDF.
First step I wanted to convert to a Panda DF.
pip install tabula-py
pip install PyPDF2
import pandas as pd
import tabula
df = tabula.read_pdf('/content/Manifest.pdf')
The output I get however is a list of 1, not a DF. When I look at DF the info is there, I just have no idea how access it as it is a list of 1.
So not sure why I didnt get a DF and no idea what I meant to do with a list of 1.Output
Not sure if it matters but I am using google Colab.
Any help would be awesome.
Thanks
read_pdf()