I want to read certain column from excel file into dataframe however I want to specify the column with its column header name.
for an example, I have an excel file with two columns in Sheet 2: "number" in column A and "ForeignKey" in column B). I want to import the "ForeignKey" into a dataframe. I did this with the following script:
xl_file = pd.read_excel('D:/SnapPython/TestDF.xlsx', sheet_name='Sheet 2', usecols=[0,1])
It shows the following in my xl_file:
number ForeignKey
0 1 abc
1 2 def
2 3 ghi
in case a small number of column, I can get the "ForeignKey" by specifying usecols=[1]. However if I have many column and know the column name pattern, it will be easier by specifying the column name. I tried the following code but it gives empty dataframe.
xl_file = pd.read_excel('D:/SnapPython/TestDF.xlsx', sheet_name='Sheet 2', usecols=['ForeignKey'])
According to discussion in the following link, the code above works well but for read_csv.
[How to drop a specific column of csv file while reading it using pandas?
Is there a way to do this for reading excel file?
thank you in advance