I have two excel files, both contain employee information. File1 is is 195K rows, File2 is less than 100. I need to return the entire row in File1 where the id# from the File2 is present. I've done something like this in PHP but can't sort it out in python/pandas.
I'm looking at the isin() method to work out the selection of rows.
df0 = pd.ExcelFile('File1.xlsx').parse('Sheet1')
df1 = pd.ExcelFile('Fil2.xlsx').parse('Sheet1')
print df0[df1['staffid'].isin(df0['staffid'])]
The result is "IndexingError: Unalignable boolean Series key provided"
Is pandas the right tool for this, or should I look at openpyxl or something else?