
I am unable to read HTML files using pandas. The error is shown in the screenshot below:

[Screenshot of the error; image not available]

  • What's the html5lib version? Commented Aug 17, 2017 at 3:36
  • The html5lib version is '0.999999999'. Commented Aug 17, 2017 at 3:39
  • Upgrade your pandas, then try again. Commented Aug 17, 2017 at 3:45
  • Upgraded pandas, but still receiving the same error. Commented Aug 17, 2017 at 3:51
  • Can't reproduce this in Python 3 with the following versions: pandas '0.20.3', bs4 '4.6.0', and html5lib '0.999999999'. It loads fine for me. Commented Aug 17, 2017 at 3:57

1 Answer

0

See this Stack Overflow question, which discusses the same HTML parsing problem: html5lib: TypeError: __init__() got an unexpected keyword argument 'encoding'

Install this specific html5lib version via pip:

pip install html5lib==0.9999999

The problem lies in html5lib's tree-building code, not in pandas itself. Because html5lib is a separate package rather than a part of pandas, upgrading pandas does not change it, so you have to install the html5lib version yourself.
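To confirm which versions are actually installed before and after pinning html5lib, a small check like the one below may help. This is only a sketch: the helper name `installed_versions` and the list of packages are my own assumptions (chosen because pandas, bs4 and html5lib come up in the comments), and it relies on the standard-library `importlib.metadata` module, which needs Python 3.8 or later.

```python
from importlib import metadata


def installed_versions(packages=("pandas", "beautifulsoup4", "html5lib", "lxml")):
    """Return a dict mapping each package name to its installed version.

    Packages that are not installed map to None.
    The default package list is an assumption for illustration.
    """
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    # Print one line per package so the output can be pasted into a question.
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'not installed'}")
```

Run it once before and once after the pip install; if the html5lib line does not change, pip probably installed into a different environment than the one your script uses. Restart the Python interpreter (or notebook kernel) after reinstalling so the new html5lib is the one that gets imported.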
