TypeError: string or bytes-like object expected

Question

I am trying to tokenize tweets, but I get the error: TypeError: expected string or bytes-like object

I am cleaning tweets for use in machine learning, so I am carrying out tokenization.

import re
import numpy as np

# remove twitter handles (@user)
def remove_pattern(input_txt, pattern):
    # find every match of the pattern, then strip each match from the text
    r = re.findall(pattern, input_txt)
    for i in r:
        input_txt = re.sub(i, '', input_txt)

    return input_txt

# remove twitter handles and create a new column with the cleaned tweet
data_df['cleaned_tweet'] = np.vectorize(remove_pattern)(data_df['text'], r"@[\w]*")

Please also include more code, so we can see what types your function receives as input.
– thetradingdogdj, Commented May 6, 2019 at 14:46
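
One way to follow up on this comment is to count the Python types present in the column. The sketch below is only an illustration: `sample_texts` is a hypothetical list standing in for `data_df['text']`, and the values in it are made up. With pandas, a comparable check would be `data_df['text'].map(type).value_counts()`.

```python
from collections import Counter

# Hypothetical stand-in for data_df['text'] (not the asker's real data).
# float("nan") mimics how pandas usually stores a missing tweet.
sample_texts = ["@user hello world", float("nan"), "no handle here", None]

# Count how many values of each Python type the "column" holds.
type_counts = Counter(type(value).__name__ for value in sample_texts)
print(type_counts)
```

Any type other than `str` in the result is a value that `re.findall` would reject with this TypeError.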

BlueSun · Accepted Answer · 2019-06-04 23:09:12Z

4

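This happens because the tweet text passed to the function is not always a string: a pandas column of dtype object can hold non-string values, such as NaN for missing entries, and the re functions only accept strings or bytes. Convert the input to a string inside the function before using it: input_txt = str(input_txt).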
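
A minimal sketch of the suggested fix, under some assumptions: the `tweets` list is hypothetical sample data rather than the asker's DataFrame, and the `findall` loop from the question is replaced by a single `re.sub` call, which is a simplification not taken from the answer itself.

```python
import re

def remove_pattern(input_txt, pattern):
    # Coerce to str so the re functions never receive a float (NaN) or None.
    input_txt = str(input_txt)
    # Remove every match of the pattern in one pass.
    return re.sub(pattern, "", input_txt)

# Hypothetical sample data; float("nan") stands in for a missing tweet.
tweets = ["@user hello", float("nan"), "thanks @another_user!"]
cleaned = [remove_pattern(t, r"@[\w]*") for t in tweets]
print(cleaned)
```

Note that `str()` turns a missing value into the literal text `nan`. If that is not wanted, an alternative is to replace missing values with empty strings before cleaning, for example with `data_df['text'].fillna('')`.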

edited Jun 4, 2019 at 23:09 by BlueSun

answered Jun 4, 2019 at 19:55 by Priya Sinha
