Extract a String After a Substring in a Pandas dataframe

Question

'(ep1270399)\nname=stet, johannes cornelis p/a ballast nedam infra b.v., p.o. box 1526 , city=3430 bm  nieuwegein , country=nl \n\nname=bos, wilhelmus johannes p/a ballast nedam infra b.v., p.o. box 1526 , city=3430 bm  nieuwegein , country=nl \n'

I have a pandas dataframe and I would like to extract the name which is always after a certain keyword \nname=. Hence, I would like to get 'stet' and 'bos' and put it in an array.

can you post a sample dataframe

Pyd
– Pyd

2019-12-30 06:10:17 +00:00
Commented Dec 30, 2019 at 6:10 — Pyd
– Pyd, Commented Dec 30, 2019 at 6:10

PacketLoss · Accepted Answer · 2019-12-30 04:45:27Z

1

Assuming that the string you provided is a string (Based on the quotations);

import re

string = '(ep1270399)\nname=stet, johannes cornelis p/a ballast nedam infra b.v., p.o. box 1526 , city=3430 bm nieuwegein , country=nl \n\nname=bos, wilhelmus johannes p/a ballast nedam infra b.v., p.o. box 1526 , city=3430 bm nieuwegein , country=nl \n'

split = re.split(' |=|,|\n', string)
result = [split[idx + 1] for idx, value in enumerate(split) if value == 'name']

result

['stet', 'bos']

This allows you to extract all values after \nname=. However if this data is stored differently, you will need to display so in your question so I can better tailor an answer for you!

You should be able to transfer the regex across to any format however.

answered Dec 30, 2019 at 4:45

PacketLoss

5,7661 gold badge12 silver badges29 bronze badges

Sign up to request clarification or add additional context in comments.

Collectives™ on Stack Overflow

Extract a String After a Substring in a Pandas dataframe

1 Answer 1

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Related