how to regex this link?

Question

I want to regex a list of URLs.
The links format looks like this:
`https://en.wikipedia.org/wiki/Alexander_Pushkin'

The part I need:
en.wikipedia.org

Can you help, please?

MatsLindh · Accepted Answer · 2022-03-15 20:23:28Z

1

Instead of looking for \w etc. which would only match the domain, you're effectively looking for anything up to where the URL arguments start (the first ?):

re.search(r'[^?]*', URL)

This means: from the beginning of the string (search), all characters that are not ?. A character class beginning with ^ negates the class, i.e. not matching instead of matching.

This gives you a match object, where [0] will be the URL you're looking for.

answered Mar 15, 2022 at 20:23

MatsLindh

53.5k5 gold badges70 silver badges97 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Antoine Delia · Accepted Answer · 2022-03-15 20:27:15Z

1

You can do that wihtout using regex by leveraging urllib.parse.urlparse

from urllib.parse import urlparse

url = "https://sales-office.ae/axcapital/damaclagoons/?cm_id=14981686043_130222322842_553881409427_kwd-1434230410787_m__g_&gclid=Cj0KCQiAxc6PBhCEARIsAH8Hff2k3IHDPpViVTzUfxx4NRD-fSsfWkCDT-ywLPY2C6OrdTP36x431QsaAt2dEALw_wcB"

parsed_url = urlparse(url)
print(f"{parsed_url.scheme}://{parsed_url.netloc}{parsed_url.path}")

Outputs

https://sales-office.ae/axcapital/damaclagoons/

answered Mar 15, 2022 at 20:27

Antoine Delia

1,8456 gold badges30 silver badges45 bronze badges

Comments

mbg131 · Accepted Answer · 2022-03-15 20:25:34Z

0

Based on your example, this looks like it would work:

\w+://\S+\.\w+\/\S+\/

answered Mar 15, 2022 at 20:25

mbg131

712 bronze badges

Comments

re-za · Accepted Answer · 2022-03-15 20:29:50Z

0

Based on: How to match "anything up until this sequence of characters" in a regular expression?

.+?(?=\?)

so:

re.findall(".+?(?=\?)", URL)

answered Mar 15, 2022 at 20:29

re-za

8004 silver badges10 bronze badges

Collectives™ on Stack Overflow

how to regex this link?

4 Answers 4

Comments

Comments

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

Comments

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related