Skip to main content
Filter by
Sorted by
Tagged with
0 votes
1 answer
35 views

I am scraping the Dead by Daylight Fandom wiki (specifically TOME pages, e.g., https://deadbydaylight.fandom.com/wiki/Tome_1_-_Awakening) to extract memory logs. The goal is to extract the Memory ...
zeromiedo's user avatar
0 votes
2 answers
202 views

From the below structure I only want value of href attribute. But rec_block is returning h5 element without its children so basically <h5 class="series">Recommendations</h5>. <...
Emby's user avatar
  • 1
0 votes
0 answers
56 views

I'm using java spring boot and jsoup and recently I upgraded jsoup version to 1.21.1. My code creates search query and searches for it in the document Elements targetElements = document.select(...
user613's user avatar
  • 243
Advice
1 vote
0 replies
96 views

I'm not a Python specialist, so bear with me. I'm trying to replace a Perl HTML::TokeParser based parser that I use for template foreign language translation to use Python html.parser. Here's the ...
Hugh Barnard's user avatar
3 votes
1 answer
61 views

I'm rather new to using Beautiful Soup and I'm having some issues splitting some html correctly by only looking at html breaks and ignoring other html elements such as changes in font color etc. The ...
James Brian's user avatar
0 votes
1 answer
47 views

I have a html where I want to get elements with class="a" and id="c d". If I have only one of them, I can use soup.select('[class="a"]') and soup.select('[id="c d&...
Akira's user avatar
  • 2,820
0 votes
0 answers
99 views

I've seen the docs https://marked.js.org/using_pro#renderer and it has no example for the list i want to customize more detail https://github.com/markedjs/marked/blob/master/src/Tokens.ts#L137 as the ...
zummon's user avatar
  • 996
4 votes
5 answers
184 views

I have an html page that has data like so: <td><a href="test-2025-03-24_17-05.log">test-2025-03-24_17-05.log</a></td> <td><a href="PASS_report_test_2025-...
Archie's user avatar
  • 389
3 votes
1 answer
93 views

My intent was to give an advise on the question Delete everything between two strings (inclusive) to use the HTMLDocument parser instead of a text based replace command. But somehow the OuterHTML ...
iRon's user avatar
  • 24.4k
-1 votes
1 answer
51 views

my programming teacher made us program in python a calculator for calculating fuel consummation in L/100KM and i decided to go further and even have it calculate the price per 100km but heres the ...
VXV's user avatar
  • 1
1 vote
0 answers
31 views

I'm making an app where students can login to there portal website and it shows their data, however I'm having trouble authenticated users, when I did this project on another website I used ...
abtlb's user avatar
  • 11
1 vote
2 answers
92 views

I'm trying to extract the text of a book from a Wikisource page using BeautifulSoup, but the result is always empty. The page I'm working on is Le Père Goriot by Balzac. Here's the code I'm using: ...
Hugo Durif's user avatar
-1 votes
2 answers
84 views

The idea is: i wanna collect the name of the flat and its price as a list for every flat on the website. Ive made a simple parser on python, but looks like i cant get any values, since it returns an ...
Danny Mxxre's user avatar
1 vote
1 answer
150 views

I am trying to scrape the pitching stats on this url and then save the dataframe to a csv file. https://www.baseball-reference.com/boxes/ARI/ARI202204070.shtml My current code is below (Python 3.9.7) ...
Preston Albury's user avatar
0 votes
1 answer
52 views

I am trying to webscrape the league table for the EPL, but when I do that I am getting duplicate links as well as links to the teams that are not even in the premier league which makes no sense. Here ...
Vignesh's user avatar
  • 27
-1 votes
2 answers
117 views

Here are my codes: `import pandas as pd import requests from bs4 import BeautifulSoup url = "https://www.payscale.com/college-salary-report/majors-that-pay-you-back/bachelors/" response = ...
Dave's user avatar
  • 33
1 vote
1 answer
168 views

I am building a web-server. I am trying to build a function handler that parses the index.html file in the root directory. It works but when I go to the website on my localhost 127.0.0.1:8080 I get ...
Codemon's user avatar
  • 11
1 vote
1 answer
65 views

Where I work uses the Fire Weather Forecast product from the National Weather Service to produce a product for fire management officers that has the fire weather specific to their area. We have been ...
Giric Red Wolf's user avatar
0 votes
1 answer
31 views

Using beautifulsoup and selenium in python, I am trying to scroll down a list of songs in a playlist to parse the song names. The code however will not get past the first 30 songs and scroll down ...
BouckleyBoy's user avatar
1 vote
1 answer
116 views

I am creating a angular shcematics project to propose suggestions to my angular project. I am trying to use the built in angular compiler to parse the code because libraries such as parse5 and ...
Jonathan's user avatar
  • 461
-7 votes
1 answer
124 views

I want to replace the following hyperlinks dynamically from <a href="/xsearch2?q=some search/21">21</a> to <a href="/xsearch2?q=some search&page=21">21</a&...
KTH Clips's user avatar
0 votes
0 answers
60 views

I am having trouble parsing an HTML page on wikipedia. I want to get all text between two headings. I can get all text in the HTML wiki seperated by newline by executing the following in python: ...
MattJ's user avatar
  • 149
-1 votes
1 answer
30 views

I am trying to parse https://rateyourmusic.com/release/album/tyler-the-creator/igor/reviews/1/ I can access the divs that have class_=review_body if I download the html files locally on to my system. ...
Nate's user avatar
  • 1
1 vote
1 answer
93 views

I am trying to extract data from an HTML table with perl, using HTML::TableExtract. Specifically I am trying to grab some rushing stats for the 2024 Baltimore Ravens from Pro Football Referemce. The ...
JimZipCode's user avatar
0 votes
0 answers
78 views

I am trying to parse an html input using jsoup (v1.18.1), extract elements, extract each attribute value and replace as follows: > with &gt < with &lt The method I'm feeding this code ...
Pallavi's user avatar
1 vote
0 answers
56 views

I have @GET endpoint in response to which I receive an html code with URLs. I need to reach the URL that comes after 200 code. Do you have any idea how to do it in Android? I already tried to use the ...
Alex20280's user avatar
  • 385
0 votes
0 answers
24 views

I'm trying to use the PHP Simple HTML DOM Parser for the first time from here - https://simplehtmldom.sourceforge.io/docs/1.9/index.html Unfortunately, I'm having an issue where it's not returning ...
Lewis Hardisty's user avatar
1 vote
1 answer
2k views

I'm using nodriver and it's not directly supporting network methods. But it does support for several CDP objects (network: https://ultrafunkamsterdam.github.io/nodriver/nodriver/cdp/network.html) and ...
Aca's user avatar
  • 67
-2 votes
1 answer
54 views

I have a bookmarklet and JavaScript with which I am extracting data from an html table from a website. For the most part the script works fine however it parses the date wrong. The date, in the HTML ...
SystemWorks's user avatar
0 votes
1 answer
73 views

My question is more about html layout and parsing dynamic of content. My task: parse contacts who read my particular message in the Group. I tried to see DOM structure for the DIV block that hold that ...
Jeffrey Rasmussen's user avatar
0 votes
1 answer
62 views

I have written Apps Script code for Html Parsing using XmlParse. It works fine at my end, my browser and system language both are English as well as my Google Account's. But when I shared the same ...
Amna Irfan's user avatar
0 votes
1 answer
36 views

everyone. I've never did JS coding before but I needed a certain extension that I couldn't find in the shop. So I've decided to make my own. Here is the logic: when you right-click the link you get ...
aleksds1's user avatar
2 votes
1 answer
133 views

By default it seems that html.parser.HTMLParser cannot handle self closing tags correctly, if they are not terminated using /. E.g. it handles <img src="asfd"/> fine, but it ...
flawr's user avatar
  • 11.7k
4 votes
1 answer
148 views

Unlike XHTML, HTML does not allow separate closing tags for empty-content elements like br and hr. The HTML validator gives an error end tag for element "..." which is not open in such ...
Heiko Theißen's user avatar
-1 votes
2 answers
36 views

I am trying to scrape earthquake weather data from USGS and my code runs up to the print(soup) line but nothing after that import requests from bs4 import BeautifulSoup url="https://earthquake....
Lumko Mtengwane's user avatar
1 vote
1 answer
40 views

Hi so I am very new to web scraping and I am trying out the basics for it. Right now, I wanted to extract links from a root website (coventry.gov.uk). The problem was, however, I could not get the ...
Gs can't's user avatar
0 votes
1 answer
60 views

I am not a programmer but I've been digging through the weeds to figure something out on my own and I'm stuck. I have a google spreadsheet with multiple sheets that I need to populate with content ...
notobella designs's user avatar
0 votes
1 answer
112 views

I have this webpage (https://goldapple.ru/) on which I want to parse some data about cosmetics. However, when I open the webpage, the popup button appears, and I want to click the left "Да, верно&...
Alexei Rozhenko's user avatar
1 vote
0 answers
57 views

I am working with an SVG file and converting it to JSON using svgson library. Additionally, I am using the svg-path-to-polygons library to decode the d attribute in the path element. However, I am ...
HEMAL's user avatar
  • 430
1 vote
1 answer
65 views

In my implementation of adding charts to a react frontend, from gsheets, using an apps script backend, there seems to be some sort of an issue where my constructed base64 png string fails to be parsed ...
mayank's user avatar
  • 378
0 votes
0 answers
538 views

I want to read a webpage and split it into chunks to feed a vector database in a RAG pipeline. This webpage has python code examples on it, but I cannot create chunks with that code text, it is ...
Abraham Martín Expósito's user avatar
-1 votes
1 answer
110 views

I'm using js-beautify to beautify my HTML like this: import { html_beautify } from 'js-beautify'; // later in the component html_beautify(localHtmlContent, {indent_size: 2}); which makes my html go ...
Filip Savic's user avatar
  • 3,363
0 votes
1 answer
312 views

I am building an application that receives HTML content as strings. I need to verify that these HTML strings are well-formed, meaning I want to parse them and detect lines with errors. During my ...
Juan Rojas's user avatar
0 votes
1 answer
164 views

As the title suggests, I'm having trouble scraping a table from Baseball Reference. I want to scrape the first 2 tables from here. To be clear, the ones titled "Team Standard Batting" and &...
pcm1113's user avatar
-1 votes
1 answer
643 views

Was hoping for help here. I'm trying to web scrape this second table of player goal and shot creation stats on FB Ref for the MLS, but my script is bringing in the first table of team statistics ...
user15039720's user avatar
0 votes
1 answer
134 views

Whatwg spec describes conception of the speculative HTML parsing. So, there are many places in spec with the term active speculative parser. Spec says that HTML parser that owns an instance of ...
MaximPro's user avatar
  • 556
1 vote
1 answer
443 views

Research So I've found this answer on how to create a vnode list from a simple SVG with one path layer and how to transform that in Vue2. I could not find any good solutions for Vue 3, so I scaffolded ...
Nebulosar's user avatar
  • 1,897
0 votes
1 answer
41 views

I want a regex that will look at a string like this, get the "card" value from each of these comment blocks and also, TRUE if there is a "disabled":true or "hide":true ...
user18102663's user avatar
1 vote
1 answer
247 views

Python novice here. I have been learning how to scrape from various baseball sites (Fangraphs, Statcast, Rotowire). I have had success with a few different methods, but the Park Factors table on ...
gredow1979's user avatar
0 votes
1 answer
1k views

I am getting html code and showing like below image using flutter_widget_from_html package. But now I need to style like it on the website. I tried to find a guide to do this but had no luck. I just ...
Kavinda Lochana's user avatar

1
2 3 4 5
122