49 questions
1
vote
1
answer
93
views
Perl's HTML::TableExtract does not see all the tables on Pro Football Reference pages
I am trying to extract data from an HTML table with perl, using HTML::TableExtract.
Specifically I am trying to grab some rushing stats for the 2024 Baltimore Ravens from Pro Football Referemce. The ...
0
votes
0
answers
56
views
Excel formatting (RowSpan) messed up
I am exporting a HTML Table using this below.
var template = `<html xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:x="urn:schemas-microsoft-com:office:excel" xmlns=&...
0
votes
0
answers
178
views
How to extract tables and images from website's json object?
"table-wrap": {
"@id": "ams2322-tbl-0001",
"@xml:lang": "en",
"@orientation": "portrait",...
1
vote
1
answer
1k
views
How to get the value present in inner html of the td tag in table
index.html
<tr>
<td>
<input type="text" name="id" id="id" value="<%=rn%>">
</td>
...
0
votes
2
answers
1k
views
Is there a way to read and collect EMF image file in python? Can we read a EMF image with OpenCV ? How to convert it in jpg or png?
I am searching for a solution for a long time but couldn't be able to find it. There are more similar qestion-answers but that didn't help me.
Basically
I have some word documents (xxx.docx) having ...
1
vote
0
answers
42
views
What's Wrong: rvest's Error 'in open.connection(x, "rb") and readHTMLTable ()`s "XML contents does not seem to be XML"? [duplicate]
I was trying to import (scrape) sets of tables on a news blog post online using xml2's read_html(), hence html.table() and XML::readHTMLTable() functions.
I got no table or anything useful with
...
-2
votes
1
answer
238
views
How to scrape the product information from the page using Beautiful Soup in which html table are involved [closed]
import requests
from bs4 import BeautifulSoup
import pandas as pd
baseurl='https://books.toscrape.com/'
headers ={
'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML,...
2
votes
1
answer
3k
views
convert html table to csv using pandas python
That's My code is working fine.
import pandas as pd
html_data = """<table id="example" class="table table-hover dataTable no-footer" role="grid" aria-...
1
vote
2
answers
181
views
How to Extract tables from fedex website using selenium into pandas dataframe
I am trying to extract the tables in the fedex website () using selenium and beautiful soup into dataframes using id but nothing comes up into dataframe
url = 'https://www.fedex.com/en-us/shipping/...
2
votes
1
answer
79
views
Python Beautifulsoup htmltableextraction issue
I'm trying to extract data from html table which is bit unstructured.
HTML Table structure as below (sample data) -
Able to extract data but facing issue with the "ID" column. "ID"...
0
votes
2
answers
358
views
How to merge content from multiple HMTL files in a single one?
I have more than 100 html files with the following structure.
<html>
<head>
<body>
<TABLE>
...
</TABLE>
<TABLE>
<TR>
&...
0
votes
1
answer
525
views
Python find data in table row
Lets say i have a html table with 10 columns and 100 rows all i want to do is to use Beautifulsoup to look up for a data if exits and print full row.
import urllib3
from bs4 import BeautifulSou
http =...
0
votes
1
answer
57
views
How to get data from a web table using python?
I am having trouble trying to get the data from this web table. I was wondering if anyone could shed some light on my situation, thanks!
HTML:
0
votes
1
answer
89
views
Capturing the web table by VBA without page break
I would like to capture the full set of data within the table within
https://mis.twse.com.tw/stock/sblInquiryCap.jsp?lang=en_us#
I was using the codes from the other post but I could only grab the ...
0
votes
1
answer
164
views
Download files with perl lwp linkextractor
I am trying to download a file from a web page.
First I get the links with the linkextractor and then I want to download them with the lwp
I'm a newbie programming in perl.
I made the following code ...
0
votes
1
answer
142
views
how to retrieve table, excluding some tags from it
I am trying to scrape data from html table(main_table), using css selectors. The problem is that when I am trying to get all rows(tr), I get extra rows from inner_table, which is inside the main_table,...
0
votes
0
answers
129
views
How to find specific html table segment, just after a particular keyword(multiple occurrence) in HTML document using Jsoup
I am using jsoup for HTML table parsing. Below is the scenario where I have to identify the correct segment. The process for identifying the correct segment is:
Wherever I find the keyword -> ABC , ...
3
votes
1
answer
1k
views
Extract html table into R but skip specific elements (<br>) in one certain column that has three lines/rows, in a multi-page html
I want to load the table in a link into a dataframe in R.
The following script successfully extracts the table:
install.packages("htmltab")
library(htmltabl)
url <- "http://www.hmdb.ca/...
0
votes
1
answer
567
views
Extract data from HTML table and put it in a text file with shell
I need a shell script to get a public password for VPN from a site (which refreshes the password everyday more or less). The password is a HTML table, in a specific line of the HTML code of the web ...
-1
votes
1
answer
39
views
Results extraction from website
I was trying to extract results from similar table:
https://data.fei.org/Result/ResultList.aspx?p=D6E828828E450E2880525ABCEE800008C1D381967CBAF718D9DE41BEBA3B9F06
I have applied following code but ...
-2
votes
1
answer
169
views
HTML::TableExtract doesn't work [closed]
The input html is attached (my $file), With the following script, I cannot extract the table I want. Any suggestions?
use strict;
use warnings;
use HTML::TableExtract;
my $file="view-source_www....
2
votes
1
answer
8k
views
How to export table into csv file in angular 2
How to export data table on my angular application UI into a CSV file or xls spreadsheet. Looking to have a button to be displayed on top of the table clicking on which can do this export. Please ...
1
vote
1
answer
527
views
Perl Mechanize identify content between span tag within specific div tag
Perl WWW::Mechanize::Firefox has successfully retrieved the contents of the web page, and stored in the scalar variable $content.
my $url = 'http://finance.yahoo.com/quote/AAPL/financials?p=AAPL';
$...
0
votes
1
answer
30
views
Macro for Multiple links
How do I loop/run the code According to a list in .txt file, until the list is ends ??
How do I loop/run the code According to a list in .txt file, until the list is ends ??
Sub FundData()
Dim ...
1
vote
4
answers
273
views
HTML::TableExtract an HTTPS site
I've created a perl script to use HTML::TableExtract to scrape data from tables on a site.
It works great to dump out table data for unsecured sites (i.e. HTTP site), but when I try HTTPS sites, it ...
0
votes
1
answer
81
views
PHP parse HTML tables and make them correct HTML5 tables
I'm looking for the best way to clean up old HTML tables (with PHP) so that they are correct HTML5 tables - it's mostly a matter of stripping not allowed attributes. In addition to that, I'd also like ...
-2
votes
1
answer
130
views
Aligning items in td
How to align the '1' and '2' that shows when a player click on the <td> centrally within the element?
Current version can be found here.
1
vote
1
answer
11k
views
how to loop through a table in selenium?
I am new to selenium and I have this question where I need to loop through a table and get the values in that table
<table>
<tr>
<td style="width:5px">
</td&...
0
votes
0
answers
578
views
How to get a value having same class name in a table using jquery?
I have a table like this, from this site Flipkart.com
<table cellspacing="0" class="specTable">
<tr><th class="groupHead" colspan="2">GENERAL FEATURES</th></tr>
<tr&...
1
vote
2
answers
23k
views
How to get TD cell values using jquery
I would like to get the text from the first and second TD which has the class user and id
<tr class="item-model-number">
<td class="label">Item model number</td>
<td class="...
-1
votes
1
answer
251
views
how can i extract special kind of table from website in perl?
I am trying to fetch all tables from the website http://finance.yahoo.com/etf/lists/?bypass=true&mod_id=mediaquotesetf&tab=tab1&scol=imkt&stype=desc&rcnt=50&page=1, using Perl ...
-1
votes
1
answer
4k
views
how to select 6th tr's td in a html table using vba code
tr---- 0495024988
14.08.1996
04/04/130/02514/AM96/
23.01.1996
0495024988
6. tr----(here the text is there which i have copy to my excel sheet)
here i have read many post where the javascript ...
0
votes
1
answer
49
views
readHTMLTable returns dataframe values as integers
Can someone explain why the marked line below returns numbers instead of setting the column names the strings in that row? How would I get the correct column names if I include the commented out ...
0
votes
1
answer
393
views
VBA web scraping
I am trying to get a row of data from this table on this website: http://www.nasdaq.com/symbol/neog/financials?query=balance-sheet
Now I can manage to get the "total liabilities" row using the
doc....
0
votes
1
answer
838
views
Not getting the value of an input element inside <td> while exporting table to csv
Html code
<html>
<td><input id="id1" value="3000" a type="text" /></td>
</html>
I need the value 3000 to be in the csv file please help
My Function
&...
6
votes
1
answer
20k
views
parsing/extracting a HTML Table, Website in Java
I want to parse the contents of this HTML table :
Here is the full website with source code:
http://www.kantschule-falkensee.de/uploads/dmiadgspahw/klassen/A_Klasse_11.htm
I want to parse the ...
2
votes
1
answer
237
views
Printing all HTML Tables with certain string for multiple files (perl)
I am trying to print all the HTML tables containing the string "kcat" for each xml file in a directory but I am having some trouble. Note that each file in the directory (named kcat_tables) has at ...
0
votes
1
answer
1k
views
Extract HTML table content based on "thead"
Here is a basic HTML table :
<table>
<thead>
<td class="foo">bar</td>
</thead>
<tbody>
<td>rows</td>
…
</tbody>
</table>...
1
vote
2
answers
142
views
changed dom when added input field form php
code for order.html.php:
<body>
<p> Place order</p>
<table>
<thead>
<tr>
<th>Item No.</th>
<th>Name</th&...
0
votes
1
answer
233
views
perl HTML::TableExtract out of range error
I am having difficulty recovering data from within an HTML table.
Here is what I have.
use strict;
use warnings;
use HTML::TreeBuilder;
use HTML::TableExtract qw(tree); #
use WWW::Mechanize;
my $d =...
2
votes
1
answer
388
views
Can't get content of <tbody> when parsing HTML in Perl
I HAVE SOLVED THIS:Turns out the page I was loading with WWW::Mechanize uses AJAX to load all the content that is inside the <tbody> so it is not loaded when I created the $html variable.
Now I ...
0
votes
2
answers
2k
views
retrieving data from a html table using php
I am aware that this question has been asked many times but I have looked into many examples and I have still been unable to get the data I need out of this html table.
I have a php file that ...
3
votes
1
answer
685
views
extracting values from html table using beautifulsoup4 (2nd row onwards, 1st and 6th column)
I am new to python and need some guidance on extracting values from specific cells from a HTML table.
The URL that I am working on can be found here
I am looking to get the first 5 values only in ...
0
votes
2
answers
2k
views
Perl printing rows and columns from HTML table
Here's my temp.html
<table border="1">
<tr>
<td>row 1, cell 1</td>
<td>row 1, cell 2</td>
</tr>
<tr>
<td>row 2, cell 1</td>
<td>row 2,...
3
votes
2
answers
1k
views
Getting links from an HTML table using HTML::TableExtract and HTML::Extor in Perl
My goal is to extract the links from the tables titled "Agonists," "Antagonists," and "Allosteric Regulators" in the following site:
http://www.iuphar-db.org/DATABASE/ObjectDisplayForward?objectId=1&...
1
vote
1
answer
3k
views
Trying to use HTML::TableExtract in Perl to extract table from HTML file, but failing
I am trying to extract information for each G protein-coupled receptor from tables from a site such as the following:
http://www.iuphar-db.org/DATABASE/ObjectDisplayForward?objectId=1&familyId=1
...
2
votes
1
answer
635
views
HTML::TableExtract with a table inside of a table
I have a small script that I am using to manipulate the code from a remote URL (code is separate). The manual page for HTML::TableExtract has the following code section relating to doing a table-in-a-...
1
vote
1
answer
560
views
How to use the Perl TableExtract rows method when there are duplicate Header fields
I'm trying to use TableExtract to parse a table where some of the headers have the same name, but different data. The rows method is returning the values for the first headers for both sets of headers....
0
votes
1
answer
710
views
HTML::TableExtract - passing an expression for headers
I have a subroutine that is called through another script to read the HTML file. Below is the code.
sub read_html
{
$data=`cat "$_[0]"`;
use HTML::TableExtract;
print "CALLING read_html ...