Forum Home
Press F1
 
Thread ID: 56893 2005-04-17 23:00:00 Extracting links from webpages Midavalo (7253) Press F1
Post ID Timestamp Content User
346174 2005-04-17 23:00:00 Is there a tool (an add-on to IE6 perhaps) I can get from the net somewhere that I can use to extract page links into a txt or html file, along with the description (or whatever its called) of the link?

Thanks,

Midavalo.
Midavalo (7253)
346175 2005-04-17 23:07:00 Select the link and right mouse / copy shortcut? Speedy Gonzales (78)
346176 2005-04-17 23:13:00 Select the link and right mouse / copy shortcut?Thanks :) I meant all links on a page, at once. If there were only 3 or 4 links on a page then that method would be fine, but not when there are 100+ links per page, plus numerous pages.

Cheers,
M.
Midavalo (7253)
346177 2005-04-17 23:21:00 depending on what your wanting to do with the links.....i know some download mangers can extract all the links in a page (handy for bulk downloading). tweak'e (69)
346178 2005-04-17 23:23:00 The Lynx text browser (runs in a DOS window) can do exactly what you want except give you a description of the linked site. vinref (6194)
346179 2005-04-17 23:58:00 The Lynx text browser (runs in a DOS window) can do exactly what you want except give you a description of the linked site.What I mean by "description" is where in the html you include the link URL, but then just provide a word or two as the link... like <a href = www.pressf1.co.nz> Press F1 < or however it works. I'd call the "Press F1" bit the description.

M.
Midavalo (7253)
346180 2005-04-17 23:59:00 depending on what your wanting to do with the links.....i know some download mangers can extract all the links in a page (handy for bulk downloading).They're not download links that I want - just links to other pages/sites or wherever. I've got a download manager that can extract all the links, but it only outputs to the download manager, which is no use to me :)

cheers,
M.
Midavalo (7253)
346181 2005-04-18 03:17:00 File - Save Page As - Web page Rob99 (151)
346182 2005-04-18 03:21:00 Is there a tool (an add-on to IE6 perhaps) I can get from the net somewhere that I can use to extract page links into a txt or html file, along with the description (or whatever its called) of the link?

Thanks,

Midavalo.
What format would you like the links output? eg

site.com
site.com
site.com
site.com

or more advaced than that?
sal (67)
346183 2005-04-18 03:27:00 Save the webpage to temp dir then at the command prompt

i.e.

c:
cd \temp
find /i "http://" c:\temp\webpage.html.* >links.txt
notepad links.txt


This show the output each line with http to a file called links.txt
KiwiTT_NZ (233)
1 2