| |
| home | Author page | editor login |
The purpose of this tool is simply get a output of urls and titles from a html fragment or a url because DMOZ URL cleaning engine (used on Add a page of links to unreviewed) sometimes cannot recognize the URLs inside a webpage and, instead of return all urls, it return only one or two urls. Test with both tools (official Dmoz and this one) the URL http://grandeminas.globo.com/unainet/index_jornais.htm. There are some improvements to add on Clean HTML but its working.
Put the HTML Fragment or the URL. If you choice the ouput type URL and Titles you will get a html fragment that can be parsed by Dmoz official multilinks tool.
This tool was built with PHP in LPGL license. You can read the sourcecode of clearhtml with highlights and without highlights. You can also download here (click on save as)
The author is Roberto Berto (darkelder at dmoz) or at his homepage.
This site is kindly hosted by TeHospedo, check it hospedagem de sites Linux e Windows.