|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectclothing_search_engine.crawler.PageExtractor
public class PageExtractor
Constructor Summary | |
---|---|
PageExtractor(java.lang.String m_range)
Constructor |
Method Summary | |
---|---|
java.lang.String |
cleanPage(java.lang.String webPage)
Clean the page to make it useful for search engine analysis |
java.lang.String |
getPage(java.net.URL url)
Return the entire webpage into one string |
static void |
main(java.lang.String[] args)
|
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public PageExtractor(java.lang.String m_range)
m_range
- regular expression of range of pageMethod Detail |
---|
public java.lang.String getPage(java.net.URL url)
url
- url of webpage to get
public java.lang.String cleanPage(java.lang.String webPage)
webPage
- unprocessed webpage
public static void main(java.lang.String[] args)
args
-
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |