com.twitter.common.text.extractor
Class URLExtractor

java.lang.Object
  extended by org.apache.lucene.util.AttributeSource
      extended by com.twitter.common.text.token.TokenStream
          extended by com.twitter.common.text.extractor.RegexExtractor
              extended by com.twitter.common.text.extractor.URLExtractor

public class URLExtractor
extends RegexExtractor

Extracts URLs from text, using the canonical URL pattern defined in the Regex class of the twitter-text-java library.
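
To illustrate the general idea of regex-driven extraction that this class builds on, here is a minimal sketch using only java.util.regex. The class name SimpleUrlExtractor, the method extract, and the pattern are all hypothetical and deliberately simplified. The pattern is not the twitter-text-java pattern, which covers many more cases (bare domains, trailing punctuation, internationalized hosts), and the sketch does not use the TokenStream API of this package.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class SimpleUrlExtractor {
  // Hypothetical, simplified pattern: an http or https scheme followed by
  // any run of non-whitespace characters. NOT the twitter-text-java pattern.
  private static final Pattern SIMPLE_URL = Pattern.compile("https?://\\S+");

  // Returns every substring of the input that matches the simplified pattern,
  // in order of appearance.
  static List<String> extract(CharSequence text) {
    List<String> urls = new ArrayList<String>();
    Matcher matcher = SIMPLE_URL.matcher(text);
    while (matcher.find()) {
      urls.add(matcher.group());
    }
    return urls;
  }

  public static void main(String[] args) {
    // Prints the two URLs found in the sample sentence.
    System.out.println(extract("See http://example.com/a and https://t.co/xyz now"));
  }
}
```

With the real class, the inherited reset, incrementToken, and toStringList methods listed below would presumably take the place of the manual Matcher loop; their exact signatures are not shown on this page.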


Nested Class Summary
 
Nested classes/interfaces inherited from class com.twitter.common.text.extractor.RegexExtractor
RegexExtractor.AbstractBuilder<N extends RegexExtractor,T extends RegexExtractor.AbstractBuilder<N,T>>, RegexExtractor.Builder
 
Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource
org.apache.lucene.util.AttributeSource.AttributeFactory, org.apache.lucene.util.AttributeSource.State
 
Constructor Summary
URLExtractor()
          Default constructor.
 
Method Summary
 
Methods inherited from class com.twitter.common.text.extractor.RegexExtractor
incrementToken, reset, setRegexPattern, setRegexPattern, setTriggeringChar
 
Methods inherited from class com.twitter.common.text.token.TokenStream
getInstanceOf, toStringList
 
Methods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, restoreState, toString
 
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
 

Constructor Detail

URLExtractor

public URLExtractor()
Default constructor.