Name |
Description |
CharFilter |
Removes a single character, but not a number. |
DomainFilter |
Remove domain names |
EmailFilter |
Remove the email address |
PossessiveNounFilter |
Remove 's from tokens |
PunctuationFilter |
Remove the punctuation |
QuotesFilter |
Remove quotes from tokenized documents |
SpacePunctuationFilter |
Used to put in spaces when punctuation is used. |
StopWordsFilter |
Load the stop words |
StripTagsFilter |
Remove html from the text; |
UrlFilter |
Removes urls from the word |
WhitespaceFilter |
Remove extract whitespaces |