Modifier and Type | Method and Description |
---|---|
void |
SpotManager.add(Function<String> fun)
Adds a new function to the pipeline
|
Modifier and Type | Class and Description |
---|---|
class |
NopFunction<T>
Implements the identity function, it gives in output the input object.
|
Constructor and Description |
---|
Pipe(Function<T> fun) |
Pipe(Pipe<T> previous,
Function<T> fun) |
Modifier and Type | Class and Description |
---|---|
class |
Cleaner<T>
A cleaner performs a simple transformation over an object.
|
class |
HtmlCleaner
Converts javascript strings in ascii
|
class |
JuniorAndInitialsCleaner
JuniorAndInitialsCleaner trasforms a name removing initials or 'jr' (e.g.,
ted ginn, jr -> ted ginn ) |
class |
LowerCaseCleaner
LowerCaseCleaner, puts in lowercase the text of a spot
|
class |
ParenthesesCleaner
Parenthesis cleaner removes parenthesis from a spot.
|
class |
PrefixCleaner
Removes a matching prefix
|
class |
QuotesCleaner
Quotes cleaner removes quotes from a spot.
|
class |
StripCleaner
StripCleaner trims characters at the beginning of at the end of a spot.
|
class |
TemplateCleaner
TemplateCleaner removes all the text matching TEMPLATE[....]
|
class |
TypeCleaner
Typecleaner will remove domain informations usually appended at the end of
the titles (between parenthesis or after a #),
e.g. |
class |
UnderscoreCleaner
UnderscoreCleaner removes all the underscores in a post, replacing them with
spaces.
|
class |
UnicodeCleaner
UnicodeCleaner maps a Unicode string to ascii, the conversion is lossy, and
it is performed using the unidecode library.
|
Modifier and Type | Class and Description |
---|---|
class |
AsciiFilter
Ascii filters out all spot that do not contain alphabetic characters
|
class |
CommonnessFilter
CommonnessFilter, filters entities with low probability to be linked with the
spot
|
class |
Filter<T>
Filter allows to remove a given spot if it does not respect a filter
constraint.
|
class |
FrequencyFilter
FrequencyFilter removes spots that are not associated to any entity
|
class |
ImageFilter
Deprecated.
depends on language
|
class |
LengthFilter
LengthFilter filters out spots shorter than a given length (default is 3).
|
class |
LongSpotFilter
LongSpotFilter filters out all the spots that contains more than a given
number of terms (default is 6);
|
class |
NumberFilter
NumberFilter filters out all the spots containing only numbers
|
class |
ProbabilityFilter
ProbabilityFilter removes all the spots with link probability threshold lower
than a certain threshold.
|
class |
SymbolFilter
SymbolFilter filters out all the spotsF that do not contain alphabetic
characters
|
class |
TemplateFilter
Deprecated.
depends on language
|
Modifier and Type | Class and Description |
---|---|
class |
CityMapper
CityMapper manages labels of the type "city, country" (e.g., ada, wisconsin),
returning only the name of the city
|
class |
Mapper<T>
Mapper given a spot returns several different versions of the spot.
|
class |
QuotesMapper
QuotesMapper add text between quotes that appears inside the labels and text
appearing outsite the labels
(e.g., dave "baby" cortez -> baby, dave cortez ) |
class |
TypeMapper
Deprecated.
better as cleaner
|
Copyright © 2013. All rights reserved.