Class | Description |
---|---|
ExtractSpotsCLI | Retrieves all titles and anchors from Wikipedia pages, considering only articles, redirects, templates, and categories. |
GenerateSpotDocumentFrequencyCLI | Takes a file containing a list of spots (ordered lexicographically) and generates a new file containing, for each spot, its document frequency (the number of Wikipedia articles containing the spot as raw text or anchor). |
SpotCLI | Performs spotting over a document using the DictionarySpotter and prints to stdout the list of matched spots with their candidate entities. |
WriteOneSpotPerLineCLI | Takes as input two files: one with lines of the form <spot> <tab> <src entity> <tab> <target entity> (the output of ExtractSpotsCLI) and one with lines of the form <spot> <tab> <df(spot)> (the output of GenerateSpotDocumentFrequencyCLI), where df(spot) is the number of Wikipedia documents containing the text of the spot (as anchor or plain text). |
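
The document frequency described for GenerateSpotDocumentFrequencyCLI can be illustrated with a minimal sketch. This is not the package's implementation: the class `SpotDocumentFrequencySketch`, its methods, and the sample documents are hypothetical, and case-insensitive substring matching is an assumption about how a spot is found in a document.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper for illustration only; not part of the documented package.
final class SpotDocumentFrequencySketch {

    /**
     * Counts, for each spot, how many documents contain it at least once.
     * Matching is a plain case-insensitive substring test (an assumption).
     */
    static Map<String, Integer> documentFrequency(List<String> spots, List<String> documents) {
        // LinkedHashMap keeps the spots in their input (lexicographic) order.
        Map<String, Integer> df = new LinkedHashMap<>();
        for (String spot : spots) {
            String needle = spot.toLowerCase(Locale.ROOT);
            int count = 0;
            for (String doc : documents) {
                if (doc.toLowerCase(Locale.ROOT).contains(needle)) {
                    count++; // a document counts once, however often the spot occurs in it
                }
            }
            df.put(spot, count);
        }
        return df;
    }

    /** Formats one output line as <spot> <tab> <df(spot)>. */
    static String toLine(String spot, int df) {
        return spot + "\t" + df;
    }

    public static void main(String[] args) {
        List<String> spots = Arrays.asList("new york", "rome");
        List<String> docs = Arrays.asList(
                "New York is a city. New York is also a state.",
                "Rome is the capital of Italy.",
                "Flights from Rome to New York.");
        documentFrequency(spots, docs)
                .forEach((spot, n) -> System.out.println(toLine(spot, n)));
    }
}
```

In this sketch both spots appear in two of the three sample documents, so each gets a document frequency of 2. The repeated mention of "New York" in the first document does not raise the count, which is what distinguishes document frequency from raw occurrence counts.
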
Copyright © 2013. All rights reserved.