| Class | Description |
|---|---|
| ExtractSpotsCLI |
Retrieves all the titles and anchors from the Wikipedia articles, considers
only articles, redirects, templates and categories.
|
| GenerateSpotDocumentFrequencyCLI |
Takes a file containing a list of spots (ordered lexicographically) and
generates a new file containing for each spots it document frequency (the
number of articles in Wikipedia containing the spot as raw text or anchor).
|
| SpotCLI |
Performs the spotting over a document using the
DictionarySpotter,
prints on the stout the list of the matched spots with their candidate entities. |
| WriteOneSpotPerLineCLI |
Gets in input a file containing :
<spot> <tab> <src entity> <tab> <target entity> (the output of ExtractSpotCLI) and a file: <spot> <tab> <df(spot)> (the output of GenerateSpotDocumentFrequencyCLI) where df(spot) is
the number of wikipedia documents containing the text of the spot (as anchor
or simple text). |
Copyright © 2013. All rights reserved.