
A

A_OR_THE - Static variable in class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.PrefixCleaner
 
add(EntityMatch) - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatchList
 
add(Node) - Method in interface it.cnr.isti.hpc.dexter.graph.NodesWriter
 
add(Node) - Method in class it.cnr.isti.hpc.dexter.graph.ram.RamNodes
 
add(int, String) - Method in interface it.cnr.isti.hpc.dexter.label.IdToLabelWriter
Adds the mapping id -> label.
add(String, int) - Method in interface it.cnr.isti.hpc.dexter.label.LabelToIdWriter
Adds the mapping label -> id
add(int, String) - Method in class it.cnr.isti.hpc.dexter.label.mapdb.MapDBIdToLabel
 
add(String, int) - Method in class it.cnr.isti.hpc.dexter.label.mapdb.MapDBLabelToId
 
add(Function<String>) - Method in class it.cnr.isti.hpc.dexter.spot.clean.SpotManager
Adds a new function to the pipeline
add(SpotMatch) - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatchList
 
addDocument(Article) - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
Indexes a Wikipedia Article
addDocument(int, String) - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
Adds a Wikipedia Article (added just for testing)
addField(Field) - Method in class it.cnr.isti.hpc.dexter.document.Document
Adds a field to a document.
addField(Field) - Method in class it.cnr.isti.hpc.dexter.document.FlatDocument
 
addField(Field) - Method in class it.cnr.isti.hpc.dexter.document.MultifieldDocument
Adds a field to a document.
addFilter(Filter<Spot>) - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader
Adds a filter to the reader.
AddRedirectIdCLI - Class in it.cnr.isti.hpc.dexter.cli.label
Takes a file containing a list of serialized TitleRedirectId objects sorted by title and creates a file containing only the mapping between the redirect text (in the first column, instead of the second) and the id of the target article.
AddRedirectIdCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.label.AddRedirectIdCLI
 
ArticleContextExtractor - Class in it.cnr.isti.hpc.dexter.spot
For each label, extracts the context (the text around the label).
ArticleContextExtractor(Article) - Constructor for class it.cnr.isti.hpc.dexter.spot.ArticleContextExtractor
 
ArticleContextExtractor(Field) - Constructor for class it.cnr.isti.hpc.dexter.spot.ArticleContextExtractor
 
AsciiFilter - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.filter
AsciiFilter filters out all spots that do not contain alphabetic characters.
AsciiFilter() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.AsciiFilter
 

C

CityMapper - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.mapper
CityMapper manages labels of the type "city, country" (e.g., ada, wisconsin), returning only the name of the city
CityMapper() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.mapper.CityMapper
 
clean(String) - Method in class it.cnr.isti.hpc.dexter.spot.clean.SpotManager
Cleans an anchor, i.e., applies to the text only the cleaners previously added to the pipe.
clean(T) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.Cleaner
Returns a spot cleaned according to a particular policy.
clean(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.HtmlCleaner
 
clean(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.JuniorAndInitialsCleaner
 
clean(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.LowerCaseCleaner
 
clean(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.ParenthesesCleaner
 
clean(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.PrefixCleaner
 
clean(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.QuotesCleaner
 
clean(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.StripCleaner
 
clean(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.TemplateCleaner
 
clean(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.TypeCleaner
 
clean(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.UnderscoreCleaner
 
clean(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.UnicodeCleaner
 
Cleaner<T> - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner
A cleaner performs a simple transformation over an object.
Cleaner() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.Cleaner
 
cleanText(String) - Static method in class it.cnr.isti.hpc.dexter.spot.clean.SpotManager
Uses the standard cleaner to clean a given text
clearIndex() - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
Clears the index
clone() - Method in class it.cnr.isti.hpc.dexter.entity.Entity
 
clone() - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Returns a copy of this object
close() - Method in interface it.cnr.isti.hpc.dexter.graph.NodesWriter
 
close() - Method in class it.cnr.isti.hpc.dexter.graph.ram.RamNodes
 
close() - Method in interface it.cnr.isti.hpc.dexter.label.IdToLabelWriter
Closes the writer.
close() - Method in interface it.cnr.isti.hpc.dexter.label.LabelToIdWriter
Closes the writer
close() - Method in class it.cnr.isti.hpc.dexter.label.mapdb.MapDBIdToLabel
 
close() - Method in class it.cnr.isti.hpc.dexter.label.mapdb.MapDBLabelToId
 
close() - Method in class it.cnr.isti.hpc.mapdb.MapDB
 
closest(int, List<Integer>) - Method in class it.cnr.isti.hpc.dexter.spot.ContextExtractor
 
closeWriter() - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
 
collectionSize - Static variable in class it.cnr.isti.hpc.dexter.spot.Spot
 
commit() - Method in class it.cnr.isti.hpc.dexter.graph.ram.RamNodes
 
commit() - Method in class it.cnr.isti.hpc.dexter.label.mapdb.MapDBIdToLabel
 
commit() - Method in class it.cnr.isti.hpc.dexter.label.mapdb.MapDBLabelToId
 
commit() - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
 
commit() - Method in class it.cnr.isti.hpc.mapdb.MapDB
 
CommonnessFilter - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.filter
CommonnessFilter filters out entities with a low probability of being linked to the spot.
CommonnessFilter() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.CommonnessFilter
 
compare(EntityMatch, EntityMatch) - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch.SortByPosition
 
compare(EntityMatch, EntityMatch) - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch.SpotLengthComparator
 
compareTo(Entity) - Method in class it.cnr.isti.hpc.dexter.entity.Entity
 
compareTo(EntityMatch) - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
compareTo(Relatedness) - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
compareTo(SpotMatch) - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
compareTo(PosToken) - Method in class it.cnr.isti.hpc.text.PosToken
 
ContextExtractor - Class in it.cnr.isti.hpc.dexter.spot
A ContextExtractor extracts the context of a spot (i.e., the text around the spot).
ContextExtractor() - Constructor for class it.cnr.isti.hpc.dexter.spot.ContextExtractor
 
ContextExtractor(Field) - Constructor for class it.cnr.isti.hpc.dexter.spot.ContextExtractor
 
ContextExtractor(String) - Constructor for class it.cnr.isti.hpc.dexter.spot.ContextExtractor
 
copy() - Method in class it.cnr.isti.hpc.dexter.relatedness.MilneRelatedness
 
copy() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
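The cleaner entries in this section (SpotManager.add, SpotManager.clean, Cleaner<T>) describe a pipeline in which each added cleaner performs one simple transformation on the text. The following is a minimal sketch of that idea only. CleanPipelineSketch and the three lambdas are hypothetical stand-ins, not Dexter classes, and the behavior assumed for underscore handling, stripping and lower-casing is a guess based on the cleaner names.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.function.UnaryOperator;

/** Hypothetical stand-in for a cleaning pipeline; not Dexter's SpotManager. */
public class CleanPipelineSketch {
    // Cleaners are applied in the order in which they were added.
    private final List<UnaryOperator<String>> cleaners = new ArrayList<>();

    /** Appends a cleaner to the pipeline and returns this object for chaining. */
    public CleanPipelineSketch add(UnaryOperator<String> cleaner) {
        cleaners.add(cleaner);
        return this;
    }

    /** Runs the text through every cleaner added so far, and only those. */
    public String clean(String text) {
        String current = text;
        for (UnaryOperator<String> cleaner : cleaners) {
            current = cleaner.apply(current);
        }
        return current;
    }

    public static void main(String[] args) {
        CleanPipelineSketch pipeline = new CleanPipelineSketch()
            .add(s -> s.replace('_', ' '))          // assumed underscore handling
            .add(String::trim)                      // assumed strip behavior
            .add(s -> s.toLowerCase(Locale.ROOT));  // assumed lower-casing
        System.out.println(pipeline.clean("  Pablo_Picasso ")); // prints "pablo picasso"
    }
}
```

Because the cleaners run in insertion order, the order of the add calls changes the result whenever one cleaner's output affects another's input.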
 

D

DB - Static variable in class it.cnr.isti.hpc.dexter.label.mapdb.MapDBInstance
 
decode(String) - Method in class it.cnr.isti.hpc.dexter.graph.Node.Parser
 
decode(String) - Method in class it.cnr.isti.hpc.dexter.spot.Spot.Parser
 
decode(String) - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotFrequencyParser
 
decode(String) - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTargetParser
 
decode(String) - Method in class it.cnr.isti.hpc.dexter.util.TitleRedirectId.Parser
 
Dexter - Class in it.cnr.isti.hpc.dexter
Implements a standard entity linker: given a document, the text is processed with a Spotter that returns the detected candidate mentions as a SpotMatchList.
Dexter() - Constructor for class it.cnr.isti.hpc.dexter.Dexter
 
DexterCLI - Class in it.cnr.isti.hpc.dexter.cli
Given a file containing plain text, prints to stdout the entities detected by the Dexter tagger.
DexterCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.DexterCLI
 
DexterException - Exception in it.cnr.isti.hpc.dexter.util
 
DexterException() - Constructor for exception it.cnr.isti.hpc.dexter.util.DexterException
 
DexterException(String) - Constructor for exception it.cnr.isti.hpc.dexter.util.DexterException
 
DictionarySpotter - Class in it.cnr.isti.hpc.dexter.spotter
Spotter
DictionarySpotter() - Constructor for class it.cnr.isti.hpc.dexter.spotter.DictionarySpotter
 
disableStopwords() - Method in class it.cnr.isti.hpc.text.Text
 
disambiguate(SpotMatchList) - Method in interface it.cnr.isti.hpc.dexter.disambiguation.Disambiguator
Given a list of matched spots, where each spot has one or more candidate entities, returns a list of entities, so that each spot can have at most one entity.
disambiguate(SpotMatchList) - Method in class it.cnr.isti.hpc.dexter.disambiguation.TopScoreEntityDisambiguator
 
Disambiguator - Interface in it.cnr.isti.hpc.dexter.disambiguation
A disambiguator is one of the main components of an entity tagger.
Document - Class in it.cnr.isti.hpc.dexter.document
The Document class defines a document as a set of fields.
Document() - Constructor for class it.cnr.isti.hpc.dexter.document.Document
 
dump() - Static method in class it.cnr.isti.hpc.dexter.spot.ram.SpotMinimalPerfectHash
 
dumpEliasFanoFile() - Static method in class it.cnr.isti.hpc.dexter.spot.ram.SpotEliasFanoOffsets
 
dumpKeys(String) - Method in class it.cnr.isti.hpc.dexter.spot.ram.SpotMinimalPerfectHash
 
dumpSpotFile(String) - Static method in class it.cnr.isti.hpc.dexter.spot.ram.RamSpotFile
 
dumpWikiIdToLuceneId() - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
Dumps the map containing the conversion from the Wikipedia ids to the Lucene Ids.
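
The disambiguate entry in this section states that each spot starts with one or more candidate entities and ends with at most one. A minimal sketch of one way to satisfy that contract, by keeping the highest-scoring candidate per spot, is shown here. TopScoreSketch and its map-based input are hypothetical and do not reflect Dexter's SpotMatchList or EntityMatchList types. Whether TopScoreEntityDisambiguator works exactly this way is an assumption drawn from its name.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical sketch of top-score disambiguation; not Dexter's implementation. */
public class TopScoreSketch {
    /**
     * For each spot, keeps only the candidate entity id with the highest score.
     * Input: spot text -> (candidate entity id -> score).
     */
    public static Map<String, Integer> disambiguate(Map<String, Map<Integer, Double>> candidates) {
        Map<String, Integer> best = new LinkedHashMap<>();
        for (Map.Entry<String, Map<Integer, Double>> spot : candidates.entrySet()) {
            Integer bestId = null;
            double bestScore = Double.NEGATIVE_INFINITY;
            for (Map.Entry<Integer, Double> candidate : spot.getValue().entrySet()) {
                if (candidate.getValue() > bestScore) {
                    bestScore = candidate.getValue();
                    bestId = candidate.getKey();
                }
            }
            if (bestId != null) { // a spot without candidates yields no entity
                best.put(spot.getKey(), bestId);
            }
        }
        return best;
    }

    public static void main(String[] args) {
        Map<Integer, Double> picasso = new LinkedHashMap<>();
        picasso.put(24176, 0.9); // ids and scores are illustrative only
        picasso.put(111, 0.1);
        Map<String, Map<Integer, Double>> candidates = new LinkedHashMap<>();
        candidates.put("picasso", picasso);
        System.out.println(disambiguate(candidates)); // prints {picasso=24176}
    }
}
```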

E

encode(Node) - Method in class it.cnr.isti.hpc.dexter.graph.Node.Parser
 
encode(Spot) - Method in class it.cnr.isti.hpc.dexter.spot.Spot.Parser
 
encode(SpotReader.SpotFrequency) - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotFrequencyParser
 
encode(SpotReader.SpotSrcTarget) - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTargetParser
 
encode(TitleRedirectId) - Method in class it.cnr.isti.hpc.dexter.util.TitleRedirectId.Parser
 
end - Variable in class it.cnr.isti.hpc.dexter.shingle.Shingle
end position in the original text
entities - Variable in class it.cnr.isti.hpc.dexter.spot.Spot
 
entities - Variable in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
Entity - Class in it.cnr.isti.hpc.dexter.entity
The entity is the core concept of an entity linking system.
Entity(int, int) - Constructor for class it.cnr.isti.hpc.dexter.entity.Entity
Builds an entity with the given identifier and frequency.
Entity(int) - Constructor for class it.cnr.isti.hpc.dexter.entity.Entity
Builds an entity with the given identifier.
entityIndex(int) - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatchList
 
EntityMatch - Class in it.cnr.isti.hpc.dexter.entity
EntityMatch contains the confidence score of an entity associated with a spot.
EntityMatch(Entity, double, SpotMatch) - Constructor for class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
EntityMatch(int, double, SpotMatch) - Constructor for class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
EntityMatch.SortByPosition - Class in it.cnr.isti.hpc.dexter.entity
 
EntityMatch.SortByPosition() - Constructor for class it.cnr.isti.hpc.dexter.entity.EntityMatch.SortByPosition
 
EntityMatch.SpotLengthComparator - Class in it.cnr.isti.hpc.dexter.entity
 
EntityMatch.SpotLengthComparator() - Constructor for class it.cnr.isti.hpc.dexter.entity.EntityMatch.SpotLengthComparator
 
EntityMatchList - Class in it.cnr.isti.hpc.dexter.entity
EntityMatchList represents a list of EntityMatch
EntityMatchList() - Constructor for class it.cnr.isti.hpc.dexter.entity.EntityMatchList
 
EntityRanker - Class in it.cnr.isti.hpc.dexter.entity
EntityRanker assigns a score to the entities retrieved for a spot.
EntityRanker(Field) - Constructor for class it.cnr.isti.hpc.dexter.entity.EntityRanker
 
equals(Object) - Method in class it.cnr.isti.hpc.dexter.entity.Entity
 
equals(Object) - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
equals(Object) - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
equals(Object) - Method in class it.cnr.isti.hpc.dexter.spot.Spot
 
equals(Object) - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
equals(Object) - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTarget
 
equals(Object) - Method in class it.cnr.isti.hpc.text.PosToken
 
eval(T, Pipe<T>.OutputCollector) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.Cleaner
 
eval(T, Pipe<T>.OutputCollector) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.Filter
 
eval(T, Pipe<T>.OutputCollector) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.Function
Performs the manipulation; any outputs produced by the function are emitted through the collector object.
eval(T, Pipe<T>.OutputCollector) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.mapper.Mapper
 
eval(T, Pipe<T>.OutputCollector) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.NopFunction
 
ExportArticlesIdCLI - Class in it.cnr.isti.hpc.dexter.cli.label
ExportArticlesIdCLI takes the JSON dump of Wikipedia and creates a file containing the mapping from each article title (the label) to an int (used to represent the entity in the framework).
ExportArticlesIdCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.label.ExportArticlesIdCLI
 
ExtractSpotsCLI - Class in it.cnr.isti.hpc.dexter.cli.spot
Retrieves all the titles and anchors from the Wikipedia articles; considers only articles, redirects, templates and categories.
ExtractSpotsCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.spot.ExtractSpotsCLI
 

F

Field - Class in it.cnr.isti.hpc.dexter.document
A field is the basic unit being part of a Document object.
Field(String, String) - Constructor for class it.cnr.isti.hpc.dexter.document.Field
 
FIELD - Static variable in class it.cnr.isti.hpc.dexter.spot.ram.SpotEliasFanoOffsets.OffsetsFileIterator
 
field - Variable in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
fieldName - Variable in class it.cnr.isti.hpc.dexter.document.FlatDocument
 
fields - Variable in class it.cnr.isti.hpc.dexter.document.MultifieldDocument
 
Filter<T> - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.filter
A Filter removes a given spot if it does not satisfy the filter constraint.
Filter() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.Filter
 
FlatDocument - Class in it.cnr.isti.hpc.dexter.document
A FlatDocument is a Document with only one Field.
FlatDocument() - Constructor for class it.cnr.isti.hpc.dexter.document.FlatDocument
 
FlatDocument(String) - Constructor for class it.cnr.isti.hpc.dexter.document.FlatDocument
FlatDocument built with the "text" value as its only field.
FlatDocument(File) - Constructor for class it.cnr.isti.hpc.dexter.document.FlatDocument
FlatDocument built with the content taken from a file.
freq - Variable in class it.cnr.isti.hpc.dexter.spot.Spot
 
FrequencyFilter - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.filter
FrequencyFilter removes spots that are not associated with any entity.
FrequencyFilter() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.FrequencyFilter
 
fromByteArray(String, byte[]) - Static method in class it.cnr.isti.hpc.dexter.spot.Spot
Decodes a Spot from a byte representation, if the given text matches the spot text encoded in the byte array.
fromTsvLine(String) - Static method in class it.cnr.isti.hpc.dexter.spot.Spot
Decodes a tab separated representation of a spot.
Function<T> - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe
A function performs a single manipulation over an object within a pipeline.
Function() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.Function
 

G

GenerateSpotDocumentFrequencyCLI - Class in it.cnr.isti.hpc.dexter.cli.spot
Takes a file containing a list of spots (ordered lexicographically) and generates a new file containing, for each spot, its document frequency (the number of articles in Wikipedia containing the spot as raw text or anchor).
GenerateSpotDocumentFrequencyCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.spot.GenerateSpotDocumentFrequencyCLI
 
GenerateSpotsMinimalPerfectHashCLI - Class in it.cnr.isti.hpc.dexter.cli.spot.ram
Takes as input a file containing the spots (one per line), generates the minimal perfect hash for each spot, and stores it in ${ram.spot.perfect.hash}.
GenerateSpotsMinimalPerfectHashCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.spot.ram.GenerateSpotsMinimalPerfectHashCLI
 
getAllSpots(Article) - Method in class it.cnr.isti.hpc.dexter.spot.clean.SpotManager
Given a Wikipedia article, returns a set containing all the processed anchors in the article.
getArticle(int) - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
Retrieves an article from the index
GetArticleIdCLI - Class in it.cnr.isti.hpc.dexter.cli
Returns an article's integer identifier given its label.
GetArticleIdCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.GetArticleIdCLI
 
GetArticleLabelCLI - Class in it.cnr.isti.hpc.dexter.cli
Returns the article label given its integer identifier.
GetArticleLabelCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.GetArticleLabelCLI
 
getArticleSummary(int) - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
Retrieves only the article summary and the title from the index
getCleanAsciiText(Article) - Method in class it.cnr.isti.hpc.dexter.spot.ArticleContextExtractor
 
getCollection(String) - Method in class it.cnr.isti.hpc.mapdb.MapDB
 
getCollections() - Method in class it.cnr.isti.hpc.mapdb.MapDB
 
getCollectionSize() - Static method in class it.cnr.isti.hpc.dexter.spot.Spot
Get the number of entities in the collection
getCommonness() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
getContent() - Method in class it.cnr.isti.hpc.dexter.document.Document
Gets the content of the document, appending the content of the different fields belonging to it.
getContent() - Method in class it.cnr.isti.hpc.dexter.document.MultifieldDocument
Gets the content of the document, appending the content of the different fields belonging to it.
getContext(String) - Method in class it.cnr.isti.hpc.dexter.spot.ContextExtractor
 
getCosineSimilarity(int, int) - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
Returns the cosine similarity between two documents
getCosineSimilarity(int, int, String) - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
Returns the cosine similarity between two documents
getDb(String) - Static method in class it.cnr.isti.hpc.mapdb.MapDB
 
getDb(String, String) - Static method in class it.cnr.isti.hpc.mapdb.MapDB
 
getDexterLuceneHelper() - Static method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
Returns an instance of Dexter's Lucene index.
getDisambiguator(String) - Method in class it.cnr.isti.hpc.dexter.plugin.PluginLoader
 
getEnd() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
getEnd() - Method in class it.cnr.isti.hpc.dexter.shingle.Shingle
 
getEnd() - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
getEnd() - Method in class it.cnr.isti.hpc.text.Sentence
 
getEnd() - Method in class it.cnr.isti.hpc.text.Token
 
getEntities() - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Returns the list of entities associated with the current mention.
getEntities() - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
getEntities() - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatchList
 
getEntity() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
getEntityCommonness(Entity) - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Computes the entity commonness for a given entity and this spot, i.e., P(e|s), the probability for an entity to be associated with this spot.
getEntityCommonness(Entity) - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
getEntityFrequency() - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTarget
 
getField(String) - Method in class it.cnr.isti.hpc.dexter.document.Document
Returns the field with the given name if it exists in this document, or null.
getField(String) - Method in class it.cnr.isti.hpc.dexter.document.MultifieldDocument
Returns the field with the given name if it exists in this document, or null.
getFields() - Method in class it.cnr.isti.hpc.dexter.document.Document
Returns an iterator over the fields being part of this document.
getFields() - Method in class it.cnr.isti.hpc.dexter.document.MultifieldDocument
Returns an iterator over the fields being part of this document.
getFreq(String, String) - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
 
getFreq(String) - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
 
getFreq() - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotFrequency
 
getFrequency() - Method in class it.cnr.isti.hpc.dexter.entity.Entity
 
getFrequency() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
getFrequency() - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Returns the document frequency of this spot, i.e., how many Wikipedia articles contain the spot as simple text or anchor text.
getFrequency() - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
getId() - Method in class it.cnr.isti.hpc.dexter.entity.Entity
 
getId() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
getId(String) - Method in class it.cnr.isti.hpc.dexter.label.IdHelper
 
getId(String) - Method in interface it.cnr.isti.hpc.dexter.label.LabelToId
 
getId(String) - Method in class it.cnr.isti.hpc.dexter.label.mapdb.MapDBLabelToId
 
getId() - Method in class it.cnr.isti.hpc.dexter.util.TitleRedirectId
 
getIdf() - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Returns the inverse document frequency of this mention.
getIdHelper(IdHelperFactory.Type) - Static method in class it.cnr.isti.hpc.dexter.label.IdHelperFactory
 
getIdToLabelWriter(IdHelperFactory.Type) - Static method in class it.cnr.isti.hpc.dexter.label.IdHelperFactory
 
getIncoming(int) - Method in class it.cnr.isti.hpc.dexter.graph.ram.RamIncomingNodes
 
getIncomingNodes(NodeFactory.Type) - Static method in class it.cnr.isti.hpc.dexter.graph.NodeFactory
 
getIncomingNodeWriter(NodeFactory.Type) - Static method in class it.cnr.isti.hpc.dexter.graph.NodeFactory
 
getInstance() - Static method in class it.cnr.isti.hpc.dexter.graph.ram.RamIncomingNodes
 
getInstance() - Static method in class it.cnr.isti.hpc.dexter.graph.ram.RamOutcomingNodes
 
getInstance() - Static method in class it.cnr.isti.hpc.dexter.label.mapdb.MapDBIdToLabel
 
getInstance() - Static method in class it.cnr.isti.hpc.dexter.label.mapdb.MapDBLabelToId
 
getInstance() - Static method in class it.cnr.isti.hpc.dexter.spot.ram.RamSpotFile
 
getInstance() - Static method in class it.cnr.isti.hpc.dexter.spot.ram.SpotEliasFanoOffsets
 
getInstance() - Static method in class it.cnr.isti.hpc.dexter.spot.ram.SpotMinimalPerfectHash
 
getInstance(String) - Method in class it.cnr.isti.hpc.dexter.spot.repo.SpotRepositoryFactory
Returns an instance of a SpotRepository
getInstance() - Static method in class it.cnr.isti.hpc.text.PosTagger
 
getInstance() - Static method in class it.cnr.isti.hpc.text.SentenceSegmenter
 
getInstance() - Static method in class it.cnr.isti.hpc.text.TokenSegmenter
 
getIntersection(String, String) - Static method in class it.cnr.isti.hpc.text.Text
 
getLabel(Integer) - Method in class it.cnr.isti.hpc.dexter.label.IdHelper
 
getLabel(Integer) - Method in interface it.cnr.isti.hpc.dexter.label.IdToLabel
 
getLabel(Integer) - Method in class it.cnr.isti.hpc.dexter.label.mapdb.MapDBIdToLabel
 
getLabels(List<Integer>) - Method in class it.cnr.isti.hpc.dexter.label.IdHelper
 
getLabelToIdWriter(IdHelperFactory.Type) - Static method in class it.cnr.isti.hpc.dexter.label.IdHelperFactory
 
getLink() - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Returns how many times the mention occurs in the collection as a link to an entity.
getLinkFrequency() - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
getLinkProbability() - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Returns the probability for the current mention to be a link.
getLinkProbability() - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
getLong(String) - Method in class it.cnr.isti.hpc.dexter.spot.ram.SpotMinimalPerfectHash
 
getLuceneId(int) - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
 
getMaxShingleSize() - Method in class it.cnr.isti.hpc.dexter.shingle.ShingleExtractor
 
getMention() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
getMention() - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Returns the text of this mention.
getMention() - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
getName() - Method in class it.cnr.isti.hpc.dexter.document.Field
 
getName() - Method in class it.cnr.isti.hpc.dexter.relatedness.MilneRelatedness
 
getName() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
getNames() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
getNeighbourNames() - Method in class it.cnr.isti.hpc.dexter.graph.Node
 
GetNeighbourNodesCLI - Class in it.cnr.isti.hpc.dexter.cli.graph
Given an entity ID or an entity label as input and a direction (in/out), returns the entities linking to the given entity (in) or the entities linked from the given entity (out).
GetNeighbourNodesCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.graph.GetNeighbourNodesCLI
 
getNeighbours() - Method in class it.cnr.isti.hpc.dexter.graph.Node
 
getNeighbours(int) - Method in interface it.cnr.isti.hpc.dexter.graph.NodeStar
Returns the neighbors of the node id.
getNeighbours(int) - Method in class it.cnr.isti.hpc.dexter.graph.ram.RamNodes
 
getNode() - Method in class it.cnr.isti.hpc.dexter.graph.Node
 
getNode(int) - Method in interface it.cnr.isti.hpc.dexter.graph.NodeStar
Returns a node description.
getNode(int) - Method in class it.cnr.isti.hpc.dexter.graph.ram.RamNodes
 
getOffset(long, long) - Method in class it.cnr.isti.hpc.dexter.spot.ram.RamSpotFile
 
getOffset(long) - Method in class it.cnr.isti.hpc.dexter.spot.ram.SpotEliasFanoOffsets
 
getOutcoming(int) - Method in class it.cnr.isti.hpc.dexter.graph.ram.RamOutcomingNodes
 
getOutcomingIds(Article) - Method in class it.cnr.isti.hpc.dexter.label.IdHelper
 
getOutcomingNodes(NodeFactory.Type) - Static method in class it.cnr.isti.hpc.dexter.graph.NodeFactory
 
getOutcomingNodeWriter(NodeFactory.Type) - Static method in class it.cnr.isti.hpc.dexter.graph.NodeFactory
 
getPos() - Method in class it.cnr.isti.hpc.text.PosToken
 
getProbability() - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
Returns the probability that the text of this spot is a link to an entity. It is computed by dividing the number of documents in Wikipedia containing this spot as an anchor by the number of documents in Wikipedia containing this spot as simple text.
getRedirect() - Method in class it.cnr.isti.hpc.dexter.util.TitleRedirectId
 
getRelatedness(String) - Method in class it.cnr.isti.hpc.dexter.plugin.PluginLoader
 
getRelatedness(int, int) - Method in class it.cnr.isti.hpc.dexter.relatedness.RelatednessFactory
 
GetRelatednessCLI - Class in it.cnr.isti.hpc.dexter.cli
Get relatedness between two entities
GetRelatednessCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.GetRelatednessCLI
 
getResults() - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.Pipe
 
getScore() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
getScore() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
getScore(int, int) - Method in class it.cnr.isti.hpc.dexter.relatedness.RelatednessFactory
 
getSimilarity(Query, int) - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
Returns the TFIDF-similarity between a given string and an article
getSpot() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
getSpot(String) - Method in class it.cnr.isti.hpc.dexter.spot.ram.RamSpotRepository
 
getSpot(String) - Method in interface it.cnr.isti.hpc.dexter.spot.repo.SpotRepository
Given a piece of text, returns a Spot if the text is associated with one or more entities.
getSpot() - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
getSpot() - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotFrequency
 
getSpot() - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTarget
 
getSpotLinkProbability() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
getSpotter(String) - Method in class it.cnr.isti.hpc.dexter.plugin.PluginLoader
 
getSrc() - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTarget
 
getStandardSpotCleaner() - Static method in class it.cnr.isti.hpc.dexter.spot.clean.SpotManager
Returns a StandardSpotCleaner used by Dexter to clean the anchors.
getStandardSpotManager() - Static method in class it.cnr.isti.hpc.dexter.spot.clean.SpotManager
Returns a StandardSpotManager used by Dexter to process the anchors.
getStart() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
getStart() - Method in class it.cnr.isti.hpc.dexter.shingle.Shingle
 
getStart() - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
getStart() - Method in class it.cnr.isti.hpc.text.Sentence
 
getStart() - Method in class it.cnr.isti.hpc.text.Token
 
getStdIdHelper() - Static method in class it.cnr.isti.hpc.dexter.label.IdHelperFactory
Returns the standard id helper; you probably want to use this.
getStdIdToLabelWriter() - Static method in class it.cnr.isti.hpc.dexter.label.IdHelperFactory
 
getStdInstance() - Method in class it.cnr.isti.hpc.dexter.spot.repo.SpotRepositoryFactory
Returns the standard instance of the SpotRepository, defined in the Dexter property file (spot.repository).
getStdLabelToIdWriter() - Static method in class it.cnr.isti.hpc.dexter.label.IdHelperFactory
 
getTagger(String) - Method in class it.cnr.isti.hpc.dexter.plugin.PluginLoader
 
getTarget() - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTarget
 
getTerms() - Method in class it.cnr.isti.hpc.text.Text
 
getTermsSet() - Method in class it.cnr.isti.hpc.text.Text
 
getText() - Method in class it.cnr.isti.hpc.dexter.shingle.Shingle
 
getText() - Method in class it.cnr.isti.hpc.text.Sentence
 
getText() - Method in class it.cnr.isti.hpc.text.Token
 
getTitle() - Method in class it.cnr.isti.hpc.dexter.util.TitleRedirectId
 
getToken() - Method in class it.cnr.isti.hpc.text.PosToken
 
getValue() - Method in class it.cnr.isti.hpc.dexter.document.Field
 
getVerbs(String) - Method in class it.cnr.isti.hpc.text.PosTagger
 
getWikiId(int) - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
 
getWindowSize() - Method in class it.cnr.isti.hpc.dexter.spot.ContextExtractor
 
getX() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
getY() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
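
Two statistics described in this section lend themselves to a short worked example: the link probability from SpotMatch.getProbability (documents containing the spot as an anchor divided by documents containing it as simple text) and the entity commonness P(e|s) from Spot.getEntityCommonness. The sketch below is hypothetical. SpotStatsSketch is not a Dexter class, and the commonness formula (links from the spot to the entity divided by all links from the spot) is an assumption, since the index gives only the definition as a probability.

```java
/** Hypothetical helpers; formulas follow the index descriptions, not Dexter's source. */
public class SpotStatsSketch {
    /**
     * Link probability: documents containing the spot as an anchor divided by
     * documents containing the spot as simple text.
     */
    public static double linkProbability(int anchorDocs, int textDocs) {
        if (textDocs <= 0) {
            return 0.0; // assumed convention for a spot that never occurs as text
        }
        return (double) anchorDocs / textDocs;
    }

    /**
     * Commonness P(e|s), assumed here to be the number of links from spot s to
     * entity e divided by the number of all links from spot s.
     */
    public static double commonness(int linksToEntity, int linksFromSpot) {
        if (linksFromSpot <= 0) {
            return 0.0; // assumed convention for a spot that is never a link
        }
        return (double) linksToEntity / linksFromSpot;
    }

    public static void main(String[] args) {
        // Counts are made up for illustration.
        System.out.println(linkProbability(50, 200)); // prints 0.25
        System.out.println(commonness(30, 40));       // prints 0.75
    }
}
```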
 

H

hasCollection(String) - Method in class it.cnr.isti.hpc.mapdb.MapDB
 
hasDexterLuceneIndex() - Static method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
 
hash(String) - Method in class it.cnr.isti.hpc.dexter.spot.ram.SpotMinimalPerfectHash
 
hashCode() - Method in class it.cnr.isti.hpc.dexter.entity.Entity
 
hashCode() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
hashCode() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
hashCode() - Method in class it.cnr.isti.hpc.dexter.spot.Spot
 
hashCode() - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
hashCode() - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTarget
 
hashCode() - Method in class it.cnr.isti.hpc.text.PosToken
 
hasLabel(String) - Method in class it.cnr.isti.hpc.dexter.label.IdHelper
 
hasNegativeScores() - Method in class it.cnr.isti.hpc.dexter.relatedness.MilneRelatedness
 
hasNegativeScores() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
hasNegativeScores() - Method in class it.cnr.isti.hpc.dexter.relatedness.RelatednessFactory
 
hasNext() - Method in class it.cnr.isti.hpc.dexter.spot.ram.SpotEliasFanoOffsets.OffsetsFileIterator
 
hasNext() - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader
 
hasSameSpot(SpotReader.SpotSrcTarget) - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTarget
 
HtmlCleaner - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner
Converts javascript strings to ASCII
HtmlCleaner() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.HtmlCleaner
 

I

idf - Variable in class it.cnr.isti.hpc.dexter.spot.Spot
 
IdHelper - Class in it.cnr.isti.hpc.dexter.label
An IdHelper provides the conversion between an entity label (e.g., Pablo_Picasso) and an integer representing the entity (e.g., 24176).
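The two-way conversion described above can be pictured with a pair of in-memory maps. This is only a sketch: the class LabelIdIndex, its method names, and the NO_ID sentinel value are assumptions made for illustration and are not taken from the Dexter sources.

```java
import java.util.HashMap;
import java.util.Map;

class LabelIdIndex {
    // Hypothetical sentinel returned for unknown labels (value chosen for this sketch).
    static final int NO_ID = -1;

    private final Map<String, Integer> labelToId = new HashMap<>();
    private final Map<Integer, String> idToLabel = new HashMap<>();

    // Registers the mapping in both directions.
    void add(String label, int id) {
        labelToId.put(label, id);
        idToLabel.put(id, label);
    }

    // Returns the integer id of a label, or NO_ID when the label is unknown.
    int getId(String label) {
        return labelToId.getOrDefault(label, NO_ID);
    }

    // Returns the label of an id, or null when the id is unknown.
    String getLabel(int id) {
        return idToLabel.get(id);
    }
}
```

With the example from the description, add("Pablo_Picasso", 24176) makes getId("Pablo_Picasso") return 24176 and getLabel(24176) return "Pablo_Picasso".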
IdHelper(IdToLabel, LabelToId) - Constructor for class it.cnr.isti.hpc.dexter.label.IdHelper
 
IdHelperFactory - Class in it.cnr.isti.hpc.dexter.label
Generates an IdHelper that takes care of converting the entity labels (i.e., the titles of the Wikipedia pages containing the description of the entity) into the integer identifiers used for internal processing.
IdHelperFactory() - Constructor for class it.cnr.isti.hpc.dexter.label.IdHelperFactory
 
IdHelperFactory.Type - Enum in it.cnr.isti.hpc.dexter.label
 
IdToLabel - Interface in it.cnr.isti.hpc.dexter.label
IdToLabel stores the map wiki-id -> label
IdToLabelWriter - Interface in it.cnr.isti.hpc.dexter.label
IdToLabelWriter takes care of writing the mapping id -> label in a persistent format.
ImageFilter - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.filter
Deprecated.
depends on language
ImageFilter() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.ImageFilter
Deprecated.
 
IncomingNodes - Interface in it.cnr.isti.hpc.dexter.graph
Represents a collection of nodes, where for each node the list of neighbors represents the list of nodes that link to or are linked by the given node.
incrementEntityFrequency() - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTarget
 
index(Spot) - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatchList
 
IndexIdToLabelCLI - Class in it.cnr.isti.hpc.dexter.cli.label
Takes a file containing a list of TitleRedirectId and indexes the mapping id -> title .
IndexIdToLabelCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.label.IndexIdToLabelCLI
 
IndexIncomingNodesCLI - Class in it.cnr.isti.hpc.dexter.cli.graph
IndexIncomingNodesCLI generates the binary file containing the incoming entities.
IndexIncomingNodesCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.graph.IndexIncomingNodesCLI
 
IndexLabelToIdCLI - Class in it.cnr.isti.hpc.dexter.cli.label
Takes a file containing a list of TitleRedirectId and indexes the mapping title -> id .
IndexLabelToIdCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.label.IndexLabelToIdCLI
 
IndexOffsetsUsingEliasFanoCLI - Class in it.cnr.isti.hpc.dexter.cli.spot.ram
Encodes the offsets file with Elias-Fano
IndexOffsetsUsingEliasFanoCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.spot.ram.IndexOffsetsUsingEliasFanoCLI
 
IndexOutcomingNodesCLI - Class in it.cnr.isti.hpc.dexter.cli.graph
IndexOutcomingNodesCLI generates the binary file containing the outcoming entities.
IndexOutcomingNodesCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.graph.IndexOutcomingNodesCLI
 
IndexSpotFileAndGenerateOffsetsCLI - Class in it.cnr.isti.hpc.dexter.cli.spot.ram
Takes the file containing the spots (sorted by minimal perfect hash) and generates the compressed spot file, plus an offsets file that records, for each spot, the position of its data in the compressed spot file.
IndexSpotFileAndGenerateOffsetsCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.spot.ram.IndexSpotFileAndGenerateOffsetsCLI
 
IndexWikipediaOnLuceneCLI - Class in it.cnr.isti.hpc.dexter.cli.index
IndexWikipediaOnLuceneCLI takes the Wikipedia dump (in JSON) and indexes it with Lucene.
IndexWikipediaOnLuceneCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.index.IndexWikipediaOnLuceneCLI
 
inIntersection() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
init(String) - Method in class it.cnr.isti.hpc.dexter.spot.ContextExtractor
 
init() - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader
 
inoutX() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
inoutY() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
IntArrayString - Class in it.cnr.isti.hpc.text
Encodes a list of integers as a string containing the integers in hexadecimal (optionally compressed)
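A minimal sketch of this kind of hexadecimal encoding is shown here. The class name HexIntCodec, the single-space separator, and the method names are assumptions for illustration; the actual string layout used by IntArrayString may differ, and the optional compression is not covered.

```java
import java.util.StringJoiner;

class HexIntCodec {
    // Encodes each integer as lowercase hexadecimal, separated by single spaces.
    static String encode(int[] values) {
        StringJoiner joiner = new StringJoiner(" ");
        for (int v : values) {
            joiner.add(Integer.toHexString(v));
        }
        return joiner.toString();
    }

    // Decodes a string produced by encode back into the original integers.
    static int[] decode(String encoded) {
        if (encoded.isEmpty()) {
            return new int[0];
        }
        String[] parts = encoded.split(" ");
        int[] values = new int[parts.length];
        for (int i = 0; i < parts.length; i++) {
            // parseUnsignedInt accepts the 8-digit form toHexString emits for negatives.
            values[i] = Integer.parseUnsignedInt(parts[i], 16);
        }
        return values;
    }
}
```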
IntArrayString() - Constructor for class it.cnr.isti.hpc.text.IntArrayString
 
IntArrayString(boolean) - Constructor for class it.cnr.isti.hpc.text.IntArrayString
 
intersection(int[], int[], int, int) - Static method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
intersectionSize(int[], int[]) - Method in class it.cnr.isti.hpc.dexter.relatedness.MilneRelatedness
 
intersectionSize(int[], int[], int, int) - Static method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
isAbjective() - Method in class it.cnr.isti.hpc.text.PosToken
 
isAdverb() - Method in class it.cnr.isti.hpc.text.PosToken
 
isConjunction() - Method in class it.cnr.isti.hpc.text.PosToken
 
isDisambiguation(int) - Static method in class it.cnr.isti.hpc.dexter.entity.Entity
 
isDisambiguation(Integer) - Method in class it.cnr.isti.hpc.dexter.label.IdHelper
 
isEmpty() - Method in class it.cnr.isti.hpc.dexter.shingle.Shingle
 
isEmpty() - Method in class it.cnr.isti.hpc.text.Sentence
 
isEmpty() - Method in class it.cnr.isti.hpc.text.Token
 
isFilter(String) - Method in class it.cnr.isti.hpc.dexter.spot.clean.SpotManager
 
isFilter(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.AsciiFilter
 
isFilter(Spot) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.CommonnessFilter
 
isFilter(T) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.Filter
Returns true if the given spot does not respect the filter constraint.
isFilter(Spot) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.FrequencyFilter
 
isFilter(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.ImageFilter
Deprecated.
 
isFilter(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.LengthFilter
 
isFilter(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.LongSpotFilter
 
isFilter(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.NumberFilter
 
isFilter(Spot) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.ProbabilityFilter
 
isFilter(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.SymbolFilter
 
isFilter(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.TemplateFilter
Deprecated.
 
isPureAscii(String) - Static method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.AsciiFilter
 
isRedirect() - Method in class it.cnr.isti.hpc.dexter.util.TitleRedirectId
 
isVerb() - Method in class it.cnr.isti.hpc.text.PosToken
 
it.cnr.isti.hpc.dexter - package it.cnr.isti.hpc.dexter
 
it.cnr.isti.hpc.dexter.cli - package it.cnr.isti.hpc.dexter.cli
 
it.cnr.isti.hpc.dexter.cli.graph - package it.cnr.isti.hpc.dexter.cli.graph
 
it.cnr.isti.hpc.dexter.cli.index - package it.cnr.isti.hpc.dexter.cli.index
 
it.cnr.isti.hpc.dexter.cli.label - package it.cnr.isti.hpc.dexter.cli.label
 
it.cnr.isti.hpc.dexter.cli.spot - package it.cnr.isti.hpc.dexter.cli.spot
 
it.cnr.isti.hpc.dexter.cli.spot.ram - package it.cnr.isti.hpc.dexter.cli.spot.ram
 
it.cnr.isti.hpc.dexter.disambiguation - package it.cnr.isti.hpc.dexter.disambiguation
 
it.cnr.isti.hpc.dexter.document - package it.cnr.isti.hpc.dexter.document
 
it.cnr.isti.hpc.dexter.entity - package it.cnr.isti.hpc.dexter.entity
 
it.cnr.isti.hpc.dexter.graph - package it.cnr.isti.hpc.dexter.graph
 
it.cnr.isti.hpc.dexter.graph.ram - package it.cnr.isti.hpc.dexter.graph.ram
 
it.cnr.isti.hpc.dexter.label - package it.cnr.isti.hpc.dexter.label
 
it.cnr.isti.hpc.dexter.label.mapdb - package it.cnr.isti.hpc.dexter.label.mapdb
 
it.cnr.isti.hpc.dexter.lucene - package it.cnr.isti.hpc.dexter.lucene
 
it.cnr.isti.hpc.dexter.plugin - package it.cnr.isti.hpc.dexter.plugin
 
it.cnr.isti.hpc.dexter.relatedness - package it.cnr.isti.hpc.dexter.relatedness
 
it.cnr.isti.hpc.dexter.shingle - package it.cnr.isti.hpc.dexter.shingle
 
it.cnr.isti.hpc.dexter.spot - package it.cnr.isti.hpc.dexter.spot
 
it.cnr.isti.hpc.dexter.spot.clean - package it.cnr.isti.hpc.dexter.spot.clean
 
it.cnr.isti.hpc.dexter.spot.cleanpipe - package it.cnr.isti.hpc.dexter.spot.cleanpipe
 
it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner - package it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner
 
it.cnr.isti.hpc.dexter.spot.cleanpipe.filter - package it.cnr.isti.hpc.dexter.spot.cleanpipe.filter
 
it.cnr.isti.hpc.dexter.spot.cleanpipe.mapper - package it.cnr.isti.hpc.dexter.spot.cleanpipe.mapper
 
it.cnr.isti.hpc.dexter.spot.ram - package it.cnr.isti.hpc.dexter.spot.ram
 
it.cnr.isti.hpc.dexter.spot.repo - package it.cnr.isti.hpc.dexter.spot.repo
 
it.cnr.isti.hpc.dexter.spotter - package it.cnr.isti.hpc.dexter.spotter
 
it.cnr.isti.hpc.dexter.util - package it.cnr.isti.hpc.dexter.util
 
it.cnr.isti.hpc.mapdb - package it.cnr.isti.hpc.mapdb
 
it.cnr.isti.hpc.text - package it.cnr.isti.hpc.text
 
iterator() - Method in class it.cnr.isti.hpc.dexter.graph.ram.RamNodes
 
iterator() - Method in class it.cnr.isti.hpc.dexter.shingle.ShingleExtractor
 
iterator() - Method in class it.cnr.isti.hpc.dexter.spot.ram.SpotEliasFanoOffsets.OffsetsFile
 

J

JuniorAndInitialsCleaner - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner
JuniorAndInitialsCleaner transforms a name by removing initials or 'jr' (e.g., ted ginn, jr -> ted ginn )
JuniorAndInitialsCleaner() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.JuniorAndInitialsCleaner
 

L

LabelToId - Interface in it.cnr.isti.hpc.dexter.label
LabelToId stores the map label -> id
LabelToIdWriter - Interface in it.cnr.isti.hpc.dexter.label
LabelToIdWriter takes care of writing the mapping label -> wiki-id in a persistent format.
LengthFilter - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.filter
LengthFilter filters out spots shorter than a given length (default is 3).
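As a rough sketch of the convention used by the filters in this package (isFilter returns true when the spot must be discarded), a length-based filter could look as follows. The class name LengthFilterSketch is hypothetical, and measuring length in characters is an assumption.

```java
class LengthFilterSketch {
    private final int minLength;

    // Default minimum length, matching the default stated in the description.
    LengthFilterSketch() {
        this(3);
    }

    LengthFilterSketch(int minLength) {
        this.minLength = minLength;
    }

    // Returns true when the spot is shorter than the minimum and should be dropped.
    boolean isFilter(String spot) {
        return spot.length() < minLength;
    }
}
```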
LengthFilter() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.LengthFilter
 
LengthFilter(int) - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.LengthFilter
 
link - Variable in class it.cnr.isti.hpc.dexter.spot.Spot
 
linkProbability - Variable in class it.cnr.isti.hpc.dexter.spot.Spot
 
load(String) - Static method in class it.cnr.isti.hpc.dexter.spot.ram.RamSpotFile
 
load(File) - Static method in class it.cnr.isti.hpc.dexter.spot.ram.RamSpotFile
 
load(FileInputStream) - Static method in class it.cnr.isti.hpc.dexter.spot.ram.RamSpotFile
 
loadWikiIdToLuceneId() - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
Loads the map containing the conversion from the Wikipedia ids to the Lucene Ids.
LongSpotFilter - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.filter
LongSpotFilter filters out all the spots that contain more than a given number of terms (default is 6).
LongSpotFilter() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.LongSpotFilter
 
LowerCaseCleaner - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner
LowerCaseCleaner converts the text of a spot to lowercase
LowerCaseCleaner() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.LowerCaseCleaner
 
LuceneHelper - Class in it.cnr.isti.hpc.dexter.lucene
LuceneHelper provides utilities for indexing, retrieving, and ranking Wikipedia articles.
LuceneHelper(File, File) - Constructor for class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
Opens or creates a Lucene index in the given directory

M

main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.DexterCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.GetArticleIdCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.GetArticleLabelCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.GetRelatednessCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.graph.GetNeighbourNodesCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.graph.IndexIncomingNodesCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.graph.IndexOutcomingNodesCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.index.IndexWikipediaOnLuceneCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.index.QueryLuceneByIdCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.index.QueryLuceneCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.label.AddRedirectIdCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.label.ExportArticlesIdCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.label.IndexIdToLabelCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.label.IndexLabelToIdCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.spot.ExtractSpotsCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.spot.GenerateSpotDocumentFrequencyCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.spot.ram.GenerateSpotsMinimalPerfectHashCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.spot.ram.IndexOffsetsUsingEliasFanoCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.spot.ram.IndexSpotFileAndGenerateOffsetsCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.spot.SpotCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.cli.spot.WriteOneSpotPerLineCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.dexter.spot.ram.RamSpotRepository
 
map(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.mapper.CityMapper
 
map(T) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.mapper.Mapper
Given a spot, returns several different versions of the spot.
map(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.mapper.QuotesMapper
 
map(String) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.mapper.TypeMapper
Deprecated.
 
MapDB - Class in it.cnr.isti.hpc.mapdb
MapDB abstracts the MapDB framework, making it possible to create big maps that are stored on disk and accessed with high performance.
MapDB(String, String) - Constructor for class it.cnr.isti.hpc.mapdb.MapDB
 
MapDB(File) - Constructor for class it.cnr.isti.hpc.mapdb.MapDB
 
MapDBException - Exception in it.cnr.isti.hpc.mapdb
JDBMException.java
MapDBException() - Constructor for exception it.cnr.isti.hpc.mapdb.MapDBException
 
MapDBException(String) - Constructor for exception it.cnr.isti.hpc.mapdb.MapDBException
 
MapDBIdToLabel - Class in it.cnr.isti.hpc.dexter.label.mapdb
MapDBIdToLabel retrieves the label of an entity given its ID.
MapDBInstance - Class in it.cnr.isti.hpc.dexter.label.mapdb
An instance of a MapDB object
MapDBInstance() - Constructor for class it.cnr.isti.hpc.dexter.label.mapdb.MapDBInstance
 
MapDBLabelToId - Class in it.cnr.isti.hpc.dexter.label.mapdb
MapDBLabelToId retrieves the ID of an entity given its label.
Mapper<T> - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.mapper
Given a spot, a Mapper returns several different versions of the spot.
Mapper() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.mapper.Mapper
 
match(Document) - Method in class it.cnr.isti.hpc.dexter.spotter.DictionarySpotter
 
match(Document) - Method in interface it.cnr.isti.hpc.dexter.spotter.Spotter
 
mention - Variable in class it.cnr.isti.hpc.dexter.spot.Spot
 
MilneRelatedness - Class in it.cnr.isti.hpc.dexter.relatedness
Implements the standard relatedness function proposed by Milne and Witten [1].
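The Milne and Witten measure is commonly written as rel(a, b) = 1 - (log(max(|A|, |B|)) - log(|A ∩ B|)) / (log(|W|) - log(min(|A|, |B|))), where A and B are the sets of articles linking to a and b, and W is the set of all articles. The sketch below computes it over sorted arrays of incoming-link ids. The class name MilneWittenSketch, the choice to return 0 when the intersection is empty, and the clamping of negative values to 0 are assumptions of this sketch, not necessarily what MilneRelatedness does.

```java
class MilneWittenSketch {
    // Counts the common elements of two sorted arrays of distinct ids (linear merge).
    static int intersectionSize(int[] a, int[] b) {
        int i = 0, j = 0, count = 0;
        while (i < a.length && j < b.length) {
            if (a[i] < b[j]) {
                i++;
            } else if (a[i] > b[j]) {
                j++;
            } else {
                count++;
                i++;
                j++;
            }
        }
        return count;
    }

    // inA and inB are the sorted incoming-link ids of the two entities.
    // Assumes totalArticles is strictly larger than both array lengths.
    static double relatedness(int[] inA, int[] inB, int totalArticles) {
        int common = intersectionSize(inA, inB);
        if (common == 0) {
            return 0.0; // assumption: no shared in-links means unrelated
        }
        double max = Math.max(inA.length, inB.length);
        double min = Math.min(inA.length, inB.length);
        double distance = (Math.log(max) - Math.log(common))
                / (Math.log(totalArticles) - Math.log(min));
        return Math.max(0.0, 1.0 - distance); // assumption: clamp negatives to 0
    }
}
```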
MilneRelatedness() - Constructor for class it.cnr.isti.hpc.dexter.relatedness.MilneRelatedness
 
MilneRelatedness(int, int) - Constructor for class it.cnr.isti.hpc.dexter.relatedness.MilneRelatedness
 
MultifieldDocument - Class in it.cnr.isti.hpc.dexter.document
A document with an unordered set of fields.
MultifieldDocument() - Constructor for class it.cnr.isti.hpc.dexter.document.MultifieldDocument
Constructs a new document with no fields.

N

next() - Method in class it.cnr.isti.hpc.dexter.spot.ram.SpotEliasFanoOffsets.OffsetsFileIterator
 
next() - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader
 
nextLong() - Method in class it.cnr.isti.hpc.dexter.spot.ram.SpotEliasFanoOffsets.OffsetsFileIterator
 
Node - Class in it.cnr.isti.hpc.dexter.graph
Node implements a generic node in an entity graph, represented by its node id plus a set of incoming or outcoming nodes (the neighbours, represented as an array of integers)
Node() - Constructor for class it.cnr.isti.hpc.dexter.graph.Node
 
Node(int, int[]) - Constructor for class it.cnr.isti.hpc.dexter.graph.Node
 
Node.Parser - Class in it.cnr.isti.hpc.dexter.graph
 
Node.Parser() - Constructor for class it.cnr.isti.hpc.dexter.graph.Node.Parser
 
NodeFactory - Class in it.cnr.isti.hpc.dexter.graph
Returns a particular nodes collection, or a NodeWriter depending on how the nodes are serialized (db, hadoop mapfile, ram).
NodeFactory() - Constructor for class it.cnr.isti.hpc.dexter.graph.NodeFactory
 
NodeFactory.Type - Enum in it.cnr.isti.hpc.dexter.graph
 
NodeStar - Interface in it.cnr.isti.hpc.dexter.graph
Represents a collection of nodes in a graph; for each node (represented by an integer id), it can return additional data (see Node) and a list of neighbors.
NodesWriter - Interface in it.cnr.isti.hpc.dexter.graph
 
NOID - Static variable in class it.cnr.isti.hpc.dexter.label.IdHelper
 
NopFunction<T> - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe
Implements the identity function: it returns the input object unchanged.
NopFunction() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.NopFunction
 
normalizedVerb() - Method in class it.cnr.isti.hpc.text.PosToken
If the token is a verb, returns a normalized form, otherwise an empty string
normalizeScores() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatchList
Normalizes the scores of the entity matches, so that at the end of the method the scores sum to 1
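The normalization amounts to dividing each score by the total. A minimal sketch over a plain array follows; the class ScoreNormalizer and the handling of an all-zero list are assumptions for illustration.

```java
class ScoreNormalizer {
    // Divides every score by the total, in place, so that the scores sum to 1.
    static void normalize(double[] scores) {
        double sum = 0.0;
        for (double s : scores) {
            sum += s;
        }
        if (sum == 0.0) {
            return; // assumption: leave an empty or all-zero list untouched
        }
        for (int i = 0; i < scores.length; i++) {
            scores[i] /= sum;
        }
    }
}
```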
normalizeSpotProbabilities() - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatchList
 
NumberFilter - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.filter
NumberFilter filters out all the spots containing only numbers
NumberFilter() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.NumberFilter
 
numDocs() - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
 

O

originalShingle(String) - Method in class it.cnr.isti.hpc.dexter.shingle.Shingle
 
OutcomingNodes - Interface in it.cnr.isti.hpc.dexter.graph
Represents a collection of nodes, where for each node the list of neighbors represents the list of nodes that are linked by the given node.
outIntersection() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
output - Variable in class it.cnr.isti.hpc.dexter.spot.cleanpipe.Pipe
 
outX() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
outY() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
overlaps(EntityMatch) - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
Returns true if this spot and the given spot overlap in the annotated text, e.g.
overlaps(SpotMatch) - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
Returns true if this spot and the given spot overlap in the annotated text, e.g., "neruda pablo picasso" -> 'neruda pablo' 'pablo picasso' .
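The overlap test reduces to an interval check on character offsets. The sketch below assumes half-open spans [start, end), which is a convention chosen here and not confirmed by the index; SpanOverlap is a hypothetical name.

```java
class SpanOverlap {
    // Two half-open spans [startA, endA) and [startB, endB) overlap
    // when each one starts before the other ends.
    static boolean overlaps(int startA, int endA, int startB, int endB) {
        return startA < endB && startB < endA;
    }
}
```

In "neruda pablo picasso", 'neruda pablo' covers offsets [0, 12) and 'pablo picasso' covers [7, 20), so the two spots overlap on 'pablo'.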

P

ParenthesesCleaner - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner
ParenthesesCleaner removes parentheses from a spot.
ParenthesesCleaner() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.ParenthesesCleaner
 
parseWikiIdToLuceneId() - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
Loads the map containing the conversion from the Wikipedia ids to the Lucene Ids.
Pipe<T> - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe
Pipe builds a chain of functions that manipulate T objects.
Pipe(Function<T>) - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.Pipe
 
Pipe(Pipe<T>, Function<T>) - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.Pipe
 
Pipe.OutputCollector - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe
 
PluginLoader - Class in it.cnr.isti.hpc.dexter.plugin
The PluginLoader allows new implementations of the linking components to be included in the framework.
PluginLoader() - Constructor for class it.cnr.isti.hpc.dexter.plugin.PluginLoader
 
positions - Variable in class it.cnr.isti.hpc.dexter.spot.ContextExtractor
 
PosTagger - Class in it.cnr.isti.hpc.text
PosTagger annotates text with part-of-speech (POS) tags.
PosToken - Class in it.cnr.isti.hpc.text
A POS token
PosToken(String, String) - Constructor for class it.cnr.isti.hpc.text.PosToken
 
PrefixCleaner - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner
Removes a matching prefix
PrefixCleaner(String) - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.PrefixCleaner
 
ProbabilityFilter - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.filter
ProbabilityFilter removes all the spots whose link probability is lower than a given threshold.
ProbabilityFilter() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.ProbabilityFilter
 
process(String) - Method in class it.cnr.isti.hpc.dexter.spot.clean.SpotManager
 
process(T) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.Pipe
Runs the whole pipeline over the object elem, and returns one or more manipulated versions of elem.
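A chain of this kind, where every stage may turn one element into zero, one, or several elements, can be sketched with java.util.function.Function. PipeSketch and its then method are hypothetical names; the real Pipe is built through its constructors and an output collector, which this sketch does not reproduce.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.function.Function;

class PipeSketch<T> {
    private final List<Function<T, List<T>>> stages = new ArrayList<>();

    // Appends a stage; each stage maps one element to a list of results.
    PipeSketch<T> then(Function<T, List<T>> stage) {
        stages.add(stage);
        return this;
    }

    // Feeds elem through every stage in order, fanning out when a stage
    // returns several results and dropping the element when it returns none.
    List<T> process(T elem) {
        List<T> current = Collections.singletonList(elem);
        for (Function<T, List<T>> stage : stages) {
            List<T> next = new ArrayList<>();
            for (T item : current) {
                next.addAll(stage.apply(item));
            }
            current = next;
        }
        return current;
    }
}
```

In this shape a cleaner is a stage returning exactly one element, a filter returns an empty list for rejected elements, and a mapper returns several variants.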
properties - Static variable in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
pushResult(T) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.Pipe.OutputCollector
 
pushResult(T) - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.Pipe
 

Q

query(String, String) - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
 
query(String) - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
 
QueryLuceneByIdCLI - Class in it.cnr.isti.hpc.dexter.cli.index
QueryLuceneByIdCLI retrieves from the index the Wikipedia Article with the given Wikipedia Id.
QueryLuceneByIdCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.index.QueryLuceneByIdCLI
 
QueryLuceneCLI - Class in it.cnr.isti.hpc.dexter.cli.index
QueryLuceneCLI performs a query over the Wikipedia Lucene index and returns the number of documents matching the query
QueryLuceneCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.index.QueryLuceneCLI
 
QuotesCleaner - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner
Quotes cleaner removes quotes from a spot.
QuotesCleaner() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.QuotesCleaner
 
QuotesMapper - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.mapper
QuotesMapper adds both the text between quotes that appears inside a label and the text appearing outside the quotes
(e.g., dave "baby" cortez -> baby, dave cortez )
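The example above can be reproduced with a small regular-expression sketch. QuotesMapperSketch is a hypothetical class; handling only straight double quotes and returning the unchanged label when no quotes are present are assumptions of this sketch.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class QuotesMapperSketch {
    // Matches a non-empty run of characters between straight double quotes.
    private static final Pattern QUOTED = Pattern.compile("\"([^\"]+)\"");

    static List<String> map(String label) {
        List<String> variants = new ArrayList<>();
        // First variant(s): the text found between quotes.
        Matcher m = QUOTED.matcher(label);
        while (m.find()) {
            variants.add(m.group(1).trim());
        }
        // Last variant: the label with the quoted parts removed.
        String outside = QUOTED.matcher(label).replaceAll(" ")
                .replaceAll("\\s+", " ").trim();
        if (!outside.isEmpty()) {
            variants.add(outside);
        }
        return variants;
    }
}
```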
QuotesMapper() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.mapper.QuotesMapper
 

R

RamIncomingNodes - Class in it.cnr.isti.hpc.dexter.graph.ram
RamIncomingNodes keeps the incoming nodes for each node in a graph directly in main memory.
RamNodes - Class in it.cnr.isti.hpc.dexter.graph.ram
RamNodes implements a NodeStar object in main memory.
RamNodes(File) - Constructor for class it.cnr.isti.hpc.dexter.graph.ram.RamNodes
 
RamOutcomingNodes - Class in it.cnr.isti.hpc.dexter.graph.ram
RamOutcomingNodes keeps the outcoming nodes for each node in a graph directly in main memory.
RamSpotFile - Class in it.cnr.isti.hpc.dexter.spot.ram
 
RamSpotRepository - Class in it.cnr.isti.hpc.dexter.spot.ram
 
RamSpotRepository() - Constructor for class it.cnr.isti.hpc.dexter.spot.ram.RamSpotRepository
 
rank(SpotMatch) - Method in class it.cnr.isti.hpc.dexter.entity.EntityRanker
 
rankBySimilarity(SpotMatch, EntityMatchList, String, String) - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
Sorts a list of entities by their similarity with the string context.
rankBySimilarity(SpotMatch, EntityMatchList, String) - Method in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
Sorts a list of entities by their similarity (full text) with the string context.
register(Relatedness) - Static method in class it.cnr.isti.hpc.dexter.relatedness.RelatednessFactory
 
Relatedness - Class in it.cnr.isti.hpc.dexter.relatedness
Contains the relatedness between two entities, and several functions on the graph useful to implement a relatedness function.
Relatedness() - Constructor for class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
Relatedness(int, int) - Constructor for class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
relatedness - Variable in class it.cnr.isti.hpc.dexter.relatedness.RelatednessFactory
 
RelatednessFactory - Class in it.cnr.isti.hpc.dexter.relatedness
Retrieves a particular relatedness function given its name.
RelatednessFactory() - Constructor for class it.cnr.isti.hpc.dexter.relatedness.RelatednessFactory
 
RelatednessFactory(String) - Constructor for class it.cnr.isti.hpc.dexter.relatedness.RelatednessFactory
 
remove() - Method in class it.cnr.isti.hpc.dexter.spot.ram.SpotEliasFanoOffsets.OffsetsFileIterator
 
remove() - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader
 
removeField(String) - Method in class it.cnr.isti.hpc.dexter.document.Document
Removes field with the specified name from the document.
removeField(String) - Method in class it.cnr.isti.hpc.dexter.document.FlatDocument
 
removeField(String) - Method in class it.cnr.isti.hpc.dexter.document.MultifieldDocument
Removes field with the specified name from the document.
removeOverlappings() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatchList
Removes the entities whose spots overlap: if some entity spots overlap, only the entity with the greatest score is kept while the others are removed from the list.
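One simple way to obtain this behaviour is a greedy pass in descending score order, keeping a match only if it does not overlap anything already kept. The sketch below uses a hypothetical Match class with half-open spans; it illustrates the idea and is not the EntityMatchList implementation.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

class OverlapPruner {
    // Hypothetical stand-in for an entity match: a text span plus a score.
    static final class Match {
        final int start;
        final int end;
        final double score;

        Match(int start, int end, double score) {
            this.start = start;
            this.end = end;
            this.score = score;
        }

        boolean overlaps(Match other) {
            return start < other.end && other.start < end;
        }
    }

    // Returns the surviving matches, ordered by descending score.
    static List<Match> removeOverlappings(List<Match> matches) {
        List<Match> sorted = new ArrayList<>(matches);
        sorted.sort(Comparator.comparingDouble((Match m) -> m.score).reversed());
        List<Match> kept = new ArrayList<>();
        for (Match candidate : sorted) {
            boolean clash = false;
            for (Match k : kept) {
                if (candidate.overlaps(k)) {
                    clash = true;
                    break;
                }
            }
            if (!clash) {
                kept.add(candidate);
            }
        }
        return kept;
    }
}
```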
rmCollection(String) - Method in class it.cnr.isti.hpc.mapdb.MapDB
 

S

score() - Method in class it.cnr.isti.hpc.dexter.relatedness.MilneRelatedness
 
score - Variable in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
score() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
Sentence - Class in it.cnr.isti.hpc.text
Represents a sentence in a text.
Sentence(int, int) - Constructor for class it.cnr.isti.hpc.text.Sentence
 
Sentence(String, int, int) - Constructor for class it.cnr.isti.hpc.text.Sentence
 
SentenceSegmenter - Class in it.cnr.isti.hpc.text
SentenceSegmenter segments text into sentences.
set(int, int) - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
setCollectionSize(int) - Static method in class it.cnr.isti.hpc.dexter.spot.Spot
Set the number of entities in the collection
setContent(String) - Method in class it.cnr.isti.hpc.dexter.document.FlatDocument
Set the content of the flat document (the body field)
setEnd(int) - Method in class it.cnr.isti.hpc.dexter.shingle.Shingle
 
setEnd(int) - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
setEnd(int) - Method in class it.cnr.isti.hpc.text.Sentence
 
setEnd(int) - Method in class it.cnr.isti.hpc.text.Token
 
setEntities(List<Entity>) - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Set the list of entities with the current mention
setEntities(EntityMatchList) - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
setEntity(Entity) - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
setFreq(int) - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotFrequency
 
setFrequency(int) - Method in class it.cnr.isti.hpc.dexter.entity.Entity
 
setFrequency(int) - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Set how many times the mention occurs in the collection.
setId(int) - Method in class it.cnr.isti.hpc.dexter.entity.Entity
sets the entity integer identifier (i.e., wikiId)
setId(int) - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
setId(String) - Method in class it.cnr.isti.hpc.dexter.util.TitleRedirectId
 
setIdf(double) - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Set the inverse document frequency of the mention in the collection.
setLink(int) - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Set how many times the mention occurs in the collection as a link to an entity
setLinkProbability(double) - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Set the probability for this mention to be a link.
setMaxShingleSize(int) - Method in class it.cnr.isti.hpc.dexter.shingle.ShingleExtractor
 
setMention(String) - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Set the text of the mention
setName(String) - Method in class it.cnr.isti.hpc.dexter.document.Field
 
setNeighbours(int[]) - Method in class it.cnr.isti.hpc.dexter.graph.Node
 
setNode(int) - Method in class it.cnr.isti.hpc.dexter.graph.Node
 
setPos(String) - Method in class it.cnr.isti.hpc.text.PosToken
 
setProbability(double) - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
setRedirect(String) - Method in class it.cnr.isti.hpc.dexter.util.TitleRedirectId
 
setScore(double) - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
setScore(double) - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
setSpot(SpotMatch) - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
setSpot(Spot) - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
setSpot(String) - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotFrequency
 
setSpot(String) - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTarget
 
setSrc(int) - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTarget
 
setStart(int) - Method in class it.cnr.isti.hpc.dexter.shingle.Shingle
 
setStart(int) - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
setStart(int) - Method in class it.cnr.isti.hpc.text.Sentence
 
setStart(int) - Method in class it.cnr.isti.hpc.text.Token
 
setTarget(int) - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTarget
 
setText(String) - Method in class it.cnr.isti.hpc.dexter.shingle.Shingle
 
setText(String) - Method in class it.cnr.isti.hpc.text.Sentence
 
setText(String) - Method in class it.cnr.isti.hpc.text.Token
 
setTitle(String) - Method in class it.cnr.isti.hpc.dexter.util.TitleRedirectId
 
setToken(String) - Method in class it.cnr.isti.hpc.text.PosToken
 
setValue(String) - Method in class it.cnr.isti.hpc.dexter.document.Field
 
setWindowSize(int) - Method in class it.cnr.isti.hpc.dexter.spot.ContextExtractor
 
setX(int) - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
setY(int) - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
sharedTerms(String, String) - Static method in class it.cnr.isti.hpc.text.Text
 
Shingle - Class in it.cnr.isti.hpc.dexter.shingle
A Shingle represents a fragment of text in a document to annotate.
Shingle(List<Token>) - Constructor for class it.cnr.isti.hpc.dexter.shingle.Shingle
 
Shingle(String, int, int) - Constructor for class it.cnr.isti.hpc.dexter.shingle.Shingle
 
Shingle(String) - Constructor for class it.cnr.isti.hpc.dexter.shingle.Shingle
 
ShingleExtractor - Class in it.cnr.isti.hpc.dexter.shingle
ShingleExtractor extracts all the n-grams (of fixed length) from the text of an article.
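As a rough illustration of fixed-length n-gram extraction, the sketch below splits text on whitespace and emits every run of n consecutive tokens. The class name NgramSketch and the whitespace tokenization are assumptions made for this example; the real ShingleExtractor may tokenize, clean, and track offsets differently.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of the Dexter API.
final class NgramSketch {

    // Returns all word n-grams of exactly n tokens, in order of appearance.
    static List<String> ngrams(String text, int n) {
        List<String> out = new ArrayList<>();
        String trimmed = text.trim();
        if (n <= 0 || trimmed.isEmpty()) {
            return out;
        }
        // Assumption: tokens are separated by runs of whitespace.
        String[] tokens = trimmed.split("\\s+");
        for (int i = 0; i + n <= tokens.length; i++) {
            out.add(String.join(" ", Arrays.copyOfRange(tokens, i, i + n)));
        }
        return out;
    }
}
```

For example, NgramSketch.ngrams("the city of Rome", 2) yields the three bigrams "the city", "city of", and "of Rome".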
ShingleExtractor(Article) - Constructor for class it.cnr.isti.hpc.dexter.shingle.ShingleExtractor
 
ShingleExtractor(String) - Constructor for class it.cnr.isti.hpc.dexter.shingle.ShingleExtractor
 
skip(int) - Method in class it.cnr.isti.hpc.dexter.spot.ram.SpotEliasFanoOffsets.OffsetsFileIterator
 
sort() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatchList
 
sortByProbability() - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatchList
 
sortedunion(int[], int[], int, int) - Static method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
split(String) - Method in class it.cnr.isti.hpc.text.SentenceSegmenter
 
split(Reader) - Method in class it.cnr.isti.hpc.text.SentenceSegmenter
 
splitPos(String) - Method in class it.cnr.isti.hpc.text.SentenceSegmenter
 
spot(Document) - Method in class it.cnr.isti.hpc.dexter.Dexter
 
Spot - Class in it.cnr.isti.hpc.dexter.spot
A spot consists of: the mention (i.e.
Spot(String) - Constructor for class it.cnr.isti.hpc.dexter.spot.Spot
Builds a spot from a textual mention
Spot(String, List<Entity>, int, int) - Constructor for class it.cnr.isti.hpc.dexter.spot.Spot
Creates the spot and associates with it the entities that the mention can refer to, the number of times the mention occurs as anchor text in Wikipedia, and the number of articles that contain the mention as anchor or plain text (i.e.
spot - Variable in class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
Spot.Parser - Class in it.cnr.isti.hpc.dexter.spot
A record parser for tsv encoded spots.
Spot.Parser() - Constructor for class it.cnr.isti.hpc.dexter.spot.Spot.Parser
 
SpotCLI - Class in it.cnr.isti.hpc.dexter.cli.spot
Performs spotting over a document using the DictionarySpotter and prints to stdout the list of matched spots with their candidate entities.
SpotCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.spot.SpotCLI
 
SpotEliasFanoOffsets - Class in it.cnr.isti.hpc.dexter.spot.ram
 
SpotEliasFanoOffsets.OffsetsFile - Class in it.cnr.isti.hpc.dexter.spot.ram
 
SpotEliasFanoOffsets.OffsetsFile(String) - Constructor for class it.cnr.isti.hpc.dexter.spot.ram.SpotEliasFanoOffsets.OffsetsFile
 
SpotEliasFanoOffsets.OffsetsFileIterator - Class in it.cnr.isti.hpc.dexter.spot.ram
 
SpotEliasFanoOffsets.OffsetsFileIterator(String) - Constructor for class it.cnr.isti.hpc.dexter.spot.ram.SpotEliasFanoOffsets.OffsetsFileIterator
 
SpotManager - Class in it.cnr.isti.hpc.dexter.spot.clean
A SpotManager takes care of cleaning the anchor texts extracted from the Wikipedia articles in order to produce a dictionary of spots.
SpotManager() - Constructor for class it.cnr.isti.hpc.dexter.spot.clean.SpotManager
Generates a new pipeline
SpotManager(Pipe<String>) - Constructor for class it.cnr.isti.hpc.dexter.spot.clean.SpotManager
Creates a spot manager performing the cleaning described in the given pipe.
SpotMatch - Class in it.cnr.isti.hpc.dexter.spot
SpotMatch contains all the additional information about a spot matched at a particular position in the text.
SpotMatch(Spot) - Constructor for class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
SpotMatch(Spot, EntityMatchList) - Constructor for class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
SpotMatch(Spot, List<Entity>) - Constructor for class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
SpotMatch(Spot, Field) - Constructor for class it.cnr.isti.hpc.dexter.spot.SpotMatch
 
SpotMatchList - Class in it.cnr.isti.hpc.dexter.spot
Represents a list of SpotMatch
SpotMatchList() - Constructor for class it.cnr.isti.hpc.dexter.spot.SpotMatchList
 
SpotMinimalPerfectHash - Class in it.cnr.isti.hpc.dexter.spot.ram
 
SpotReader - Class in it.cnr.isti.hpc.dexter.spot
SpotReader iterates over a list of spots given two files: a spotSrcTargetFile containing a list of spots extracted from the anchors in the Wikipedia dump.
SpotReader(String, String) - Constructor for class it.cnr.isti.hpc.dexter.spot.SpotReader
 
SpotReader.SpotFrequency - Class in it.cnr.isti.hpc.dexter.spot
Contains the text of a spot and its document frequency in the collection.
SpotReader.SpotFrequency() - Constructor for class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotFrequency
 
SpotReader.SpotFrequency(String, int) - Constructor for class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotFrequency
 
SpotReader.SpotFrequencyParser - Class in it.cnr.isti.hpc.dexter.spot
Parses a line containing the encoded version of a SpotFrequency object.
SpotReader.SpotFrequencyParser() - Constructor for class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotFrequencyParser
 
SpotReader.SpotSrcTarget - Class in it.cnr.isti.hpc.dexter.spot
 
SpotReader.SpotSrcTarget() - Constructor for class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTarget
 
SpotReader.SpotSrcTargetParser - Class in it.cnr.isti.hpc.dexter.spot
 
SpotReader.SpotSrcTargetParser() - Constructor for class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTargetParser
 
SpotRepository - Interface in it.cnr.isti.hpc.dexter.spot.repo
SpotRepository retrieves metadata about a spot, i.e., a piece of text referring to one or more entities.
SpotRepositoryFactory - Class in it.cnr.isti.hpc.dexter.spot.repo
SpotRepositoryFactory returns a spot repository instance given a string defining its type.
SpotRepositoryFactory() - Constructor for class it.cnr.isti.hpc.dexter.spot.repo.SpotRepositoryFactory
 
Spotter - Interface in it.cnr.isti.hpc.dexter.spotter
Spotting aims at identifying spots, i.e., contiguous sequences of n terms (n-grams) occurring in a document D that might mention some entity e.
start - Variable in class it.cnr.isti.hpc.dexter.shingle.Shingle
start position in the original text
stats() - Method in class it.cnr.isti.hpc.dexter.Dexter
 
STD - Static variable in class it.cnr.isti.hpc.dexter.relatedness.RelatednessFactory
 
STD_TYPE - Static variable in class it.cnr.isti.hpc.dexter.graph.NodeFactory
 
STD_TYPE - Static variable in class it.cnr.isti.hpc.dexter.label.IdHelperFactory
 
stdIdHelper - Static variable in class it.cnr.isti.hpc.dexter.label.IdHelperFactory
 
stdIdToLabelWriter - Static variable in class it.cnr.isti.hpc.dexter.label.IdHelperFactory
 
stdLabelToIdWriter - Static variable in class it.cnr.isti.hpc.dexter.label.IdHelperFactory
 
store(Integer, String) - Static method in class it.cnr.isti.hpc.dexter.cli.label.IndexIdToLabelCLI
 
store(String, Integer) - Static method in class it.cnr.isti.hpc.dexter.cli.label.IndexLabelToIdCLI
 
STORE_TERM_VECTORS - Static variable in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
 
STORE_TERM_VECTORS_NOT_STORED - Static variable in class it.cnr.isti.hpc.dexter.lucene.LuceneHelper
 
StripCleaner - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner
StripCleaner trims characters at the beginning or at the end of a spot.
StripCleaner() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.StripCleaner
Default strip cleaner; strips the characters ,#*-!`{}~[]='<>:%/ if they are at the beginning or at the end of a string.
StripCleaner(String) - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.StripCleaner
Strip cleaner that strips the characters given in a string if they are at the beginning or at the end of a string.
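The stripping behaviour described for the two StripCleaner constructors can be sketched as follows. StripSketch and its strip method are hypothetical names for this example; the default character set is copied from the description above, and whether the real cleaner also trims whitespace is not stated in this index.

```java
// Hypothetical helper, not part of the Dexter API.
final class StripSketch {

    // Default characters as listed in the StripCleaner() description.
    static final String DEFAULT_CHARS = ",#*-!`{}~[]='<>:%/";

    // Removes any leading and trailing characters of s that occur in chars.
    // Characters in the middle of the string are left untouched.
    static String strip(String s, String chars) {
        int start = 0;
        int end = s.length();
        while (start < end && chars.indexOf(s.charAt(start)) >= 0) {
            start++;
        }
        while (end > start && chars.indexOf(s.charAt(end - 1)) >= 0) {
            end--;
        }
        return s.substring(start, end);
    }
}
```

With the default set, StripSketch.strip("[[Rome]]", StripSketch.DEFAULT_CHARS) gives "Rome", while an inner hyphen as in "--a-b--" is preserved, giving "a-b".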
SymbolFilter - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.filter
SymbolFilter filters out all the spots that do not contain alphabetic characters.
SymbolFilter() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.SymbolFilter
 

T

tag(Document) - Method in class it.cnr.isti.hpc.dexter.Dexter
 
tag(Document) - Method in interface it.cnr.isti.hpc.dexter.Tagger
Takes a Document and performs entity linking on it.
tag(String) - Method in class it.cnr.isti.hpc.text.PosTagger
 
Tagger - Interface in it.cnr.isti.hpc.dexter
A tagger performs the whole entity linking process.
TemplateCleaner - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner
TemplateCleaner removes all the text matching TEMPLATE[....]
TemplateCleaner() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.TemplateCleaner
 
TemplateFilter - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.filter
Deprecated.
depends on language
TemplateFilter() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.filter.TemplateFilter
Deprecated.
 
text - Variable in class it.cnr.isti.hpc.dexter.shingle.Shingle
cleaned text
text - Variable in class it.cnr.isti.hpc.dexter.spot.ContextExtractor
 
Text - Class in it.cnr.isti.hpc.text
Utils to manage text.
Text(String) - Constructor for class it.cnr.isti.hpc.text.Text
 
TitleRedirectId - Class in it.cnr.isti.hpc.dexter.util
TitleRedirectId contains the title of an article, its numerical id, and, if the article is a redirect, the redirect string.
TitleRedirectId() - Constructor for class it.cnr.isti.hpc.dexter.util.TitleRedirectId
 
TitleRedirectId(Article) - Constructor for class it.cnr.isti.hpc.dexter.util.TitleRedirectId
 
TitleRedirectId.Parser - Class in it.cnr.isti.hpc.dexter.util
The Parser class encodes a TitleRedirectId object as a tab-separated line containing, in the first position, the title (or the target of the redirect if the article is a redirect); in the second position, the redirect title (or the empty string if the article is not a redirect); and finally the numerical id of the article.
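The three-column layout described for TitleRedirectId.Parser might look like the sketch below. TitleRedirectTsvSketch and its methods are hypothetical and only mirror the column order given in the description; they are not the library's own encode and decode routines.

```java
// Hypothetical helper, not part of the Dexter API.
final class TitleRedirectTsvSketch {

    // Builds "<title> TAB <redirect or empty> TAB <id>".
    static String encode(String title, String redirect, int id) {
        String redirectColumn = (redirect == null) ? "" : redirect;
        return title + "\t" + redirectColumn + "\t" + id;
    }

    // Splits a line back into its three columns.
    // The limit -1 keeps the empty redirect column of non-redirect articles.
    static String[] decode(String line) {
        String[] fields = line.split("\t", -1);
        if (fields.length != 3) {
            throw new IllegalArgumentException("expected 3 tab-separated fields: " + line);
        }
        return fields;
    }
}
```

Passing -1 as the split limit matters here: without it, trailing or inner empty columns can be mishandled when a record has no redirect.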
TitleRedirectId.Parser() - Constructor for class it.cnr.isti.hpc.dexter.util.TitleRedirectId.Parser
 
toArray(String) - Method in class it.cnr.isti.hpc.text.IntArrayString
 
toByteArray() - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Encodes this spot as a byte array. The encoding consists of: 1 byte containing the length n of the mention (n is assumed to be less than 256); n bytes containing the mention encoded in ASCII; 4 bytes containing the frequency of the mention as a link; 4 bytes containing the document frequency of the mention; and 2 x 4 x m bytes, where m is the number of entities associated with the mention, containing for each entity its unique id and its frequency (the number of anchors with this text that link to the entity).
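A minimal sketch of the byte layout described for toByteArray() is shown below. SpotEncodingSketch, its parameter names, and the big-endian byte order (the ByteBuffer default) are assumptions for this example; the index does not state the byte order used by Spot.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of the Dexter API.
final class SpotEncodingSketch {

    // Layout: [1 byte n][n bytes ASCII mention][4 bytes link][4 bytes df]
    //         then, per entity, [4 bytes id][4 bytes frequency].
    static byte[] encode(String mention, int link, int df, int[] entityIds, int[] entityFreqs) {
        byte[] text = mention.getBytes(StandardCharsets.US_ASCII);
        if (text.length > 255) {
            throw new IllegalArgumentException("mention longer than 255 bytes");
        }
        if (entityIds.length != entityFreqs.length) {
            throw new IllegalArgumentException("ids and frequencies must have the same length");
        }
        ByteBuffer buf = ByteBuffer.allocate(1 + text.length + 4 + 4 + 8 * entityIds.length);
        buf.put((byte) text.length);   // length stored as an unsigned byte
        buf.put(text);
        buf.putInt(link);              // big-endian by default (assumption)
        buf.putInt(df);
        for (int i = 0; i < entityIds.length; i++) {
            buf.putInt(entityIds[i]);
            buf.putInt(entityFreqs[i]);
        }
        return buf.array();
    }
}
```

A four-character mention with one entity therefore occupies 1 + 4 + 4 + 4 + 8 = 21 bytes.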
toEntityString() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
toEntityString() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatchList
 
Token - Class in it.cnr.isti.hpc.text
 
Token(int, int) - Constructor for class it.cnr.isti.hpc.text.Token
 
Token(String, int, int) - Constructor for class it.cnr.isti.hpc.text.Token
 
tokenize(String) - Method in class it.cnr.isti.hpc.text.TokenSegmenter
 
tokenizePos(String) - Method in class it.cnr.isti.hpc.text.TokenSegmenter
 
TokenSegmenter - Class in it.cnr.isti.hpc.text
 
TopScoreEntityDisambiguator - Class in it.cnr.isti.hpc.dexter.disambiguation
Implements the Occam's razor principle, resolving the ambiguity for a spot by choosing the entity with the largest probability of being represented by the spot. This probability is called commonness, and it is computed as the ratio between the number of links that point to the entity (using the spot as anchor) and the total number of links that have the spot as anchor.
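The commonness ratio described above can be sketched as follows. CommonnessSketch and the map of entity id to anchor count are assumptions made for illustration; the actual disambiguator works on Dexter's own EntityMatch and SpotMatch types.

```java
import java.util.Map;

// Hypothetical helper, not part of the Dexter API.
final class CommonnessSketch {

    // commonness = links to the entity with this anchor / all links with this anchor
    static double commonness(int linksToEntity, int totalLinksWithSpot) {
        if (totalLinksWithSpot <= 0) {
            return 0.0;
        }
        return (double) linksToEntity / totalLinksWithSpot;
    }

    // Returns the entity id with the highest commonness, or -1 if there are no candidates.
    // anchorCounts maps entity id -> number of anchors with this spot linking to it.
    static int topEntity(Map<Integer, Integer> anchorCounts) {
        int total = 0;
        for (int count : anchorCounts.values()) {
            total += count;
        }
        int best = -1;
        double bestScore = -1.0;
        for (Map.Entry<Integer, Integer> e : anchorCounts.entrySet()) {
            double score = commonness(e.getValue(), total);
            if (score > bestScore) {
                bestScore = score;
                best = e.getKey();
            }
        }
        return best;
    }
}
```

If a spot links 30 times to entity 1 and 70 times to entity 2, the commonness values are 0.3 and 0.7, and entity 2 is selected.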
TopScoreEntityDisambiguator() - Constructor for class it.cnr.isti.hpc.dexter.disambiguation.TopScoreEntityDisambiguator
 
toString() - Method in class it.cnr.isti.hpc.dexter.entity.Entity
 
toString() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatch
 
toString() - Method in class it.cnr.isti.hpc.dexter.entity.EntityMatchList
 
toString() - Method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
toString() - Method in class it.cnr.isti.hpc.dexter.shingle.Shingle
 
toString() - Method in class it.cnr.isti.hpc.dexter.spot.cleanpipe.Pipe.OutputCollector
 
toString() - Method in class it.cnr.isti.hpc.dexter.spot.Spot
 
toString() - Method in class it.cnr.isti.hpc.dexter.spot.SpotMatchList
 
toString() - Method in class it.cnr.isti.hpc.dexter.spot.SpotReader.SpotSrcTarget
 
toString(List<Integer>) - Method in class it.cnr.isti.hpc.text.IntArrayString
 
toString(Integer[]) - Method in class it.cnr.isti.hpc.text.IntArrayString
 
toString() - Method in class it.cnr.isti.hpc.text.PosToken
 
toString() - Method in class it.cnr.isti.hpc.text.Sentence
 
toString() - Method in class it.cnr.isti.hpc.text.Token
 
toTsv() - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Returns a tab separated version of the spot in a string.
TypeCleaner - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner
TypeCleaner removes domain information usually appended at the end of titles (between parentheses or after a #),
e.g.
TypeCleaner() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.TypeCleaner
 
TypeMapper - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.mapper
Deprecated.
better as cleaner
TypeMapper() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.mapper.TypeMapper
Deprecated.
 

U

UnderscoreCleaner - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner
UnderscoreCleaner removes all the underscores in a spot, replacing them with spaces.
UnderscoreCleaner() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.UnderscoreCleaner
 
UnicodeCleaner - Class in it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner
UnicodeCleaner maps a Unicode string to ASCII; the conversion is lossy and is performed using the unidecode library.
UnicodeCleaner() - Constructor for class it.cnr.isti.hpc.dexter.spot.cleanpipe.cleaner.UnicodeCleaner
 
unionSize(int[], int[], int, int) - Static method in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
updateIdf() - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Updates the IDF using the freq and collectionSize fields.
updateLinkProbability() - Method in class it.cnr.isti.hpc.dexter.spot.Spot
Updates the link probability using the link and freq fields.
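This index names the fields involved in updateIdf() and updateLinkProbability() but not the formulas. The sketch below uses the common definitions, link probability as link / freq and IDF as log(collectionSize / freq), purely as an assumption; the cap at 1.0 and the zero-guards are also choices made for this example, and SpotStatsSketch is a hypothetical name.

```java
// Hypothetical helper, not part of the Dexter API.
final class SpotStatsSketch {

    // Assumed definition: fraction of occurrences of the mention that are links.
    // link: times the mention occurs as anchor text; freq: documents containing the mention.
    static double linkProbability(int link, int freq) {
        if (freq <= 0) {
            return 0.0;
        }
        // Capped at 1.0 because link counts occurrences while freq counts documents (assumption).
        return Math.min(1.0, (double) link / freq);
    }

    // Assumed definition: natural log of collection size over document frequency.
    static double idf(int freq, int collectionSize) {
        if (freq <= 0 || collectionSize <= 0) {
            return 0.0;
        }
        return Math.log((double) collectionSize / freq);
    }
}
```

Under these assumed definitions, a mention that appears in 200 documents and is used as an anchor 50 times has link probability 0.25.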
useStopwords - Variable in class it.cnr.isti.hpc.text.Text
 

V

valueOf(String) - Static method in enum it.cnr.isti.hpc.dexter.graph.NodeFactory.Type
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum it.cnr.isti.hpc.dexter.label.IdHelperFactory.Type
Returns the enum constant of this type with the specified name.
values() - Static method in enum it.cnr.isti.hpc.dexter.graph.NodeFactory.Type
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum it.cnr.isti.hpc.dexter.label.IdHelperFactory.Type
Returns an array containing the constants of this enum type, in the order they are declared.

W

W - Static variable in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 
WriteOneSpotPerLineCLI - Class in it.cnr.isti.hpc.dexter.cli.spot
Takes as input a file containing:

<spot> <tab> <src entity> <tab> <target entity>

(the output of ExtractSpotCLI) and a file:

<spot> <tab> <df(spot)>

(the output of GenerateSpotDocumentFrequencyCLI), where df(spot) is the number of Wikipedia documents containing the text of the spot (as anchor or plain text).
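The two tab-separated input formats described above could be parsed along these lines. SpotLineSketch and its nested classes are hypothetical and exist only for this example (the library has its own SpotReader.SpotSrcTargetParser and SpotReader.SpotFrequencyParser); entity ids are treated as integers, matching the setSrc(int) and setTarget(int) setters listed in this index.

```java
// Hypothetical helper, not part of the Dexter API.
final class SpotLineSketch {

    // One line of "<spot> TAB <src entity> TAB <target entity>".
    static final class SrcTarget {
        final String spot;
        final int src;
        final int target;

        SrcTarget(String spot, int src, int target) {
            this.spot = spot;
            this.src = src;
            this.target = target;
        }
    }

    // One line of "<spot> TAB <df(spot)>".
    static final class Frequency {
        final String spot;
        final int df;

        Frequency(String spot, int df) {
            this.spot = spot;
            this.df = df;
        }
    }

    static SrcTarget parseSrcTarget(String line) {
        String[] f = line.split("\t", -1);
        if (f.length != 3) {
            throw new IllegalArgumentException("expected 3 fields: " + line);
        }
        return new SrcTarget(f[0], Integer.parseInt(f[1]), Integer.parseInt(f[2]));
    }

    static Frequency parseFrequency(String line) {
        String[] f = line.split("\t", -1);
        if (f.length != 2) {
            throw new IllegalArgumentException("expected 2 fields: " + line);
        }
        return new Frequency(f[0], Integer.parseInt(f[1]));
    }
}
```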
WriteOneSpotPerLineCLI(String[]) - Constructor for class it.cnr.isti.hpc.dexter.cli.spot.WriteOneSpotPerLineCLI
 

X

x - Variable in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 

Y

y - Variable in class it.cnr.isti.hpc.dexter.relatedness.Relatedness
 

Copyright © 2013. All rights reserved.