A C D E F G H I J K L M P R S T U V W 

A

addAllCategories(List<Link>) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
addCategory(Link) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
addRow(List<String>) - Method in class it.cnr.isti.hpc.wikipedia.article.Table
 
addTemplatesSchema(List<String>) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
Article - Class in it.cnr.isti.hpc.wikipedia.article
Article represents an article in the Wikipedia dump.
Article() - Constructor for class it.cnr.isti.hpc.wikipedia.article.Article
 
Article.Type - Enum in it.cnr.isti.hpc.wikipedia.article
The possible types of an article (e.g., template, article, category)
ArticleParser - Class in it.cnr.isti.hpc.wikipedia.parser
Generates a Mediawiki parser given a language, (it will expect to find a locale file in src/main/resources/).
ArticleParser(String) - Constructor for class it.cnr.isti.hpc.wikipedia.parser.ArticleParser
 
ArticleParser() - Constructor for class it.cnr.isti.hpc.wikipedia.parser.ArticleParser
 
ArticleSummarizer - Class in it.cnr.isti.hpc.wikipedia.article
Given an article returns a string summarizing (cleaning and enriching its content) using for displaying the entity.
ArticleSummarizer() - Constructor for class it.cnr.isti.hpc.wikipedia.article.ArticleSummarizer
 
ArticleSummarizer(int) - Constructor for class it.cnr.isti.hpc.wikipedia.article.ArticleSummarizer
 

C

cleanWikiText(String) - Method in class it.cnr.isti.hpc.wikipedia.article.ArticleSummarizer
 

D

DisambiguationFilter - Class in it.cnr.isti.hpc.wikipedia.reader.filter
Filters out/only Disambiguations
DisambiguationFilter(boolean) - Constructor for class it.cnr.isti.hpc.wikipedia.reader.filter.DisambiguationFilter
 
doubleSpaces(String) - Method in class it.cnr.isti.hpc.wikipedia.article.ArticleSummarizer
 

E

EMPTY - Static variable in class it.cnr.isti.hpc.wikipedia.article.Template
 
EMPTY_TEMPLATE - Static variable in class it.cnr.isti.hpc.wikipedia.article.Template
 
EN - Static variable in class it.cnr.isti.hpc.wikipedia.article.Language
 
equals(Object) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
equals(Object) - Method in class it.cnr.isti.hpc.wikipedia.article.Link
 
equals(Object) - Method in class it.cnr.isti.hpc.wikipedia.article.Table
 
equals(Object) - Method in class it.cnr.isti.hpc.wikipedia.article.Template
 

F

FEWER_THAN_THREE - Static variable in class it.cnr.isti.hpc.wikipedia.reader.filter.ShortTitleFilter
 
FILTER_OUT - Static variable in class it.cnr.isti.hpc.wikipedia.reader.filter.UnknownTypeFilter
 
FILTER_OUT_DISAMBIGUATIONS - Static variable in class it.cnr.isti.hpc.wikipedia.reader.filter.DisambiguationFilter
 
FILTER_OUT_REDIRECTS - Static variable in class it.cnr.isti.hpc.wikipedia.reader.filter.RedirectFilter
Keeps only the non-redirects
fromJson(String) - Static method in class it.cnr.isti.hpc.wikipedia.article.Article
 

G

get(String) - Method in class it.cnr.isti.hpc.wikipedia.article.Template
 
getAsMap() - Method in class it.cnr.isti.hpc.wikipedia.article.Template
 
getCategories() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getCategoryIdentifiers() - Method in class it.cnr.isti.hpc.wikipedia.parser.Locale
 
getCleanId() - Method in class it.cnr.isti.hpc.wikipedia.article.Link
 
getCleanParagraphs() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getCleanText() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getColumn(int) - Method in class it.cnr.isti.hpc.wikipedia.article.Table
 
getDescription() - Method in class it.cnr.isti.hpc.wikipedia.article.Link
 
getDescription() - Method in class it.cnr.isti.hpc.wikipedia.article.Template
 
getDisambigutionIdentifiers() - Method in class it.cnr.isti.hpc.wikipedia.parser.Locale
 
GetDumpSummaryCLI - Class in it.cnr.isti.hpc.wikipedia.cli
Takes the JSON dump and produce a summary file containing, a file where each line contains:

type wid wikititle redirect/short summary

The last field contains the redirection is type is redirect, otherwise the short summary
GetDumpSummaryCLI(String[]) - Constructor for class it.cnr.isti.hpc.wikipedia.cli.GetDumpSummaryCLI
 
getEnWikiTitle() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getHighlights() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getId() - Method in class it.cnr.isti.hpc.wikipedia.article.Link
 
getImageIdentifiers() - Method in class it.cnr.isti.hpc.wikipedia.parser.Locale
 
getImages() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getInfobox() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getIntegerNamespace() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getLang() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getLinks() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getListIdentifiers() - Method in class it.cnr.isti.hpc.wikipedia.parser.Locale
 
getLists() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getName() - Method in class it.cnr.isti.hpc.wikipedia.article.Table
 
getName() - Method in class it.cnr.isti.hpc.wikipedia.article.Template
 
getNamespace() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getNumCols() - Method in class it.cnr.isti.hpc.wikipedia.article.Table
 
getNumRows() - Method in class it.cnr.isti.hpc.wikipedia.article.Table
 
getParagraphs() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getParser(String) - Method in class it.cnr.isti.hpc.wikipedia.parser.MediaWikiParserFactory
 
getRedirect() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getRedirectIdentifiers() - Method in class it.cnr.isti.hpc.wikipedia.parser.Locale
 
getRedirectNoAnchor() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
the redirect without the anchor, e.g., da_vinci#life -> da_vinci
getSchema() - Method in class it.cnr.isti.hpc.wikipedia.article.Template
 
getSections() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getSnippet() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getSummary() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getSummary(Article) - Method in class it.cnr.isti.hpc.wikipedia.article.ArticleSummarizer
 
getTable() - Method in class it.cnr.isti.hpc.wikipedia.article.Table
 
getTables() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getTemplates() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getTemplatesSchema() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getText() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getTimestamp() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getTitle() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
GetTitleAndTypeCLI - Class in it.cnr.isti.hpc.wikipedia.cli
Retrieves all the titles from the wikipedia articles.
GetTitleAndTypeCLI(String[]) - Constructor for class it.cnr.isti.hpc.wikipedia.cli.GetTitleAndTypeCLI
 
getTitleInWikistyle() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getTitleInWikistyle(String) - Static method in class it.cnr.isti.hpc.wikipedia.article.Article
 
GetTitlesCLI - Class in it.cnr.isti.hpc.wikipedia.cli
Retrieves all the titles from the Wikipedia articles, considers only pages, templates and categories.
GetTitlesCLI(String[]) - Constructor for class it.cnr.isti.hpc.wikipedia.cli.GetTitlesCLI
 
getType() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getTypeName() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getWid() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getWikiId() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
getWikiTitle() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 

H

hasEnWikiTitle() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
hashCode() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
hashCode() - Method in class it.cnr.isti.hpc.wikipedia.article.Link
 
hashCode() - Method in class it.cnr.isti.hpc.wikipedia.article.Table
 
hashCode() - Method in class it.cnr.isti.hpc.wikipedia.article.Template
 
hasInfobox() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 

I

isDisambiguation() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
isDisambiguation() - Method in class it.cnr.isti.hpc.wikipedia.article.Template
 
isEmpty() - Method in class it.cnr.isti.hpc.wikipedia.article.Link
 
isFilter(Article) - Method in class it.cnr.isti.hpc.wikipedia.reader.filter.DisambiguationFilter
 
isFilter(Article) - Method in class it.cnr.isti.hpc.wikipedia.reader.filter.RedirectFilter
 
isFilter(Article) - Method in class it.cnr.isti.hpc.wikipedia.reader.filter.ShortTitleFilter
 
isFilter(Article) - Method in class it.cnr.isti.hpc.wikipedia.reader.filter.TitleFilter
Deprecated.
 
isFilter(Article) - Method in class it.cnr.isti.hpc.wikipedia.reader.filter.TypeFilter
 
isFilter(Article) - Method in class it.cnr.isti.hpc.wikipedia.reader.filter.UnknownTypeFilter
 
isLang(String) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
isList() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
isRedirect() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
IT - Static variable in class it.cnr.isti.hpc.wikipedia.article.Language
 
it.cnr.isti.hpc.wikipedia.article - package it.cnr.isti.hpc.wikipedia.article
 
it.cnr.isti.hpc.wikipedia.cli - package it.cnr.isti.hpc.wikipedia.cli
 
it.cnr.isti.hpc.wikipedia.parser - package it.cnr.isti.hpc.wikipedia.parser
 
it.cnr.isti.hpc.wikipedia.reader - package it.cnr.isti.hpc.wikipedia.reader
 
it.cnr.isti.hpc.wikipedia.reader.filter - package it.cnr.isti.hpc.wikipedia.reader.filter
 
ITALIAN_TITLE_FILTER - Static variable in class it.cnr.isti.hpc.wikipedia.reader.filter.TitleFilter
Deprecated.
 

J

JsonToLineCLI - Class in it.cnr.isti.hpc.wikipedia.cli
Output wikipedia dump in a particular format given as input string
JsonToLineCLI(String[]) - Constructor for class it.cnr.isti.hpc.wikipedia.cli.JsonToLineCLI
 

K

KEEP_DISAMBIGUATIONS - Static variable in class it.cnr.isti.hpc.wikipedia.reader.filter.DisambiguationFilter
 
KEEP_REDIRECTS - Static variable in class it.cnr.isti.hpc.wikipedia.reader.filter.RedirectFilter
Keeps only the redirects

L

Language - Class in it.cnr.isti.hpc.wikipedia.article
Contains definitions of different languages.
Language() - Constructor for class it.cnr.isti.hpc.wikipedia.article.Language
 
Link - Class in it.cnr.isti.hpc.wikipedia.article
Link class models a link an internal link on Wikipedia.
Link(String, String) - Constructor for class it.cnr.isti.hpc.wikipedia.article.Link
 
lists - Variable in class it.cnr.isti.hpc.wikipedia.article.Article
 
Locale - Class in it.cnr.isti.hpc.wikipedia.parser
Models the locale for a language.
Locale(String) - Constructor for class it.cnr.isti.hpc.wikipedia.parser.Locale
 
LocalizedMediaWikiParserFactory - Class in it.cnr.isti.hpc.wikipedia.parser
Generates a parser from the proper Locale.
LocalizedMediaWikiParserFactory(Locale) - Constructor for class it.cnr.isti.hpc.wikipedia.parser.LocalizedMediaWikiParserFactory
 

M

main(String[]) - Static method in class it.cnr.isti.hpc.wikipedia.cli.GetDumpSummaryCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.wikipedia.cli.GetTitleAndTypeCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.wikipedia.cli.GetTitlesCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.wikipedia.cli.JsonToLineCLI
 
main(String[]) - Static method in class it.cnr.isti.hpc.wikipedia.cli.MediawikiToJsonCLI
 
MAIN - Static variable in class it.cnr.isti.hpc.wikipedia.reader.filter.TypeFilter
 
MAIN_CATEGORY_TEMPLATE - Static variable in class it.cnr.isti.hpc.wikipedia.reader.filter.TypeFilter
 
MediaWikiParserFactory - Class in it.cnr.isti.hpc.wikipedia.parser
Generates the MediaWikiParser given a language.
MediaWikiParserFactory() - Constructor for class it.cnr.isti.hpc.wikipedia.parser.MediaWikiParserFactory
 
MediawikiToJsonCLI - Class in it.cnr.isti.hpc.wikipedia.cli
MediawikiToJsonCLI converts a Wikipedia Dump in Json.
MediawikiToJsonCLI(String[]) - Constructor for class it.cnr.isti.hpc.wikipedia.cli.MediawikiToJsonCLI
 

P

parse(Article, String) - Method in class it.cnr.isti.hpc.wikipedia.parser.ArticleParser
 

R

redirect - Variable in class it.cnr.isti.hpc.wikipedia.article.Article
 
RedirectFilter - Class in it.cnr.isti.hpc.wikipedia.reader.filter
Filters in/out Redirects
RedirectFilter(boolean) - Constructor for class it.cnr.isti.hpc.wikipedia.reader.filter.RedirectFilter
 
removeParanthesis(String) - Method in class it.cnr.isti.hpc.wikipedia.article.ArticleSummarizer
 
removeThumbs(String) - Method in class it.cnr.isti.hpc.wikipedia.article.ArticleSummarizer
 
removingUrls(String) - Method in class it.cnr.isti.hpc.wikipedia.article.ArticleSummarizer
 

S

setCategories(List<Link>) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setDescription(String) - Method in class it.cnr.isti.hpc.wikipedia.article.Link
 
setDescription(List<String>) - Method in class it.cnr.isti.hpc.wikipedia.article.Template
 
setEnWikiTitle(String) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setEnWikiTitle(Article, ParsedPage) - Method in class it.cnr.isti.hpc.wikipedia.parser.ArticleParser
 
setHighlights(List<String>) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setId(String) - Method in class it.cnr.isti.hpc.wikipedia.article.Link
 
setImages(List<Link>) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setInfobox(Template) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setIntegerNamespace(Integer) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setLang(String) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setLinks(List<Link>) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setLists(List<List<String>>) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setName(String) - Method in class it.cnr.isti.hpc.wikipedia.article.Table
 
setName(String) - Method in class it.cnr.isti.hpc.wikipedia.article.Template
 
setNamespace(String) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setNumCols(int) - Method in class it.cnr.isti.hpc.wikipedia.article.Table
 
setNumRows(int) - Method in class it.cnr.isti.hpc.wikipedia.article.Table
 
setParagraphs(List<String>) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setRedirect(String) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setSections(List<String>) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setSummary(String) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setTable(List<List<String>>) - Method in class it.cnr.isti.hpc.wikipedia.article.Table
 
setTables(List<Table>) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setTemplates(List<Template>) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setTemplatesSchema(List<String>) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setTimestamp(String) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setTitle(String) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setType(Article.Type) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setWid(int) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setWikiId(int) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
setWikiTitle(String) - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
ShortTitleFilter - Class in it.cnr.isti.hpc.wikipedia.reader.filter
Article is filtered if its length is fewer then a certain length (default is 3)
ShortTitleFilter() - Constructor for class it.cnr.isti.hpc.wikipedia.reader.filter.ShortTitleFilter
 
ShortTitleFilter(int) - Constructor for class it.cnr.isti.hpc.wikipedia.reader.filter.ShortTitleFilter
 
start() - Method in class it.cnr.isti.hpc.wikipedia.reader.WikipediaArticleReader
Starts the parsing
STD_FILTER - Static variable in class it.cnr.isti.hpc.wikipedia.reader.filter.TypeFilter
 

T

Table - Class in it.cnr.isti.hpc.wikipedia.article
Table models a table structure encoded in an article.
Table() - Constructor for class it.cnr.isti.hpc.wikipedia.article.Table
 
Table(String) - Constructor for class it.cnr.isti.hpc.wikipedia.article.Table
 
Template - Class in it.cnr.isti.hpc.wikipedia.article
Template represents a particular template in a article.
Template(String, List<String>) - Constructor for class it.cnr.isti.hpc.wikipedia.article.Template
 
title - Variable in class it.cnr.isti.hpc.wikipedia.article.Article
 
TitleFilter - Class in it.cnr.isti.hpc.wikipedia.reader.filter
Deprecated.
TitleFilter(String...) - Constructor for class it.cnr.isti.hpc.wikipedia.reader.filter.TitleFilter
Deprecated.
 
toJson() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
toString() - Method in class it.cnr.isti.hpc.wikipedia.article.Article
 
toString() - Method in class it.cnr.isti.hpc.wikipedia.article.Link
 
toString() - Method in class it.cnr.isti.hpc.wikipedia.article.Table
 
toString() - Method in class it.cnr.isti.hpc.wikipedia.article.Template
 
TypeFilter - Class in it.cnr.isti.hpc.wikipedia.reader.filter
TypeFilter filters the articles base on their type.
TypeFilter(Article.Type...) - Constructor for class it.cnr.isti.hpc.wikipedia.reader.filter.TypeFilter
 

U

UnknownTypeFilter - Class in it.cnr.isti.hpc.wikipedia.reader.filter
Filters out articles that do not have a type
UnknownTypeFilter() - Constructor for class it.cnr.isti.hpc.wikipedia.reader.filter.UnknownTypeFilter
 

V

valueOf(String) - Static method in enum it.cnr.isti.hpc.wikipedia.article.Article.Type
Returns the enum constant of this type with the specified name.
values() - Static method in enum it.cnr.isti.hpc.wikipedia.article.Article.Type
Returns an array containing the constants of this enum type, in the order they are declared.

W

WikipediaArticleReader - Class in it.cnr.isti.hpc.wikipedia.reader
A reader that converts a Wikipedia dump in its json dump.
WikipediaArticleReader(String, String, String) - Constructor for class it.cnr.isti.hpc.wikipedia.reader.WikipediaArticleReader
Generates a converter from the xml to json dump.
WikipediaArticleReader(File, File, String) - Constructor for class it.cnr.isti.hpc.wikipedia.reader.WikipediaArticleReader
Generates a converter from the xml to json dump.
wikiTitle - Variable in class it.cnr.isti.hpc.wikipedia.article.Article
 
A C D E F G H I J K L M P R S T U V W 

Copyright © 2013. All Rights Reserved.