public class WikipediaArticleReader extends Object
Article
Constructor and Description |
---|
WikipediaArticleReader(File inputFile,
File outputFile,
String lang)
Generates a converter from the xml to json dump.
|
WikipediaArticleReader(String inputFile,
String outputFile,
String lang)
Generates a converter from the xml to json dump.
|
public WikipediaArticleReader(String inputFile, String outputFile, String lang)
inputFile
- - the xml file (compressed)outputFile
- - the json output file, containing one article per line (if
the filename ends with .gz the output will be
compressed).lang
- - the language of the dumppublic WikipediaArticleReader(File inputFile, File outputFile, String lang)
inputFile
- - the xml file (compressed)outputFile
- - the json output file, containing one article per line (if
the filename ends with .gz the output will be
compressed).lang
- - the language of the dumppublic void start() throws IOException, SAXException
IOException
SAXException
Copyright © 2013. All Rights Reserved.