wikidump

Tools to manipulate and extract data from wikipedia dumps
Download

wikidump Ranking & Summary

Advertisement

  • Rating:
  • License:
  • GPL v3
  • Publisher Name:
  • Marco Lui
  • Publisher web site:
  • http://github.com/saffsd/

wikidump Tags


wikidump Description

Tools to manipulate and extract data from wikipedia dumps wikidump is a Python module that contains code for manipulating wikipedia dumps available from http://download.wikimedia.org/backup-index.htmlConfigurationUpon first importing the module, a file 'wikidump.cfg' will be created. Modify the paths in this file to point to your data. * scratch : where indices are stores (must be writeable) * xml_dumps : where the xml dumps are located (can be read-only)UsageIn addition to python modules, wikidump also comes with a command-line tool to quickly access wikidump functionality. Run wikidump help for a list of options.Credits * Distribute * Buildout * modern-package-template Requirements: · Python


wikidump Related Software