System requirementsΒΆ
Scrapple is a Python package/command line tool which runs on all operating systems that support Python. This includes :
- Windows XP
- Windows Vista
- Windows 7
- Windows 8.x
- Common Linux distros : Ubuntu/Xubuntu/Lubuntu, Fedora, Mint, Gentoo, openSUSE, Arch Linux etc.
- OS X
The basic requirements for running Scrapple are:
- Python 2.7 or 3.x
- pip or easy_install for installing the necessary Python packages [pip is the recommended choice]
Scrapple depends on a number of Python packages for various parts of its execution :
- requests : The HTTP library. The requests library is used to make HTTP requests to load the required web pages.
- lxml : The web scraping library. The lxml library is used to parse the element tree and extract the required content.
- cssselect : The CSS selector library. cssselect works in tandem with lxml to handle CSS Selector expressions.
- docopt : The command line parser. The docopt library is used to parse the command line interface input based on the CLI usage specification in the docstring.
- Jinja2 : The templating engine. Jinja2 is used to create skeleton configuration file and generated Python scraper scripts.
- Flask : The web micro-framework. The web interface to edit configuration files runs on Flask.
- colorama : The output formatter. colorama is used to format the various sections of the command line output.