Extract text from HTML
- Free software: MIT license
Install with pip:
pip install html-text
The package depends on lxml, so you might need to install some additional packages: http://lxml.de/installation.html
Extract text from HTML:
>>> import html_text
>>> text = html_text.extract_text(u'<h1>Hey</h1>')
u'Hey'
The code is extracted from utilities used in several projects, written by Mikhail Korobov.