mmautner / readability Goto Github PK

View Code? Open in Web Editor NEW

186.0 186.0 60.0 185 KB

a collection of functions that measure the readability of a given body of text

License: Other

Python 100.00%

readability's People

Contributors

Stargazers

Watchers

readability's Issues

The problem with syllable count

The number of syllables for some words computed by the 'count' function (in syllables_en.py) is wrong.
For example:
count('the') =0, count('we') =0, count('be') =0

Installation

Hi,
I am not able to do pip install for this. Not able to find the installation setyp.py fine in the package

0 syllables in the

Hey there.

I was playing around with this module and found some discrepancies with my own calculations.

Particularly, 'the' apparently doesn't have any syllables:

import readability.syllables_en
>>> readability.syllables_en.count("the")
0

Unicode error

Shouldn't this work?

>>> Readability(u'This does not work\u2762').SMOGIndex()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "readability\readability.py", line 16, in __init__
    self.analyze_text(text)
  File "readability\readability.py", line 20, in analyze_text
    char_count = get_char_count(words)
  File "readability\utils.py", line 17, in get_char_count
    characters += len(word.decode("utf-8"))
  File "c:\Python27\lib\encodings\utf_8.py", line 16, in decode
    return codecs.utf_8_decode(input, errors, True)
UnicodeEncodeError: 'ascii' codec can't encode character u'\u2762' in position 4: ordinal not in range(128)

P.S., thank you for the nice code.

mmautner / readability Goto Github PK

readability's People

Contributors

Stargazers

Watchers

Forkers

readability's Issues

The problem with syllable count

Installation

0 syllables in the

Unicode error

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent