Comments (3)
Good question! There's not a hard-coded limit, but you'll definitely run into problems with large state limits. Small-ish corpora won't have many sequences that share state_size - 1
consecutive items — e.g., it's much easier to find a subsequent item for John ate
than for John ate a bagel with
.
Some convenience methods, e.g. markovify.Text.make_sentence(...)
, handle this issue by specifying a maximum number of times to try creating a new chain. That said, it's definitely possible that there's some hidden bug causing problems.
Do you have example code/data to reproduce the problem?
from markovify.
Here is the block of code.
file_path = 'resources/the-elves.txt'
with open(file_path) as f:
file_content = f.read()
model = markovify.Text(file_content, state_size=6)
return ' '.join([model.make_short_sentence(150) for i in range(5)])
This runs fine with a state size of 5 or less. From your explanation above, I understand why a large state size can be an issue.
from markovify.
Thanks! Do you have a link to the "resources/the-elves.txt" file? If so, I can try to figure out what's causing this particular problem. (If there's no link, you can also email me the file: [email protected].)
from markovify.
Related Issues (20)
- Character level chains instead of word level? HOT 2
- Markovify always outputs "None" with russian corpus HOT 12
- markovify and music HOT 1
- Thank you for a job well done! HOT 2
- I can’t install because of the encoding of the file HOT 1
- Can I generate sentence with only two words? HOT 2
- generate sentence with it's prediction HOT 2
- spaCy model shortcuts are deprecated HOT 1
- Non-english characters are not being displayed correctly.
- markov_text_model.make_sentence_with_start KeyError HOT 1
- Fallback without building a new model? HOT 1
- “python_requires” should be set with “>=3.6”, as markovify 0.9.3 is not compatible with all Python versions. HOT 1
- Control generated sentences randomness HOT 2
- - HOT 2
- missing utf-8 BOM lead to codec failures during tests on windows
- Markovify - Markov chain : Seed and Condition to text generated based in input. HOT 2
- markovify's make_sentence_with_start() doesn't seem to work properly HOT 11
- Can't install on browser webpage.
- PolyCodeMaster.py HOT 1
- Markovify returns None HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from markovify.