
dosage's Introduction

Dosage

Dosage is a comic strip downloader and archiver.

You have reached the old repository of Dosage.

Please go to https://github.com/webcomics/dosage or http://dosage.rocks/ for the new location of the project.

If you still have an old clone, you can switch to the new location like this:

git remote set-url origin https://github.com/webcomics/dosage.git

If you have already updated to the last commit on this repository, you can get rid of this commit by doing:

git reset --hard HEAD^

And then updating to the latest revision on the new repository.

Dosage is designed to keep a local copy of specific webcomics and other picture-based content such as Picture of the Day sites. With the dosage command-line script you can get the latest strip of a webcomic, catch up to the last strip downloaded, or download a strip for a particular date or index (if the webcomic's site layout allows this).

Multiple webcomics can be downloaded in parallel, making the update of comic strips faster.

See http://dosage.rocks/ for more info.


dosage's People

Contributors

a3nm, carlosefonseca, dirkreiners, dromaludaire, faldrian, freestila, hstasch, kevinanthony, lugoues, manabi, null000, pataluc, rpglover64, sehrgut, serenitas50, shartge, tobix, wummel, zac9


dosage's Issues

multiple comics found

python dosage Garfield

********** Oops, I did it again. *************

You have found an internal error in Dosage. Please write a bug report
at http://wummel.github.com/dosage/issues and include at least the information below:

Not disclosing some of the information below due to privacy reasons is ok.
I will try to help you nonetheless, but you have to give me something
I can work with ;) .

<type 'exceptions.ValueError'> multiple comics found: GoComics/Garfield, GoComics/GarfieldMinusGarfield
Traceback (most recent call last):
File "dosage", line 314, in main
res = run(parser.parse_args())
File "dosage", line 205, in run
return getComics(options)
File "dosage", line 157, in getComics
for scraperobj in getScrapers(options.comic, options.basepath, options.adult, options.multimatch):
File "dosage", line 294, in getScrapers
for scraperclass in scraper.find_scraperclasses(name, multiple_allowed=multiple_allowed):
File "D:\Data\Dosage\dosagelib\scraper.py", line 222, in find_scraperclasses
raise ValueError('multiple comics found: %s' % comics)
ValueError: multiple comics found: GoComics/Garfield, GoComics/GarfieldMinusGarfield
System info:
Dosage 1.11
Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
Local time: 2013-03-11 15:42:57+002
sys.argv ['dosage', 'Garfield']

******** Dosage internal error, over and out ********
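
The report above comes down to a short name ("Garfield") that is also a prefix of another comic's name. One way to avoid the ambiguity is to prefer an exact match on the short name before falling back to substring matching. This is only a sketch: `find_matches` is a hypothetical helper, not the `find_scraperclasses` function from the traceback.

```python
def find_matches(query, names):
    """Return the names matching `query`, preferring exact matches.

    Entries look like 'GoComics/Garfield'; treating the part after the
    last '/' as the short name is an assumption made for this sketch.
    """
    query = query.lower()
    exact = [
        name for name in names
        if query in (name.lower(), name.lower().rsplit("/", 1)[-1])
    ]
    if exact:
        return exact
    # No exact hit: fall back to substring matching, which may be ambiguous.
    return [name for name in names if query in name.lower()]


names = ["GoComics/Garfield", "GoComics/GarfieldMinusGarfield"]
print(find_matches("Garfield", names))
```

With this ordering, only a query such as "garf" that matches neither name exactly would still be reported as ambiguous.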

3 new comics

Hi,

I discovered dosage a few days ago and I must say it's a great tool! Most of the comics I read were already in the database, but I wondered if it would be possible to add the comics from The Sharing Machine:
http://www.marriedtothesea.com/
http://www.nataliedee.com/
http://toothpastefordinner.com/

I already added Married to the Sea and Natalie Dee to my plugins with the following code:

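# Imports as used by the bundled modules in dosagelib/plugins
from re import compile, escape
from ..scraper import _BasicScraper
from ..util import tagre

class MarriedToTheSea(_BasicScraper):
    url = 'http://www.marriedtothesea.com/'
    rurl = escape(url)
    stripUrl = url + 'index.php?date=%s'
    firstStripUrl = stripUrl % '022806'
    # Strip image: an <img> whose src starts with the site URL
    imageSearch = compile(tagre("img", "src", r'?(%s\d+/[^"]+)' % rurl, before="overflow"))
    # Link to the previous strip
    prevSearch = compile(r'<a href="(.+?)">&lt;&lt; Yesterday</a>')
    help = 'Index format: mmddyy'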

I don't really know Python or regular expressions, I just modified some of the plugins already present (it seems to work, though...)

I also have a suggestion: could it be possible to add a time index to the downloaded files? Some comics, like those from The Sharing Machine, are not numbered and it's impossible to read them in sorted order.

Thanks!
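
The time-index suggestion above could look roughly like this: prefix each saved file with a sortable timestamp. `indexed_filename` and its parameters are hypothetical names for this sketch, not part of dosage.

```python
import time


def indexed_filename(filename, fetched_at=None):
    """Prefix `filename` with a sortable UTC timestamp (YYYYMMDD-HHMMSS)."""
    if fetched_at is None:
        fetched_at = time.time()
    stamp = time.strftime("%Y%m%d-%H%M%S", time.gmtime(fetched_at))
    return "%s_%s" % (stamp, filename)


# Fixed timestamp so the result is reproducible.
print(indexed_filename("bad-idea.gif", fetched_at=0))
```

Note that this orders files by download time. When strips are fetched newest-first, that is the reverse of publication order, so a date taken from the strip's own URL would be a better index where one is available.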

Fatal error on using -a with an index

I tried to fetch part of a comic using the following to avoid another bug:

dosage -a eeriecuties:limited-edition

but it crashed with the following error (on windows with 1.8):

********** Oops, I did it again. *************

You have found an internal error in Dosage. Please write a bug report at https://github.com/wummel/dosage/issues and include at least the information
below:

Not disclosing some of the information below due to privacy reasons is ok.
I will try to help you nonetheless, but you have to give me something
I can work with ;) .

<type 'exceptions.UnboundLocalError'> local variable 'msg' referenced before assignment
Traceback (most recent call last):
File "C:\Python27\Scripts\dosage", line 258, in main
res = run(options, comics)
File "C:\Python27\Scripts\dosage", line 157, in run
return getComics(options, comics)
File "C:\Python27\Scripts\dosage", line 108, in getComics
errors += getStrips(scraperobj, options)
File "C:\Python27\Scripts\dosage", line 130, in getStrips
for strip in strips:
File "C:\Python27\lib\site-packages\dosagelib\scraper.py", line 79, in getAllStrips
msg += "Retrieving %d strips for indexes %s" % (len(self.indexes), self.indexes)
UnboundLocalError: local variable 'msg' referenced before assignment
System info:
Dosage 1.8
Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
Local time: 2013-01-20 06:23:15+001
sys.argv ['C:\Python27\Scripts\dosage', '-a', 'eeriecuties:limited-edition']

******** Dosage internal error, over and out ********
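
The traceback above shows `msg +=` being executed before `msg` has been assigned. The following minimal reproduction shows the failure and one possible fix. The function names and the "all strips" branch are assumptions for illustration, since the surrounding code of `getAllStrips` is not shown in the report.

```python
def describe_broken(indexes):
    if not indexes:
        msg = "Retrieving all strips"
    else:
        # Bug: `+=` reads `msg` before any assignment on this branch.
        msg += "Retrieving %d strips for indexes %s" % (len(indexes), indexes)
    return msg


def describe_fixed(indexes):
    if indexes:
        # Plain assignment: there is nothing to append to yet.
        msg = "Retrieving %d strips for indexes %s" % (len(indexes), indexes)
    else:
        msg = "Retrieving all strips"
    return msg


try:
    describe_broken(["limited-edition"])
except UnboundLocalError as err:
    print("broken:", err)
print("fixed:", describe_fixed(["limited-edition"]))
```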

_Dosage_configdata.py[co] not recorded during install

Hello,

I do not know if this is a bug in distutils or in dosage's setup.py

When running python setup.py install --record foo -c -O1
I see that the following 3 files are installed:
/usr/local/lib/python2.7/site-packages/_Dosage_configdata.py
/usr/local/lib/python2.7/site-packages/_Dosage_configdata.pyc
/usr/local/lib/python2.7/site-packages/_Dosage_configdata.pyo

But only the first one is recorded in the file foo:
/usr/local/lib/python2.7/site-packages/_Dosage_configdata.py

LookingForGroup

I get the following errors (python dosage --adult -n 3 LookingForGroup, last one I got was 0664.jpg):

LookingForGroup> Retrieving 3 strips
LookingForGroup> ERROR: Patterns ['<\s*[iI][mM][gG]\s+(?:[^>]*\s+)?[sS][rR][cC]\s*=\s*"(http://cdn.lfgcomic.com/wp-content/uploads/[^"]+)"[^>]*[^>]*>'] not found at URL http://www.lfgcomic.com/page/668/.
LookingForGroup> ERROR: Patterns ['<\s*[iI][mM][gG]\s+(?:[^>]*\s+)?[sS][rR][cC]\s*=\s*"(http://cdn.lfgcomic.com/wp-content/uploads/[^"]+)"[^>]*[^>]*>'] not found at URL http://www.lfgcomic.com/page/667/.
LookingForGroup> ERROR: Patterns ['<\s*[iI][mM][gG]\s+(?:[^>]*\s+)?[sS][rR][cC]\s*=\s*"(http://cdn.lfgcomic.com/wp-content/uploads/[^"]+)"[^>]*[^>]*>'] not found at URL http://www.lfgcomic.com/page/666/.

EvilInc website structure changed

Hi,

It seems the EvilInc website has changed. I get warnings about multiple file matches, and dosage downloads the wrong pictures (such as ads for a ringtone).

Thx

PennyArcade no longer retrieving

Using dosage 2.13, Penny Arcade has failed to retrieve since March 10, with the following output.

speedster@whinge:~$ dosage -vv PennyArcade
MainThread> SingleInstance lockfile: /tmp/-usr-local-bin-dosage.lock
MainThread> Loading comic modules...
MainThread> ... 2071 modules loaded.
PennyArcade> Open URL http://penny-arcade.com/comic/
PennyArcade> Sending cookies <<class 'requests.cookies.RequestsCookieJar'>[]>
PennyArcade> Response cookies: <<class 'requests.cookies.RequestsCookieJar'>[<Cookie pmc_last_activity=1395282307 for penny-arcade.com/>, <Cookie pmc_last_visit=1079922307 for penny-arcade.com/>, <Cookie pmc_super_search_history=83783 for penny-arcade.com/>, <Cookie pmc_tracker=a%3A1%3A%7Bi%3A0%3Bs%3A5%3A%22comic%22%3B%7D for penny-arcade.com/>]>
PennyArcade> matched URL u'http://penny-arcade.com/comic/2014/03/17/pluro-singluarity' with pattern <\s*[aA]\s+[^>]*btnPrev[^>]*\s+[hH][rR][eE][fF]\s*=\s*"(http\:\/\/penny\-arcade\.com\/comic\/[^"]+)"[^>]*[^>]*>
PennyArcade> matched URL u'http://penny-arcade.com/comic/2014/03/17/pluro-singluarity' with pattern <\s*[aA]\s+[^>]*btnPrev[^>]*\s+[hH][rR][eE][fF]\s*=\s*"(http\:\/\/penny\-arcade\.com\/comic\/[^"]+)"[^>]*[^>]*>
PennyArcade> Open URL http://penny-arcade.com/comic/2014/03/17/pluro-singluarity
PennyArcade> Sending cookies <<class 'requests.cookies.RequestsCookieJar'>[<Cookie pmc_last_activity=1395282307 for penny-arcade.com/>, <Cookie pmc_last_visit=1079922307 for penny-arcade.com/>, <Cookie pmc_super_search_history=83783 for penny-arcade.com/>, <Cookie pmc_tracker=a%3A1%3A%7Bi%3A0%3Bs%3A5%3A%22comic%22%3B%7D for penny-arcade.com/>]>
PennyArcade> Response cookies: <<class 'requests.cookies.RequestsCookieJar'>[<Cookie pmc_last_activity=1395282207 for penny-arcade.com/>, <Cookie pmc_last_visit=1079922207 for penny-arcade.com/>, <Cookie pmc_super_search_history=197130 for penny-arcade.com/>, <Cookie pmc_tracker=a%3A1%3A%7Bi%3A0%3Bs%3A34%3A%22comic%2F2014%2F03%2F17%2Fpluro-singluarity%22%3B%7D for penny-arcade.com/>]>
PennyArcade> matched URL u'http://penny-arcade.com/comic/2014/03/19/titania' with pattern <\s*[aA]\s+[^>]*btnNext[^>]*\s+[hH][rR][eE][fF]\s*=\s*"(http\:\/\/penny\-arcade\.com\/comic\/[^"]+)"[^>]*[^>]*>
PennyArcade> matched URL u'http://penny-arcade.com/comic/2014/03/19/titania' with pattern <\s*[aA]\s+[^>]*btnNext[^>]*\s+[hH][rR][eE][fF]\s*=\s*"(http\:\/\/penny\-arcade\.com\/comic\/[^"]+)"[^>]*[^>]*>
PennyArcade> Retrieving 1 strip
PennyArcade> Get strip URL http://penny-arcade.com/comic/2014/03/19/titania
PennyArcade> Open URL http://penny-arcade.com/comic/2014/03/19/titania
PennyArcade> Sending cookies <<class 'requests.cookies.RequestsCookieJar'>[<Cookie pmc_last_activity=1395282207 for penny-arcade.com/>, <Cookie pmc_last_visit=1079922207 for penny-arcade.com/>, <Cookie pmc_super_search_history=197130 for penny-arcade.com/>, <Cookie pmc_tracker=a%3A1%3A%7Bi%3A0%3Bs%3A34%3A%22comic%2F2014%2F03%2F17%2Fpluro-singluarity%22%3B%7D for penny-arcade.com/>]>
PennyArcade> Response cookies: <<class 'requests.cookies.RequestsCookieJar'>[<Cookie pmc_last_activity=1395282090 for penny-arcade.com/>, <Cookie pmc_last_visit=1079922090 for penny-arcade.com/>, <Cookie pmc_super_search_history=625630 for penny-arcade.com/>, <Cookie pmc_tracker=a%3A1%3A%7Bi%3A0%3Bs%3A24%3A%22comic%2F2014%2F03%2F19%2Ftitania%22%3B%7D for penny-arcade.com/>]>
PennyArcade> matched URL u'http://art.penny-arcade.com/photos/i-5cwMmnj/0/1050x10000/i-5cwMmnj-1050x10000.jpg' with pattern <\s*[iI][mM][gG]\s+(?:[^>]*\s+)?[sS][rR][cC]\s*=\s*"(http://art\.penny-arcade\.com/photos/[^"]+)"[^>]*[^>]*>
PennyArcade> ERROR: invalid literal for int() with base 10: 'titania'
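
The final error in the log above suggests that the last segment of the strip URL was passed to `int()`, which fails now that the URLs end in a slug such as `titania`. A tolerant way to derive a file name is sketched below. `strip_basename` is a hypothetical helper, and the URL layout (`.../YYYY/MM/DD/<slug>` or a trailing number) is an assumption taken from the URLs in the log.

```python
def strip_basename(page_url):
    """Build a file basename from a strip page URL.

    Numeric last segments are zero-padded; otherwise the last four
    segments (assumed to be year, month, day, slug) are joined.
    """
    parts = page_url.rstrip("/").split("/")
    try:
        return "%04d" % int(parts[-1])
    except ValueError:
        # Not a number: fall back to date plus slug.
        return "-".join(parts[-4:])


print(strip_basename("http://penny-arcade.com/comic/2014/03/19/titania"))
```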

Pattern Errors

Running latest master code, received these errors.

ERROR: Patterns ['<\\s*[aA]\\s+(?:[^>]*\\s+)?[hH][rR][eE][fF]\\s*=\\s*"(http\\:\\/\\/abstrusegoose\\.com\\/\\d+)"[^>]*[^>]*>&laquo; Previous</a>'] not found at URL http://abstrusegoose.com/.
ERROR: Patterns ['<\\s*[iI][mM][gG]\\s+[^>]*strip[^>]*\\s+[sS][rR][cC]\\s*=\\s*"([^"]+/comics/[^"]+)"[^>]*[^>]*>'] not found at URL http://www.questionablecontent.net/.

Alt text for comics

Some comics (XKCD for example) have extra content (extra jokes) hidden in the alt text of the image. That content is currently lost when using dosage.

A solution to this would be adding alt text to the bottom of the image that is saved. Of course this should only be done for comics with this enabled.
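
Rendering the text into the image itself would need an imaging library. A simpler alternative, sketched here, is to save the text in a sidecar file next to the image. This is a different technique from the one proposed above, and `save_alt_text` is a hypothetical helper, not an existing dosage function.

```python
import os


def save_alt_text(image_path, text):
    """Write `text` to a .txt file with the same basename as the image."""
    txt_path = os.path.splitext(image_path)[0] + ".txt"
    with open(txt_path, "w", encoding="utf-8") as fp:
        fp.write(text)
    return txt_path
```

A per-comic switch could then decide whether the sidecar file is written at all, which matches the request to enable this only for comics that need it.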

The manual mentioned a -c option which doesn't exist

In the man page, under EXAMPLES there's this:

Retrieve Calvin and Hobbes strips from a given index going backwards to the beginning until an existing file is found:
dosage -c calvinandhobbes:20120722

but there is no -c option. Is this the catchup option and has it been removed?

ImportError

Hello,
I'm French, so please excuse my English. I tried to install dosage, but I get an error when I run "dosage --list":

Traceback (most recent call last):
File "/usr/local/bin/dosage", line 19, in <module>
from dosagelib import events, scraper
File "/usr/local/lib/python2.7/dist-packages/dosagelib/events.py", line 8, in <module>
from . import rss, util, configuration
File "/usr/local/lib/python2.7/dist-packages/dosagelib/util.py", line 8, in <module>
import requests
ImportError: No module named requests

I have the latest version of Python, and I installed dosage with: pip install dosage.

Thanks,

ptitlu

import _dosage_configdata as configdata

When I try to run Dosage I get the following error:
python dosage --adult -n 3 -b \Comics @
Traceback (most recent call last):
File "dosage", line 18, in <module>
from dosagelib import events, configuration, singleton, director
File "\Dosage\dosagelib\__init__.py", line 21, in <module>
from .configuration import Version as version
File "\Dosage\dosagelib\configuration.py", line 5, in <module>
import _dosage_configdata as configdata
ImportError: No module named _dosage_configdata

Pattern error with some strips

Hi,

With some strips I get a pattern error like this:

Oglaf> ERROR: Patterns ['<\\s*[iI][mM][gG]\\s+[^>]*strip[^>]*\\s+[sS][rR][cC]\\s*=\\s*"(/media-haha/comic/[^"]+)"[^>]*[^>]*>'] not found at URL http://oglaf.com/.
Oglaf> WARN: Patterns ['<\\s*[aA]\\s+(?:[^>]*\\s+)?[hH][rR][eE][fF]\\s*=\\s*"([^"]+)"[^>]*[^>]*><\\s*[dD][iI][vV]\\s+(?:[^>]*\\s+)?[iI][dD]\\s*=\\s*"pvs"[^>]*[^>]*>'] not found at URL http://oglaf.com/. Assuming no previous comic strips exist.

Why does this happen?

Thank you.

dosage -c only grabs current page

At least for the GirlGenius comic, dosage -c only grabs the current strip, despite being several days behind. dosage -a still goes back through the whole list, and will grab any missing pages while skipping existing ones.
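
For reference, the catch-up behaviour expected here (walk backwards from the newest strip and stop at the first one already on disk) can be sketched as follows. `catch_up` and its arguments are hypothetical and only illustrate the intended loop, not dosage's implementation.

```python
def catch_up(strips_newest_first, already_have):
    """Yield strips to download, stopping at the first existing one."""
    for strip in strips_newest_first:
        if strip in already_have:
            break
        yield strip


# Two strips are missing, so both should be yielded before the loop stops.
missing = list(catch_up(["d5", "d4", "d3", "d2"], {"d3", "d1"}))
print(missing)
```

The report above would correspond to this loop stopping after the first strip even though the second one is not yet on disk.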

ImportError: No module named email.utils

When running a freshly installed dosage 2.15 (installed from the .exe) with a fresh installation of Python 3.4.1 and the python-requests module installed, I get the following error:

Traceback (most recent call last):
  File "dosage", line 18, in <module>
  File "dosagelib\events.pyo", line 11, in <module>
  File "dosagelib\util.pyo", line 17, in <module>
  File "requests\__init__.pyo", line 58, in <module>
  File "requests\utils.pyo", line 25, in <module>
  File "requests\compat.pyo", line 7, in <module>
  File "requests\packages\__init__.pyo", line 3, in <module>
  File "requests\packages\urllib3\__init__.pyo", line 16, in <module>
  File "requests\packages\urllib3\connectionpool.pyo", line 39, in <module>
  File "requests\packages\urllib3\request.pyo", line 12, in <module>
  File "requests\packages\urllib3\filepost.pyo", line 15, in <module>
  File "requests\packages\urllib3\fields.pyo", line 7, in <module>
ImportError: No module named email.utils

When trying to uninstall 2.15 I get the following error:

[screenshot: dosage_uninstall_error]

dosage -l error

Not disclosing some of the information below due to privacy reasons is ok.
I will try to help you nonetheless, but you have to give me something
I can work with ;) .

<type 'exceptions.TypeError'> unicode argument expected, got 'str'
Traceback (most recent call last):
File "/usr/local/bin/dosage", line 395, in main
res = run(parser.parse_args())
File "/usr/local/bin/dosage", line 262, in run
return doList()
File "/usr/local/bin/dosage", line 282, in doList
out.info('Available comic scrapers:')
File "/usr/local/lib/python2.7/dist-packages/dosagelib/output.py", line 29, in info
self.write(s, level=level)
File "/usr/local/lib/python2.7/dist-packages/dosagelib/output.py", line 64, in write
self.stream.write('%s' % s, color=color)
File "/usr/local/lib/python2.7/dist-packages/dosagelib/ansicolor.py", line 253, in _write
self.fp.write(text)
TypeError: unicode argument expected, got 'str'
System info:
Dosage 2.0
Python 2.7.3 (default, Jan 2 2013, 13:56:14)
[GCC 4.7.2] on linux2
Local time: 2013-04-11 22:36:33+002
sys.argv ['/usr/local/bin/dosage', '-l']
LANG = 'fr_FR.UTF-8'

Sometimes, comic files are saved with 0 bytes

I don't know if this is a bug in Dosage or Requests, but sometimes my downloaded comic files end up as 0-byte files.

Here is a log of one of those downloads:

22:37:08 Namesake> Open URL http://namesakecomic.com/comic/hell-wont-get-back-to-that-book-for-awhile/
22:37:08 Namesake> Sending headers {'User-Agent': u'Mozilla/5.0 (compatible; Dosage/1.7; +https://github.com/wummel/dosage)'}
22:37:15 Namesake> Open URL http://namesakecomic.com/comic/hell-wont-get-back-to-that-book-for-awhile/
22:37:15 Namesake> Sending headers {'User-Agent': u'Mozilla/5.0 (compatible; Dosage/1.7; +https://github.com/wummel/dosage)'}
22:37:16 Namesake> matched URL u'http://namesakecomic.com/wp-content/uploads/2012/12/Chapter12_06.jpg' with pattern <\s*[iI][mM][gG]\s+(?:[^>]*\s+)?[sS][rR][cC]\s*=\s*"([^"]*/wp-content/uploads/[^"]+)"[^>]*title=[^>]*>
22:37:16 Namesake> Open URL http://namesakecomic.com/wp-content/uploads/2012/12/Chapter12_06.jpg
22:37:16 Namesake> Sending headers {'Referer': 'http://namesakecomic.com/comic/hell-wont-get-back-to-that-book-for-awhile/', 'User-Agent': u'Mozilla/5.0 (compatible; Dosage/1.7; +https://github.com/wummel/dosage)'}
22:37:23 Namesake> ... filename = '2012-12-Chapter12_06', ext = '.jpg', contentLength = 404768
22:37:23 Namesake> Writing comic to file Comics/Namesake/2012-12-Chapter12_06.jpg...
22:37:23 Namesake> Saved Comics/Namesake/2012-12-Chapter12_06.jpg (0B).

But no error and Dosage just continues as if nothing has happened :(

(PS: Pull request for that comic is pending 😉)
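
In the log above the reported content length is 404768 bytes while the saved file is 0 bytes, and nothing flags the mismatch. A size check after writing would at least surface the problem. This is a sketch under assumptions: `write_and_verify` is a hypothetical helper, and where the empty body comes from is not diagnosed here.

```python
import os


def write_and_verify(path, data, expected_length=None):
    """Write `data` to `path` and fail loudly on an empty or short file."""
    with open(path, "wb") as fp:
        fp.write(data)
    size = os.path.getsize(path)
    if size == 0 or (expected_length is not None and size != expected_length):
        # Remove the bad file so a later run does not skip it as existing.
        os.remove(path)
        raise IOError(
            "bad download for %s: got %d bytes, expected %s"
            % (path, size, expected_length)
        )
    return size
```

Removing the bad file matters because an existing file is otherwise skipped on the next run.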

Not sure how to create a new plugin

I have been looking over the plugins that came with dosage, and have not figured out the proper way to create a new one. I am wanting to create a new plugin for "Alien Dice" at http://aliendice.com. Can someone give me a little direction, or point me toward some documentation on proper plugin creation?

Thank you.

"dosage -l" does not work anymore

It seems the changes to argument parsing broke all arguments that don't need a comic.

So I have to do:

dosage -l @
dosage --version @

etc.

An Eerie Cuties strip produces a fatal error, likely due to a unicode url

When downloading Eerie Cuties with the -a option, it fails at http://www.eeriecuties.com/strips-ec/you_like_them_too%EF%BC%9F

The character ？ (a full-width question mark, %EF%BC%9F in the URL) seems to be the problem. This hits a couple of other comics I added plugins for: Menage a 3 and Magick Chicks also have this character on one page. (I used Eerie Cuties as a template for creating those two.)

This is on windows.

EerieCuties> Skipping existing file "Comics\EerieCuties\ec20110214.png".
C:\Python27\lib\urllib.py:1268: UnicodeWarning: Unicode equal comparison failed to convert both arguments to Unicode - interpreting them as being unequal
return ''.join(map(quoter, s))

********** Oops, I did it again. *************

You have found an internal error in Dosage. Please write a bug report
at https://github.com/wummel/dosage/issues and include at least the information below:

Not disclosing some of the information below due to privacy reasons is ok.
I will try to help you nonetheless, but you have to give me something
I can work with ;) .

<type 'exceptions.KeyError'> u'\xef'
Traceback (most recent call last):
File "C:\Python27\Scripts\dosage", line 258, in main
res = run(options, comics)
File "C:\Python27\Scripts\dosage", line 157, in run
return getComics(options, comics)
File "C:\Python27\Scripts\dosage", line 108, in getComics
errors += getStrips(scraperobj, options)
File "C:\Python27\Scripts\dosage", line 130, in getStrips
for strip in strips:
File "C:\Python27\lib\site-packages\dosagelib\scraper.py", line 92, in getAllStrips
for strip in self.getStripsFor(url, maxstrips):
File "C:\Python27\lib\site-packages\dosagelib\scraper.py", line 101, in getStripsFor
self.prevSearch, session=self.session)
File "C:\Python27\lib\site-packages\dosagelib\util.py", line 125, in fetchUrls
data, baseUrl = getPageContent(url, session=session)
File "C:\Python27\lib\site-packages\dosagelib\util.py", line 90, in getPageContent
check_robotstxt(url)
File "C:\Python27\lib\site-packages\dosagelib\util.py", line 210, in check_robotstxt
if not rp.can_fetch(UserAgent, url):
File "C:\Python27\lib\robotparser.py", line 139, in can_fetch
url = urllib.quote(url)
File "C:\Python27\lib\urllib.py", line 1268, in quote
return ''.join(map(quoter, s))
KeyError: u'\xef'
System info:
Dosage 1.8
Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
Local time: 2013-01-20 05:53:53+001
sys.argv ['C:\Python27\Scripts\dosage', '-a', 'eeriecuties']

******** Dosage internal error, over and out ********
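
The `%EF%BC%9F` at the end of the failing URL is the UTF-8 encoding of a full-width question mark (U+FF1F). The traceback shows Python 2.7's `urllib.quote` failing on the unicode string; a commonly used workaround there is to encode the URL to UTF-8 bytes before quoting. The snippet below uses Python 3's `urllib.parse` only to show the round trip and is not a patch for dosage.

```python
from urllib.parse import quote, unquote

path = "/strips-ec/you_like_them_too%EF%BC%9F"

# Decode the percent-escapes to see the offending character.
decoded = unquote(path)
print(repr(decoded[-1]))

# Quoting the decoded text as UTF-8 gives back the original path.
requoted = quote(decoded, safe="/")
print(requoted == path)
```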

List not redirectable

Hello, what is the intended way of browsing the list of dosage -l ?
It works when I just run it alone and scrolls by rapidly, but this is what I get when I try to capture it (windows 7 command line, dosage 1.12):

D:\Users\tolgyesipeter>dosage -l | more
Traceback (most recent call last):
File "dosage", line 13, in <module>
File "dosagelib\events.pyo", line 6, in <module>
File "dosagelib\util.pyo", line 18, in <module>
File "dosagelib\output.pyo", line 8, in <module>
File "dosagelib\ansicolor.pyo", line 234, in <module>
File "dosagelib\colorama.pyo", line 122, in init
File "dosagelib\colorama.pyo", line 89, in GetConsoleScreenBufferInfo
WindowsError: [Error 6] A leíró érvénytelen.

D:\Users\tolgyesipeter>dosage -l>dosage-list.txt
Traceback (most recent call last):
File "dosage", line 13, in <module>
File "dosagelib\events.pyo", line 6, in <module>
File "dosagelib\util.pyo", line 18, in <module>
File "dosagelib\output.pyo", line 8, in <module>
File "dosagelib\ansicolor.pyo", line 234, in <module>
File "dosagelib\colorama.pyo", line 122, in init
File "dosagelib\colorama.pyo", line 89, in GetConsoleScreenBufferInfo
WindowsError: [Error 6] A leíró érvénytelen.

The WindowsError says Invalid descriptor.
Is this not a command line app?
I get the same error with --singlelist.

Peter
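
Both tracebacks above fail while the console color support is being initialized, which happens when output goes to a pipe or a file instead of a console. A common guard is to enable color handling only when the stream is a terminal. This is a sketch: `use_color` is a hypothetical helper, not dosage's code.

```python
import sys


def use_color(stream=None):
    """Return True only if `stream` is an interactive terminal."""
    if stream is None:
        stream = sys.stdout
    isatty = getattr(stream, "isatty", None)
    return bool(isatty and isatty())


if use_color():
    print("console detected: color output could be enabled")
else:
    print("redirected output: plain text only")
```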

ImportError: No module named requests

I'm having a problem. I installed dosage for Windows, as well as Python 2.7.6 and the requests module. When I try to invoke dosage in a command shell, I get this error:

Traceback (most recent call last):
File "dosage", line 28, in <module>
File "dosagelib\events.pyo", line 11, in <module>
File "dosagelib\util.pyo", line 17, in <module>
ImportError: No module named requests

I'm running Windows 8. Am I doing something completely wrong? Any help would be much appreciated.

ERROR: Patterns not found

Tried this for both "endtown" and "GoComics/Endtown":

C:\Windows\system32>dosage endtown
GoComics/Endtown> ERROR: Patterns ['<\s*[aA]\s+(?:[^>]*\s+)?[hH][rR][eE][fF]\s*=\s*"(/[^"]+/\d+/\d+/\d+)"[^>]*prev[^>]*>'] not found at URL http://www.gocomics.com/endtown.

Most other sources are fine, so it might be the site that's the issue.

New features for events?

Hi,

Since some comics use pretty unpredictable file names, I wanted to enhance Dosage with something to preserve the comic order. My first idea is an event listener which creates a simple JSON dump of all the information Dosage gathers while it runs. See https://github.com/TobiX/dosage/commits/event-rework for a first implementation.

Since the comicDownloaded event is currently only fired on the first successful download (not if I do a "recheck" via "dosage -a"), you lose information if you forget "-o json" on even one run.

I propose the following additional changes:

  • Allow multiple event listeners (-o rss,json) - My current idea for this would make events.py pretty ugly...
  • Always enable all listeners (I don't know if this would be popular with users)
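
The first proposal above (several listeners selected with something like "-o rss,json") could be handled by a small fan-out object that forwards each event to every selected handler. All class and function names below are hypothetical, and the single `comic_downloaded` method stands in for whatever events the real handlers receive.

```python
class RecordingHandler:
    """Stand-in for a real listener; it only records what it is told."""

    def __init__(self):
        self.seen = []

    def comic_downloaded(self, comic, filename):
        self.seen.append((comic, filename))


class MultiHandler:
    """Forward every event to each wrapped handler in turn."""

    def __init__(self, handlers):
        self.handlers = list(handlers)

    def comic_downloaded(self, comic, filename):
        for handler in self.handlers:
            handler.comic_downloaded(comic, filename)


def handlers_from_option(spec, registry):
    """Turn a comma-separated option value into a MultiHandler."""
    names = [name.strip() for name in spec.split(",") if name.strip()]
    unknown = [name for name in names if name not in registry]
    if unknown:
        raise ValueError("unknown event handler(s): %s" % ", ".join(unknown))
    return MultiHandler(registry[name]() for name in names)


registry = {"rss": RecordingHandler, "json": RecordingHandler}
multi = handlers_from_option("rss,json", registry)
multi.comic_downloaded("Namesake", "2012-12-Chapter12_06.jpg")
```

Keeping the fan-out in one wrapper means the calling code still talks to a single handler object, which may limit how much the events module itself has to change.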

Dilbert

It downloads the latest strip as dilbert.com.jpg but does not rename it to the date of the comic. It then tries to download the next strip but stops because a file with that name (dilbert.com.jpg) already exists.

Detect end-of-life comics

Some comics are definitely end-of-life and there should be an option or a tag to say "this comic is completely downloaded, do not crawl this anymore".
Ideas:

  • first, mark comics that are end-of-life in the source, eg. endOfLife = True
  • then, after a successful (ie. no errors) run with "--all" without indexes, set a marker in the downloaded comic directory.
  • let the marker be easily added/deleted manually (not necessarily with dosage, but with a file editor or system tools). The marker could be a single file "complete.txt" or something similar.
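
The marker idea above can be sketched with two small helpers. The file name `complete.txt` comes from the proposal, while the function names and the `end_of_life` argument are assumptions for illustration.

```python
import os

MARKER = "complete.txt"


def mark_complete(comic_dir):
    """Drop the marker after an error-free full run."""
    with open(os.path.join(comic_dir, MARKER), "w") as fp:
        fp.write("All strips downloaded; delete this file to crawl again.\n")


def should_crawl(comic_dir, end_of_life):
    """Skip only comics that are end-of-life and already marked complete."""
    if not end_of_life:
        return True
    return not os.path.exists(os.path.join(comic_dir, MARKER))
```

Because the marker is an ordinary text file, deleting it by hand is enough to force a new crawl.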

Looking For Group

It seems the LFG scraper is gathering thumbnails instead of the larger images.

Android version

Were there any plans for an Android version so you could read comics on the go?
If yes, can I help? If no, will you help me if I do it?

xkcd - pulls incorrect alt-text

It is pulling the title from the store image (first image on the page) instead of the comic image.

title="the xkcd store"
instead of
title="Theory: Smugness is proportional to the negative second derivative of TV ownership rate with respect to time."
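
One way to avoid picking up the store image's title is to read the title from the same `<img>` tag whose `src` matched the comic image. The sketch below uses the standard-library HTML parser. The helper names and the sample URLs in the usage lines are hypothetical and only illustrate the selection logic.

```python
from html.parser import HTMLParser


class ImgCollector(HTMLParser):
    """Collect (src, title-or-alt) pairs for every <img> tag."""

    def __init__(self):
        super().__init__()
        self.images = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            attr = dict(attrs)
            self.images.append((attr.get("src"), attr.get("title") or attr.get("alt")))


def text_for_image(html, src_fragment):
    """Return the title/alt text of the first image whose src matches."""
    collector = ImgCollector()
    collector.feed(html)
    for src, text in collector.images:
        if src and src_fragment in src:
            return text
    return None


page = (
    '<img src="http://example.invalid/store/banner.png" title="the xkcd store">'
    '<img src="http://example.invalid/comics/strip.png" title="comic hover text">'
)
print(text_for_image(page, "/comics/"))
```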

New comics

Can you add FullFrontalNerdity, Geeks Next Door and AntiHero for Hire?

Not downloading all Oglaf comics

When I try to download all comics from the Oglaf website, it doesn't download them all. I think it's because the website is organized as stories with multiple pages each, and dosage only gets the first page of each story.

Create continuous list

Could you add a parameter to write the dosage comic list to a text file, or make the 'more' paging optional?
