wummel / dosage


dosage is a comic strip downloader and archiver

Home Page: http://wummel.github.io/dosage/

License: MIT License

Makefile 0.93% Python 98.73% Shell 0.19% Batchfile 0.16%

dosage's Introduction

Dosage

Dosage is a comic strip downloader and archiver.

You have reached the old repository of Dosage.

Please go to https://github.com/webcomics/dosage or http://dosage.rocks/ for the new location of the project.

If you still have an old clone, you can switch to the new location like this:

git remote set-url origin https://github.com/webcomics/dosage.git

If you have already updated to the last commit on this repository, you can get rid of this commit by doing:

git reset --hard HEAD^

And then updating to the latest revision on the new repository.

Dosage is designed to keep a local copy of specific webcomics and other picture-based content such as Picture of the Day sites. With the dosage command-line script you can get the latest strip of a webcomic, catch up to the last strip downloaded, or download a strip for a particular date or index (if the webcomic's site layout allows this).

Multiple webcomics can be downloaded in parallel, making the update of comic strips faster.

See http://dosage.rocks/ for more info.


dosage's People

Contributors

Stargazers

Watchers

Forkers

johna513 shartge dromaludaire mbrandis faldrian null000 eyefox47 sehrgut genba bigyesh carlosefonseca rpglover64 pombredanne serenitas50 puba

dosage's Issues

Fokke & Sukke

Could you add http://foksuk.nl/??

multiple comics found

python dosage Garfield

********** Oops, I did it again. *************

You have found an internal error in Dosage. Please write a bug report
at http://wummel.github.com/dosage/issues and include at least the information below:

Not disclosing some of the information below due to privacy reasons is ok.
I will try to help you nonetheless, but you have to give me something
I can work with ;) .

<type 'exceptions.ValueError'> multiple comics found: GoComics/Garfield, GoComics/GarfieldMinusGarfield
Traceback (most recent call last):
File "dosage", line 314, in main
res = run(parser.parse_args())
File "dosage", line 205, in run
return getComics(options)
File "dosage", line 157, in getComics
for scraperobj in getScrapers(options.comic, options.basepath, options.adult, options.multimatch):
File "dosage", line 294, in getScrapers
for scraperclass in scraper.find_scraperclasses(name, multiple_allowed=multiple_allowed):
File "D:\Data\Dosage\dosagelib\scraper.py", line 222, in find_scraperclasses
raise ValueError('multiple comics found: %s' % comics)
ValueError: multiple comics found: GoComics/Garfield, GoComics/GarfieldMinusGarfield
System info:
Dosage 1.11
Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
Local time: 2013-03-11 15:42:57+002
sys.argv ['dosage', 'Garfield']

******** Dosage internal error, over and out ********
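One way to avoid this ambiguity is to let an exact name match win before falling back to substring matching. The following is only a minimal sketch of that idea: `find_matches` and its argument names are hypothetical and are not Dosage's actual `find_scraperclasses()`.

```python
def find_matches(query, names):
    """Return scraper names matching `query`, preferring an exact match.

    `names` holds full names such as 'GoComics/Garfield'.
    Hypothetical helper, not part of Dosage.
    """
    q = query.lower()
    # An exact match on the full name, or on the part after the last '/',
    # wins outright, so 'Garfield' no longer collides with
    # 'GarfieldMinusGarfield'.
    exact = [n for n in names
             if n.lower() == q or n.lower().rsplit("/", 1)[-1] == q]
    if exact:
        return exact
    # Otherwise fall back to substring matching, which may return several names.
    return [n for n in names if q in n.lower()]


names = ["GoComics/Garfield", "GoComics/GarfieldMinusGarfield"]
print(find_matches("Garfield", names))
```

With this ordering, a partial name such as `garf` would still report both comics, so the "multiple comics found" error remains useful for genuinely ambiguous input.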

Change output path

Hi,

Could you add a parameter to change the output path ?

Thx.

3 new comics

Hi,

I discovered dosage a few days ago and I must say it's a great tool! Most of the comics I read were already in the database, but I wondered if it would be possible to add the comics from The Sharing Machine:
http://www.marriedtothesea.com/
http://www.nataliedee.com/
http://toothpastefordinner.com/

I already added Married to the Sea and Natalie Dee to my plugins with the following code:

# Imports assumed from the bundled plugin modules; adjust to your Dosage version.
from re import compile, escape

from ..scraper import _BasicScraper
from ..util import tagre


class MarriedToTheSea(_BasicScraper):
    url = 'http://www.marriedtothesea.com/'
    rurl = escape(url)
    stripUrl = url + 'index.php?date=%s'
    firstStripUrl = stripUrl % '022806'
    imageSearch = compile(tagre("img", "src", r'?(%s\d+/[^"]+)' % rurl, before="overflow"))
    prevSearch = compile(r'<a href="(.+?)">&lt;&lt; Yesterday</a>')
    help = 'Index format: mmddyy'

I don't really know Python or regular expressions, I just modified some of the plugins already present (it seems to work, though...)

I also have a suggestion: could it be possible to add a time index to the downloaded files? Some comics, like those from The Sharing Machine, are not numbered and it's impossible to read them in sorted order.

Thanks!
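The time index suggested above could look roughly like this. It is a sketch only: `indexed_name` and the prefix format are assumptions, not an existing Dosage feature. Note that a download timestamp sorts by download order, which matches publication order only if strips are fetched oldest first.

```python
import os
import time


def indexed_name(filename, timestamp=None):
    """Prefix `filename` with a sortable UTC timestamp (hypothetical scheme)."""
    if timestamp is None:
        timestamp = time.time()
    # YYYYMMDD-HHMMSS sorts correctly as plain text.
    prefix = time.strftime("%Y%m%d-%H%M%S", time.gmtime(timestamp))
    return "%s_%s" % (prefix, os.path.basename(filename))


print(indexed_name("strip.gif", 0))
```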

Fatal error on using -a with an index

I tried to fetch part of a comic using the following to avoid another bug:

dosage -a eeriecuties:limited-edition

but it crashed with the following error (on windows with 1.8):

********** Oops, I did it again. *************

You have found an internal error in Dosage. Please write a bug report at https://github.com/wummel/dosage/issues and include at least the information
below:

Not disclosing some of the information below due to privacy reasons is ok.
I will try to help you nonetheless, but you have to give me something
I can work with ;) .

<type 'exceptions.UnboundLocalError'> local variable 'msg' referenced before assignment
Traceback (most recent call last):
File "C:\Python27\Scripts\dosage", line 258, in main
res = run(options, comics)
File "C:\Python27\Scripts\dosage", line 157, in run
return getComics(options, comics)
File "C:\Python27\Scripts\dosage", line 108, in getComics
errors += getStrips(scraperobj, options)
File "C:\Python27\Scripts\dosage", line 130, in getStrips
for strip in strips:
File "C:\Python27\lib\site-packages\dosagelib\scraper.py", line 79, in getAllStrips
msg += "Retrieving %d strips for indexes %s" % (len(self.indexes), self.indexes)
UnboundLocalError: local variable 'msg' referenced before assignment
System info:
Dosage 1.8
Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
Local time: 2013-01-20 06:23:15+001
sys.argv ['C:\Python27\Scripts\dosage', '-a', 'eeriecuties:limited-edition']

******** Dosage internal error, over and out ********
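The traceback points at a `msg +=` on a local variable that was never assigned. The sketch below reproduces that class of error and shows the usual fix. Both functions are hypothetical stand-ins and not the actual code in `scraper.py`.

```python
def broken_message(indexes):
    # 'msg' is never assigned before '+=', so this raises UnboundLocalError:
    # augmented assignment reads the variable before writing it.
    msg += "Retrieving %d strips for indexes %s" % (len(indexes), indexes)
    return msg


def fixed_message(indexes):
    # Initialise the local first, then append to it.
    msg = ""
    if indexes:
        msg += "Retrieving %d strips for indexes %s" % (len(indexes), indexes)
    else:
        msg += "Retrieving all strips"
    return msg


print(fixed_message(["limited-edition"]))
```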

Could not find DieFruehreifen

./dosage DieFruehreifen
DieFruehreifen> Retrieving 1 strip
DieFruehreifen> ERROR: Patterns ['<\s*[iI][mM][gG]\s+(?:[^>]*\s+)?[sS][rR][cC]\s*=\s*"(strips/[F,f]rueh[]?[S,s]trip\d+.jpg)"[^>]*[^>]*>'] not found at URL http://www.die-fruehreifen.de/index.php.

dosage crashes when starting it twice with -o html

It crashes because the previously created HTML file already exists. Could dosage instead update the HTML file with newly found comics from the same day?

_Dosage_configdata.py[co] not recorded during install

Hello,

I do not know if this is a bug in distutils or in dosage's setup.py

When running python setup.py install --record foo -c -O1
I see that the following 3 files are installed:
/usr/local/lib/python2.7/site-packages/_Dosage_configdata.py
/usr/local/lib/python2.7/site-packages/_Dosage_configdata.pyc
/usr/local/lib/python2.7/site-packages/_Dosage_configdata.pyo

But only the first one is recorded in the file foo:
/usr/local/lib/python2.7/site-packages/_Dosage_configdata.py

LookingForGroup

I get the following errors (python dosage --adult -n 3 LookingForGroup, last one I got was 0664.jpg):

LookingForGroup> Retrieving 3 strips
LookingForGroup> ERROR: Patterns ['<\s*[iI][mM][gG]\s+(?:[^>]*\s+)?[sS][rR][cC]\s*=\s*"(http://cdn.lfgcomic.com/wp-content/uploads/[^"]+)"[^>]*[^>]*>'] not found at URL http://www.lfgcomic.com/page/668/.
LookingForGroup> ERROR: Patterns ['<\s*[iI][mM][gG]\s+(?:[^>]*\s+)?[sS][rR][cC]\s*=\s*"(http://cdn.lfgcomic.com/wp-content/uploads/[^"]+)"[^>]*[^>]*>'] not found at URL http://www.lfgcomic.com/page/667/.
LookingForGroup> ERROR: Patterns ['<\s*[iI][mM][gG]\s+(?:[^>]*\s+)?[sS][rR][cC]\s*=\s*"(http://cdn.lfgcomic.com/wp-content/uploads/[^"]+)"[^>]*[^>]*>'] not found at URL http://www.lfgcomic.com/page/666/.

LookingForGroup broken

LookingForGroup> ERROR: Patterns ['<\s*[aA]\s+(?:[^>]*\s+)?[hH][rR][eE][fF]\s*=\s*"(http://www.lfgcomic.com/page/[-0-9]+/)"[^>]*feature-previous[^>]*>'] not found at URL http://www.lfgcomic.com/.

EvilInc website structure changed

Hi,

It seems the EvilInc website has changed. I get warnings about multiple file matches, and it downloads the wrong pictures (such as ads for a ringtone).

Thx

the doghouse diaries

Could you add this??
(http://thedoghousediaries.com)

stops if no image is found

With Cyanide & Happiness it stops at 2307, because there's a movie (http://www.explosm.net/comics/2307/), and it won't go on to 2306.
Is it possible to add a parameter to make it continue?
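The requested behaviour amounts to recording a page without an image and moving on instead of aborting. A minimal sketch, assuming pages are already fetched as `(url, html)` pairs; `collect_images` and the simplified `IMG_SRC` pattern are hypothetical and much looser than the patterns Dosage generates.

```python
import re

# Simplified image pattern for illustration only.
IMG_SRC = re.compile(r'<img[^>]+src="([^"]+)"', re.IGNORECASE)


def collect_images(pages):
    """Return (found, skipped): image URLs per page, and pages with no image."""
    found, skipped = [], []
    for url, html in pages:
        match = IMG_SRC.search(html)
        if match is None:
            # No image on this page (for example a movie): note it and go on.
            skipped.append(url)
            continue
        found.append((url, match.group(1)))
    return found, skipped


pages = [
    ("http://example.invalid/comics/2308/", '<img src="/db/2308.png">'),
    ("http://example.invalid/comics/2307/", '<embed src="/movie.swf">'),
    ("http://example.invalid/comics/2306/", '<img src="/db/2306.png">'),
]
found, skipped = collect_images(pages)
print(found, skipped)
```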

PennyArcade no longer retrieving

Using dosage 2.13, Penny Arcade has failed to retrieve since March 10, with the following output.

speedster@whinge:~$ dosage -vv PennyArcade
MainThread> SingleInstance lockfile: /tmp/-usr-local-bin-dosage.lock
MainThread> Loading comic modules...
MainThread> ... 2071 modules loaded.
PennyArcade> Open URL http://penny-arcade.com/comic/
PennyArcade> Sending cookies <<class 'requests.cookies.RequestsCookieJar'>[]>
PennyArcade> Response cookies: <<class 'requests.cookies.RequestsCookieJar'>[<Cookie pmc_last_activity=1395282307 for penny-arcade.com/>, <Cookie pmc_last_visit=1079922307 for penny-arcade.com/>, <Cookie pmc_super_search_history=83783 for penny-arcade.com/>, <Cookie pmc_tracker=a%3A1%3A%7Bi%3A0%3Bs%3A5%3A%22comic%22%3B%7D for penny-arcade.com/>]>
PennyArcade> matched URL u'http://penny-arcade.com/comic/2014/03/17/pluro-singluarity' with pattern <\s*[aA]\s+[^>]*btnPrev[^>]*\s+[hH][rR][eE][fF]\s*=\s*"(http\:\/\/penny\-arcade\.com\/comic\/[^"]+)"[^>]*[^>]*>
PennyArcade> matched URL u'http://penny-arcade.com/comic/2014/03/17/pluro-singluarity' with pattern <\s*[aA]\s+[^>]*btnPrev[^>]*\s+[hH][rR][eE][fF]\s*=\s*"(http\:\/\/penny\-arcade\.com\/comic\/[^"]+)"[^>]*[^>]*>
PennyArcade> Open URL http://penny-arcade.com/comic/2014/03/17/pluro-singluarity
PennyArcade> Sending cookies <<class 'requests.cookies.RequestsCookieJar'>[<Cookie pmc_last_activity=1395282307 for penny-arcade.com/>, <Cookie pmc_last_visit=1079922307 for penny-arcade.com/>, <Cookie pmc_super_search_history=83783 for penny-arcade.com/>, <Cookie pmc_tracker=a%3A1%3A%7Bi%3A0%3Bs%3A5%3A%22comic%22%3B%7D for penny-arcade.com/>]>
PennyArcade> Response cookies: <<class 'requests.cookies.RequestsCookieJar'>[<Cookie pmc_last_activity=1395282207 for penny-arcade.com/>, <Cookie pmc_last_visit=1079922207 for penny-arcade.com/>, <Cookie pmc_super_search_history=197130 for penny-arcade.com/>, <Cookie pmc_tracker=a%3A1%3A%7Bi%3A0%3Bs%3A34%3A%22comic%2F2014%2F03%2F17%2Fpluro-singluarity%22%3B%7D for penny-arcade.com/>]>
PennyArcade> matched URL u'http://penny-arcade.com/comic/2014/03/19/titania' with pattern <\s*[aA]\s+[^>]*btnNext[^>]*\s+[hH][rR][eE][fF]\s*=\s*"(http\:\/\/penny\-arcade\.com\/comic\/[^"]+)"[^>]*[^>]*>
PennyArcade> matched URL u'http://penny-arcade.com/comic/2014/03/19/titania' with pattern <\s*[aA]\s+[^>]*btnNext[^>]*\s+[hH][rR][eE][fF]\s*=\s*"(http\:\/\/penny\-arcade\.com\/comic\/[^"]+)"[^>]*[^>]*>
PennyArcade> Retrieving 1 strip
PennyArcade> Get strip URL http://penny-arcade.com/comic/2014/03/19/titania
PennyArcade> Open URL http://penny-arcade.com/comic/2014/03/19/titania
PennyArcade> Sending cookies <<class 'requests.cookies.RequestsCookieJar'>[<Cookie pmc_last_activity=1395282207 for penny-arcade.com/>, <Cookie pmc_last_visit=1079922207 for penny-arcade.com/>, <Cookie pmc_super_search_history=197130 for penny-arcade.com/>, <Cookie pmc_tracker=a%3A1%3A%7Bi%3A0%3Bs%3A34%3A%22comic%2F2014%2F03%2F17%2Fpluro-singluarity%22%3B%7D for penny-arcade.com/>]>
PennyArcade> Response cookies: <<class 'requests.cookies.RequestsCookieJar'>[<Cookie pmc_last_activity=1395282090 for penny-arcade.com/>, <Cookie pmc_last_visit=1079922090 for penny-arcade.com/>, <Cookie pmc_super_search_history=625630 for penny-arcade.com/>, <Cookie pmc_tracker=a%3A1%3A%7Bi%3A0%3Bs%3A24%3A%22comic%2F2014%2F03%2F19%2Ftitania%22%3B%7D for penny-arcade.com/>]>
PennyArcade> matched URL u'http://art.penny-arcade.com/photos/i-5cwMmnj/0/1050x10000/i-5cwMmnj-1050x10000.jpg' with pattern <\s*[iI][mM][gG]\s+(?:[^>]*\s+)?[sS][rR][cC]\s*=\s*"(http://art\.penny-arcade\.com/photos/[^"]+)"[^>]*[^>]*>
PennyArcade> ERROR: invalid literal for int() with base 10: 'titania'
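The final error is what `int('titania')` produces, which suggests that some part of the dated URL was treated as a number. The log does not show where this happens inside Dosage, so the following is only a sketch of a naming helper that converts the date parts and keeps the slug as text. `name_from_url` is a hypothetical name.

```python
from urllib.parse import urlsplit


def name_from_url(page_url):
    """Build a sortable name such as '20140319-titania' from a dated comic URL.

    Assumes the path ends in /YYYY/MM/DD/slug.
    """
    parts = urlsplit(page_url).path.strip("/").split("/")
    year, month, day, slug = parts[-4:]
    # Only the date parts are numeric; the slug must stay a string.
    return "%04d%02d%02d-%s" % (int(year), int(month), int(day), slug)


print(name_from_url("http://penny-arcade.com/comic/2014/03/19/titania"))
```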

Pattern Errors

Running latest master code, received these errors.

ERROR: Patterns ['<\\s*[aA]\\s+(?:[^>]*\\s+)?[hH][rR][eE][fF]\\s*=\\s*"(http\\:\\/\\/abstrusegoose\\.com\\/\\d+)"[^>]*[^>]*>&laquo; Previous</a>'] not found at URL http://abstrusegoose.com/.

ERROR: Patterns ['<\\s*[iI][mM][gG]\\s+[^>]*strip[^>]*\\s+[sS][rR][cC]\\s*=\\s*"([^"]+/comics/[^"]+)"[^>]*[^>]*>'] not found at URL http://www.questionablecontent.net/.

Dog House Diaries & AppleGeeks Lite

could you add AppleGeeks Lite??

and DogHouseDiaries doesn't work anymore...

Alt text for comics

Some comics (XKCD for example) have extra content (extra jokes) hidden in the alt text of the image. That content is currently lost when using dosage.

A solution to this would be adding alt text to the bottom of the image that is saved. Of course this should only be done for comics with this enabled.
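Extracting the text is the easy half of this request. The sketch below collects the `title` (or, failing that, `alt`) attribute of each image using only the standard library. It does not draw the text onto the image as proposed above; the text could instead be saved next to the image file. `ImageTextParser` and `image_texts` are hypothetical names.

```python
from html.parser import HTMLParser


class ImageTextParser(HTMLParser):
    """Collect (src, text) pairs for <img> tags that carry title or alt text."""

    def __init__(self):
        super().__init__()
        self.texts = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        attrs = dict(attrs)
        # Prefer 'title' (the hover text) and fall back to 'alt'.
        text = attrs.get("title") or attrs.get("alt")
        if text:
            self.texts.append((attrs.get("src"), text))


def image_texts(html):
    parser = ImageTextParser()
    parser.feed(html)
    return parser.texts


print(image_texts('<img src="/comics/a.png" title="Extra joke" alt="A">'))
```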

The manual mentioned a -c option which doesn't exist

In the man page, under EXAMPLES there's this:

Retrieve Calvin and Hobbes strips from a given index going backwards to the beginning until an existing file is found:
dosage -c calvinandhobbes:20120722

but there is no -c option. Is this the catchup option and has it been removed?

ImportError

Hello,
I'm French, so I'm sorry about my English. I tried to install dosage, but I got an error when I ran "dosage --list":

Traceback (most recent call last):
  File "/usr/local/bin/dosage", line 19, in <module>
    from dosagelib import events, scraper
  File "/usr/local/lib/python2.7/dist-packages/dosagelib/events.py", line 8, in <module>
    from . import rss, util, configuration
  File "/usr/local/lib/python2.7/dist-packages/dosagelib/util.py", line 8, in <module>
    import requests
ImportError: No module named requests

I have the latest version of Python, and I installed dosage with: pip install dosage.

Thanks,

ptitlu

import _dosage_configdata as configdata

When I try to run Dosage I get the following error:
python dosage --adult -n 3 -b \Comics @
Traceback (most recent call last):
  File "dosage", line 18, in <module>
    from dosagelib import events, configuration, singleton, director
  File "\Dosage\dosagelib\__init__.py", line 21, in <module>
    from .configuration import Version as version
  File "\Dosage\dosagelib\configuration.py", line 5, in <module>
    import _dosage_configdata as configdata
ImportError: No module named _dosage_configdata

Pattern error with some strips

Hi,

With some strips I get a pattern error like this:

Oglaf> ERROR: Patterns ['<\\s*[iI][mM][gG]\\s+[^>]*strip[^>]*\\s+[sS][rR][cC]\\s*=\\s*"(/media-haha/comic/[^"]+)"[^>]*[^>]*>'] not found at URL http://oglaf.com/.
Oglaf> WARN: Patterns ['<\\s*[aA]\\s+(?:[^>]*\\s+)?[hH][rR][eE][fF]\\s*=\\s*"([^"]+)"[^>]*[^>]*><\\s*[dD][iI][vV]\\s+(?:[^>]*\\s+)?[iI][dD]\\s*=\\s*"pvs"[^>]*[^>]*>'] not found at URL http://oglaf.com/. Assuming no previous comic strips exist.

Why does this happen?

Thank you.

dosage -c only grabs current page

At least for the GirlGenius comic, dosage -c only grabs the current strip, despite being several days behind. dosage -a still goes back through the whole list, and will grab any missing pages while skipping existing ones.
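The two behaviours contrasted here can be sketched as follows, assuming strips are listed newest first and `existing` holds the names already on disk. Both functions are hypothetical illustrations of the described semantics, not Dosage's implementation.

```python
def catch_up(strips_newest_first, existing):
    """Catch-up mode: walk backwards and stop at the first strip already present."""
    todo = []
    for name in strips_newest_first:
        if name in existing:
            break
        todo.append(name)
    return todo


def all_missing(strips_newest_first, existing):
    """'All' mode: walk the whole list and skip strips that already exist."""
    return [name for name in strips_newest_first if name not in existing]


strips = ["day5", "day4", "day3", "day2", "day1"]
print(catch_up(strips, {"day3", "day1"}))
print(all_missing(strips, {"day3", "day1"}))
```

Under these assumptions, catch-up fetching only the current strip would mean the walk backwards stops after the first page even though the previous one is missing.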

Looking For Group under Windows??

Is there a way to have this running on Windows??

ImportError: No module named email.utils

When running a freshly installed dosage 2.15 (installed from the .exe) with a fresh installation of Python 3.4.1 and the python-requests module installed, I get the following error:

Traceback (most recent call last):
  File "dosage", line 18, in <module>
  File "dosagelib\events.pyo", line 11, in <module>
  File "dosagelib\util.pyo", line 17, in <module>
  File "requests\__init__.pyo", line 58, in <module>
  File "requests\utils.pyo", line 25, in <module>
  File "requests\compat.pyo", line 7, in <module>
  File "requests\packages\__init__.pyo", line 3, in <module>
  File "requests\packages\urllib3\__init__.pyo", line 16, in <module>
  File "requests\packages\urllib3\connectionpool.pyo", line 39, in <module>
  File "requests\packages\urllib3\request.pyo", line 12, in <module>
  File "requests\packages\urllib3\filepost.pyo", line 15, in <module>
  File "requests\packages\urllib3\fields.pyo", line 7, in <module>
ImportError: No module named email.utils

When trying to uninstall 2.15 I get the following error:

dosage -l error

Not disclosing some of the information below due to privacy reasons is ok.
I will try to help you nonetheless, but you have to give me something
I can work with ;) .

<type 'exceptions.TypeError'> unicode argument expected, got 'str'
Traceback (most recent call last):
File "/usr/local/bin/dosage", line 395, in main
res = run(parser.parse_args())
File "/usr/local/bin/dosage", line 262, in run
return doList()
File "/usr/local/bin/dosage", line 282, in doList
out.info('Available comic scrapers:')
File "/usr/local/lib/python2.7/dist-packages/dosagelib/output.py", line 29, in info
self.write(s, level=level)
File "/usr/local/lib/python2.7/dist-packages/dosagelib/output.py", line 64, in write
self.stream.write('%s' % s, color=color)
File "/usr/local/lib/python2.7/dist-packages/dosagelib/ansicolor.py", line 253, in _write
self.fp.write(text)
TypeError: unicode argument expected, got 'str'
System info:
Dosage 2.0
Python 2.7.3 (default, Jan 2 2013, 13:56:14)
[GCC 4.7.2] on linux2
Local time: 2013-04-11 22:36:33+002
sys.argv ['/usr/local/bin/dosage', '-l']
LANG = 'fr_FR.UTF-8'

Sometimes, comic files are saved with 0 bytes

I don't know if this is a bug in Dosage or Requests, but sometimes my downloaded comic files end up as 0-byte files.

Here is a log of one of those downloads:

22:37:08 Namesake> Open URL http://namesakecomic.com/comic/hell-wont-get-back-to-that-book-for-awhile/
22:37:08 Namesake> Sending headers {'User-Agent': u'Mozilla/5.0 (compatible; Dosage/1.7; +https://github.com/wummel/dosage)'}
22:37:15 Namesake> Open URL http://namesakecomic.com/comic/hell-wont-get-back-to-that-book-for-awhile/
22:37:15 Namesake> Sending headers {'User-Agent': u'Mozilla/5.0 (compatible; Dosage/1.7; +https://github.com/wummel/dosage)'}
22:37:16 Namesake> matched URL u'http://namesakecomic.com/wp-content/uploads/2012/12/Chapter12_06.jpg' with pattern <\s*[iI][mM][gG]\s+(?:[^>]*\s+)?[sS][rR][cC]\s*=\s*"([^"]*/wp-content/uploads/[^"]+)"[^>]*title=[^>]*>
22:37:16 Namesake> Open URL http://namesakecomic.com/wp-content/uploads/2012/12/Chapter12_06.jpg
22:37:16 Namesake> Sending headers {'Referer': 'http://namesakecomic.com/comic/hell-wont-get-back-to-that-book-for-awhile/', 'User-Agent': u'Mozilla/5.0 (compatible; Dosage/1.7; +https://github.com/wummel/dosage)'}
22:37:23 Namesake> ... filename = '2012-12-Chapter12_06', ext = '.jpg', contentLength = 404768
22:37:23 Namesake> Writing comic to file Comics/Namesake/2012-12-Chapter12_06.jpg...
22:37:23 Namesake> Saved Comics/Namesake/2012-12-Chapter12_06.jpg (0B).

But no error and Dosage just continues as if nothing has happened :(

(PS: Pull request for that comic is pending 😉)
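Whatever the root cause, the log shows a reported contentLength of 404768 followed by a saved size of 0B, so a size check after writing would at least turn the silent failure into an error. A minimal sketch, assuming the whole response body is available as bytes; `save_checked` is a hypothetical helper, not a Dosage or Requests function.

```python
import os
import tempfile


def save_checked(path, data, expected_length=None):
    """Write `data` (bytes) to `path` and fail loudly if the result looks wrong."""
    if not data:
        # Never create an empty file for an empty download.
        raise IOError("refusing to save empty content to %s" % path)
    with open(path, "wb") as f:
        f.write(data)
    size = os.path.getsize(path)
    if expected_length is not None and size != expected_length:
        # Remove the truncated file so a later run retries the download.
        os.remove(path)
        raise IOError("%s: wrote %d bytes, expected %d"
                      % (path, size, expected_length))
    return size


demo_dir = tempfile.mkdtemp()
written = save_checked(os.path.join(demo_dir, "strip.jpg"), b"abc", expected_length=3)
print(written)
```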

PVPOnline

python dosage PVPOnline
PvPonline> Retrieving the current strip...
PvPonline> ERROR: Pattern <\s*[iI][mM][gG]\s+(?:[^>]*\s+)?[sS][rR][cC]\s*=\s*"(http://newcdn.pvponline.com/img/comic/pvp[^"]+.jpg)"[^>]*[^>]*> not found at URL http://pvponline.com/comic.

Least I could do

It seems Least I Could Do changed its URL.
It is now:
http://cdn.leasticoulddo.com/wp-content/uploads/2013/01/201301101.gif

Not sure how to create a new plugin

I have been looking over the plugins that came with dosage, and have not figured out the proper way to create a new one. I want to create a new plugin for "Alien Dice" at http://aliendice.com. Can someone give me a little direction, or point me toward some documentation on proper plugin creation?

Thank you.

"dosage -l" does not work anymore

It seems the changes to argument parsing broke all arguments that don't need a comic.

So I have to do:

dosage -l @
dosage --version @

etc.
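With `argparse`, one common way to get this behaviour is to make the comic argument optional (`nargs="*"`) and to require it only when no standalone option was given. This is a sketch under that assumption. The option definitions below are illustrative and are not copied from Dosage's parser.

```python
import argparse


def build_parser():
    parser = argparse.ArgumentParser(prog="dosage-sketch")
    parser.add_argument("-l", "--list", action="store_true",
                        help="list available comics")
    # nargs="*" lets options such as -l run without any comic name.
    parser.add_argument("comic", nargs="*", help="comic names to fetch")
    return parser


def parse(argv):
    parser = build_parser()
    args = parser.parse_args(argv)
    # Require a comic only when no standalone action was requested.
    if not args.list and not args.comic:
        parser.error("at least one comic is required")
    return args


print(parse(["-l"]))
```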

An Eerie Cuties strip produces a fatal error, likely due to a unicode url

When downloading Eerie Cuties with the -a option, it fails at http://www.eeriecuties.com/strips-ec/you_like_them_too%EF%BC%9F

The character ？ seems to be the problem. This also hits a couple of other comics I added plugins for: Menage a 3 and Magick Chicks each have this character on one page. (I used Eerie Cuties as a template for creating those two.)

This is on Windows.

EerieCuties> Skipping existing file "Comics\EerieCuties\ec20110214.png".
C:\Python27\lib\urllib.py:1268: UnicodeWarning: Unicode equal comparison failed to convert both arguments to Unicode - interpreting them as being unequal
return ''.join(map(quoter, s))

********** Oops, I did it again. *************

You have found an internal error in Dosage. Please write a bug report
at https://github.com/wummel/dosage/issues and include at least the information below:

Not disclosing some of the information below due to privacy reasons is ok.
I will try to help you nonetheless, but you have to give me something
I can work with ;) .

<type 'exceptions.KeyError'> u'\xef'
Traceback (most recent call last):
File "C:\Python27\Scripts\dosage", line 258, in main
res = run(options, comics)
File "C:\Python27\Scripts\dosage", line 157, in run
return getComics(options, comics)
File "C:\Python27\Scripts\dosage", line 108, in getComics
errors += getStrips(scraperobj, options)
File "C:\Python27\Scripts\dosage", line 130, in getStrips
for strip in strips:
File "C:\Python27\lib\site-packages\dosagelib\scraper.py", line 92, in getAllStrips
for strip in self.getStripsFor(url, maxstrips):
File "C:\Python27\lib\site-packages\dosagelib\scraper.py", line 101, in getStripsFor
self.prevSearch, session=self.session)
File "C:\Python27\lib\site-packages\dosagelib\util.py", line 125, in fetchUrls
data, baseUrl = getPageContent(url, session=session)
File "C:\Python27\lib\site-packages\dosagelib\util.py", line 90, in getPageContent
check_robotstxt(url)
File "C:\Python27\lib\site-packages\dosagelib\util.py", line 210, in check_robotstxt
if not rp.can_fetch(UserAgent, url):
File "C:\Python27\lib\robotparser.py", line 139, in can_fetch
url = urllib.quote(url)
File "C:\Python27\lib\urllib.py", line 1268, in quote
return ''.join(map(quoter, s))
KeyError: u'\xef'
System info:
Dosage 1.8
Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
Local time: 2013-01-20 05:53:53+001
sys.argv ['C:\Python27\Scripts\dosage', '-a', 'eeriecuties']

******** Dosage internal error, over and out ********
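The traceback is from Python 2.7, where `urllib.quote` fails on a unicode string containing non-ASCII characters. As an illustration of the intended result only, the sketch below uses Python 3's `urllib.parse` to percent-encode the path as UTF-8 before the URL is handed to anything else. `quote_url_path` is a hypothetical helper, not a fix taken from Dosage.

```python
from urllib.parse import quote, urlsplit, urlunsplit


def quote_url_path(url):
    """Percent-encode non-ASCII characters in the path of `url` as UTF-8."""
    parts = urlsplit(url)
    # '%' stays safe so already-encoded sequences are not encoded twice.
    path = quote(parts.path, safe="/%")
    return urlunsplit((parts.scheme, parts.netloc, path,
                       parts.query, parts.fragment))


print(quote_url_path("http://www.eeriecuties.com/strips-ec/you_like_them_too？"))
```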

the thin H line

Could you add it?
http://thinhline.tumblr.com/

List not redirectable

Hello, what is the intended way of browsing the list of dosage -l ?
It works when I just run it alone and scrolls by rapidly, but this is what I get when I try to capture it (windows 7 command line, dosage 1.12):

D:\Users\tolgyesipeter>dosage -l | more
Traceback (most recent call last):
  File "dosage", line 13, in <module>
  File "dosagelib\events.pyo", line 6, in <module>
  File "dosagelib\util.pyo", line 18, in <module>
  File "dosagelib\output.pyo", line 8, in <module>
  File "dosagelib\ansicolor.pyo", line 234, in <module>
  File "dosagelib\colorama.pyo", line 122, in init
  File "dosagelib\colorama.pyo", line 89, in GetConsoleScreenBufferInfo
WindowsError: [Error 6] A leíró érvénytelen.

D:\Users\tolgyesipeter>dosage -l>dosage-list.txt
Traceback (most recent call last):
  File "dosage", line 13, in <module>
  File "dosagelib\events.pyo", line 6, in <module>
  File "dosagelib\util.pyo", line 18, in <module>
  File "dosagelib\output.pyo", line 8, in <module>
  File "dosagelib\ansicolor.pyo", line 234, in <module>
  File "dosagelib\colorama.pyo", line 122, in init
  File "dosagelib\colorama.pyo", line 89, in GetConsoleScreenBufferInfo
WindowsError: [Error 6] A leíró érvénytelen.

The WindowsError says Invalid descriptor.
Is this not a command line app?
I get the same error with --singlelist.

Peter
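Both tracebacks end in a console query (`GetConsoleScreenBufferInfo`) while the output is piped or redirected, so there is no console to query. A common guard is to enable colour handling only when the stream is an interactive terminal. The sketch below shows that check with plain ANSI codes. `use_color` and `write` are hypothetical names and do not reflect how Dosage's `ansicolor` module is structured.

```python
import io
import sys


def use_color(stream=None):
    """Return True only when `stream` is an interactive terminal."""
    stream = sys.stdout if stream is None else stream
    isatty = getattr(stream, "isatty", None)
    return bool(isatty and isatty())


def write(text, stream, color_code="1;32"):
    # Pipes and files get plain text; only terminals get escape codes.
    if use_color(stream):
        text = "\033[%sm%s\033[0m" % (color_code, text)
    stream.write(text + "\n")


buf = io.StringIO()
write("Available comic scrapers:", buf)
print(repr(buf.getvalue()))
```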

ImportError: No module named requests

I'm having a problem. I installed dosage for Windows, as well as Python 2.7.6 and the requests module. When I try to invoke dosage in the command shell, I get an error:

Traceback (most recent call last):
  File "dosage", line 28, in <module>
  File "dosagelib\events.pyo", line 11, in <module>
  File "dosagelib\util.pyo", line 17, in <module>
ImportError: No module named requests

I am running Windows 8. Am I doing something completely wrong? Any help is much appreciated.

ERROR: Patterns not found

Tried this for both "endtown" and "GoComics/Endtown":

C:\Windows\system32>dosage endtown
GoComics/Endtown> ERROR: Patterns ['<\s*[aA]\s+(?:[^>]*\s+)?[hH][rR][eE][fF]\s*=\s*"(/[^"]+/\d+/\d+/\d+)"[^>]*prev[^>]*>'] not found at URL http://www.gocomics.com/endtown.

Most other sources are fine, so it might be the site that's the issue.

xkcd - large images

When xkcd offers a large-image version of a comic, could we provide a flag to download the largest image available?

example:
http://xkcd.com/1298/
http://xkcd.com/1298/large

pip install dosage doesn't work

Trying to install using

pip install dosage

doesn't work and gives a not found error.

New features for events?

Hi,

Since some comics use pretty unpredictable file names, I wanted to enhance Dosage with something to preserve the comic order. My first idea is an event listener which creates a simple JSON dump of all the information Dosage gathers while it runs. See https://github.com/TobiX/dosage/commits/event-rework for a first implementation.

Since the comicDownloaded event is currently fired only on the first successful download (not when I do a "recheck" via "dosage -a"), you lose information if you ever forget "-o json" on a run.

I propose the following additional changes:

Allow multiple event listeners (-o rss,json) - My current idea for this would make events.py pretty ugly...
Always enable all listeners (I don't know if this would be popular with users)
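The first proposal can be sketched as a small dispatcher that forwards each event to every configured listener. All class and method names below are hypothetical and are not taken from Dosage's `events.py` or from the linked branch.

```python
import json


class JSONListener:
    """Collect download events and dump them as JSON."""

    def __init__(self):
        self.records = []

    def comic_downloaded(self, comic, filename):
        self.records.append({"comic": comic, "file": filename})

    def dump(self):
        return json.dumps(self.records, sort_keys=True)


class CountingListener:
    """Stand-in for a second output type such as RSS."""

    def __init__(self):
        self.count = 0

    def comic_downloaded(self, comic, filename):
        self.count += 1


class MultiHandler:
    """Forward every event to all listeners, for example for '-o rss,json'."""

    def __init__(self, listeners):
        self.listeners = list(listeners)

    def comic_downloaded(self, comic, filename):
        for listener in self.listeners:
            listener.comic_downloaded(comic, filename)


json_out, counter = JSONListener(), CountingListener()
handler = MultiHandler([json_out, counter])
handler.comic_downloaded("Namesake", "2012-12-Chapter12_06.jpg")
print(json_out.dump(), counter.count)
```

Keeping the fan-out in one wrapper class means the individual listeners stay unaware of each other, which may limit how much the existing events module has to change.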

Dilbert

It downloads the latest strip as dilbert.com.jpg and does not rename it to the date of the comic. It then tries to download the next strip but stops because the file (dilbert.com.jpg) already exists.

Hagar - other source

Would it be possible to add this source also??
http://www.hagarthehorrible.net/
There seem to be a lot more comics of Hagar...

abstrusegoose - alt text

The master branch source doesn't seem to download the abstrusegoose alt text.

The deb target in the Makefile fails with a missing dosage.mak file

When trying to build a deb package using "make deb" it fails due to not finding the "dosage.mak" file.

I'd like to be able to make deb files for apt tracking and dependencies, so if this could be fixed I'd be grateful.

Request: Whomp Comic

Request to add the Whomp comic to dosage: http://www.whompcomic.com/

Detect end-of-life comics

Some comics are definitely end-of-life and there should be an option or a tag to say "this comic is completely downloaded, do not crawl this anymore".
Ideas:

First, mark comics that are end-of-life in the source, e.g. endOfLife = True.
Then, after a successful (i.e. error-free) run with "--all" without indexes, set a marker in the downloaded comic directory.
Let the marker be easy to add or delete manually (not necessarily with dosage, but with a file editor or system tools). The marker could be a single file "complete.txt" or something similar.
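The three ideas above can be sketched as follows. The marker name `complete.txt` and the `endOfLife` flag come from the proposal; the helper functions are hypothetical.

```python
import os
import tempfile

MARKER = "complete.txt"


def is_complete(comic_dir):
    """True if the marker file exists in the comic's download directory."""
    return os.path.exists(os.path.join(comic_dir, MARKER))


def mark_complete(comic_dir):
    """Set the marker after an error-free '--all' run; it is a plain text file."""
    with open(os.path.join(comic_dir, MARKER), "w") as f:
        f.write("All strips downloaded; delete this file to crawl again.\n")


def should_crawl(comic_dir, end_of_life):
    # Skip only comics that are flagged end-of-life AND fully downloaded.
    return not (end_of_life and is_complete(comic_dir))


demo_dir = tempfile.mkdtemp()
print(should_crawl(demo_dir, end_of_life=True))
mark_complete(demo_dir)
print(should_crawl(demo_dir, end_of_life=True))
```

Because the marker is an ordinary file, deleting it by hand is enough to make the comic eligible for crawling again.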
