Comments (6)
Here's an example of other materials that might be useful (with examples from progfun-002):
- Assignment index page (e.g.), linking to
I don't know how generic this scheme is.
from coursera-dl.
+1
from coursera-dl.
+1 This would be helpful.
from coursera-dl.
I don't think that changing the info to PDF would be a good idea,
maybe keeping the original html, would be better.
The main issue of downloading the extra materials is that each course
defines their own sections. Generally they do it through an internal
wiki that produces links like:
https://class.coursera.org/<COURSE_NAME>/wiki/view?page=Schedule
(e.g. for the schedule, course information, faq, course logistics,
etc):
I think that parsing and getting such pages must not be so
difficult. As well as downloading the standard sections that almost
all courses have (that have an almost regular structure):
/quiz/index -> quizzes
/assignment/index -> auto graded assignments
/human_grading/index -> peer graded assignments
/quiz/index?quiz_type=homework -> homework questions (not in all courses)
/quiz/index?quiz_type=exam -> exam (not in all courses)
Another issue is that many times those pages have links to external
ressources such as .zip files or .pdf and those are sometimes hosted
in other places. The question would be where to stop, and what's the
best way to do such crawler without reinventing the wheel or hand
coding everything, to have in the end a nice 'browsable' site like
with (wget -p, or httrack).
from coursera-dl.
I recently discovered that some courses do not (or no longer) make their materials available indefinitely after the course and lost important reference material as a result. I have turned to coursera-dl
for downloading, and am manually grabbing what it does not.
Just the ability to grab the surrounding coursera web(wiki?) pages (syllabus, course materials,etc) would be a great next step. I agree that HTML is better than PDF.
Following all the links on the coursera pages to grab the linked PDFs, Word docs, YouTube videos, and everything would be a really cool future goal, but probably not as important. In many cases a user may not want to download all of those external resources and may prefer to follow the links from the downloaded coursera page.
from coursera-dl.
Chrome: https://chrome.google.com/webstore/detail/coursera-quiz-printer/pkgbcmdpjlnmngdfjicnkppkkmnaejnm
Firefox: https://addons.mozilla.org/addon/coursera-quiz-printer/
Blogpost: https://churchofthought.org/blog/2020/10/17/coursera-quiz-printer-a-cross-browser-webextension/
from coursera-dl.
Related Issues (20)
- Coursera-dl worked and now stops with an 404 error on the same class HOT 12
- Error 403 Client Error: Forbidden for url: https://api.coursera.org/api/memberships.v1?includes=courseId,courses.v1&q=me&showHidden=true&filter=current,preEnrolled getting page https://api.coursera.org/api/memberships.v1?includes=courseId,courses.v1&q=me&showHidden=true&filter=current,preEnrolled The server replied: {"errorCode":"Not Authorized","message":null,"details":null} HOT 7
- Cannot download files on Coursera platform
- Issue in Parsing syllabus of on-demand course HOT 2
- Unable to reinstall coursera-dl HOT 1
- Course with `-` prefix to course name not downloading! HOT 2
- How to download only the transcript
- Please provide a username with the -u option, or a .netrc file with the -n option. HOT 3
- stops downloading after a while
- connection.py
- Autosave failed and permission denied in all notebooks HOT 2
- Coursera - API Route Does Not Exist HOT 5
- Coursera - API Route Does Not Exist They have upgraded to V3 HOT 3
- What login pass should I use?
- Issue with coursera-dl: Error 400 Client Error: Bad Request HOT 4
- Not able to download course
- Not working with latest python versions HOT 7
- courser-helper error HOT 1
- HTTPError 404 Client Error: Not Found for URL HOT 7
- Ability to skip a single video by request HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from coursera-dl.