Giter Club home page Giter Club logo

webtoon-crawler's Introduction

webtoon-crawler

네이버 웹툰 크롤러

만화 콘텐츠(이미지) 크롤링

casperjs crawl_contents.js --id=웹툰아이디 --no=회수번호 --dst=대상경로

__stdout__으로 출력되는 정보

{
  "title" : "해당 화의 제목",
  "rating" : "별점 (숫자)",
  "counts" : "이미지_조각_수 (숫자)"
}

마지막 화 정보 크롤링

casperjs crawl_last_episode.js --id=웹툰아이디

__stdout__으로 출력되는 정보

{
  "title" : "해당 화의 제목",
  "rating" : "별점 (숫자)",
  "date" : "날짜 / 예:15.01.20",
  "no" : "회수번호 (숫자)"
}
  • 목록 중 특별편, 프롤로그 등으로 인해서 제목의 회수번호랑 __no__번호랑 차이가 날 수 있음.

인기급상승 랭킹 크롤링

casperjs crawl_ranking.js

__stdout__으로 출력되는 정보

{
  "rank_favorite" : ["title1", "title2", "title3", "title~10"],
  "rank_update" : ["title1", "title2", "title3", "title~10"]
}
  • rank_favorite : 인기 순 정렬
  • rank_update : 업데이트 순 정렬

주의사항

  • Windows CMD에서 실행 시 __chcp 65001__설정 후 사용
  • 가끔 no중 빠진 번호가 존재하는 웹툰이 있음

ToDo

  • 웹툰 목록 크롤링
  • 댓글

webtoon-crawler's People

Contributors

pjc0247 avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.