phpcrawler's Introduction

PHPCrawler

A PHP crawl script for building a custom sitemap.

This small class lets you generate, for example, the sitemap of your site.

Using a cron job, you could then automate sitemap generation so that it is always up to date.

The script is freely available and, more generally, can be used to crawl any website.

Created by Thomas Moreira (www.thomasmoreira.fr)

PHP Crawler (English)

This script crawls all pages of a website to create a custom sitemap.

Using a cron job, you could automate the generation of your sitemap so that it is updated periodically, or run it yourself manually when you have made a change.
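As a rough illustration of the cron approach, the crontab entry below requests main.php once a day. Since the usage notes say that opening main.php runs the crawler, this sketch triggers it over HTTP with curl. The schedule (03:00 daily) and the URL http://yoursitedomain.com/main.php are assumptions for illustration, so adjust both to your own setup.

```
# Assumed schedule and URL: request main.php every day at 03:00 to regenerate sitemap.xml
0 3 * * * curl -s http://yoursitedomain.com/main.php > /dev/null
```

Add the line with `crontab -e` on a machine that can reach your site.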

To Use

On line 5 of main.php, change the domain to your own. To test on localhost, enter your localhost address and port number instead.

$website = "http://www.thomasmoreira.fr"; // Change this to your domain

Add crawler.php and main.php to your site's root directory, then open main.php to run the crawler.

The crawler writes sitemap.xml to the root directory, so you can access it at http://yoursitedomain.com/sitemap.xml and submit that URL to your chosen search engine.

Licence GNU GPL

To find out what you may or may not do with my tool, here are some links to read:

phpcrawler's People

Contributors

edimitchel, laurenclark

phpcrawler's Issues

Add a license

Hello! Nice project ;)

But since I don't know under what conditions I am allowed to use PHPCrawler, it would be good to add a license. Thank you very much!

Playlist linked

Hello Thomas,
I just tested your crawler.
One problem: it crawls video and music playlist links.
Do you have a pattern for that?
Links containing "mpeg,png,jpeg..." should be removed.
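
One possible way to approach the filtering requested above is to check the file extension of each URL's path before adding the link to the crawl queue. The sketch below is not part of PHPCrawler: the function name `isMediaLink`, the extension list, and the sample URLs are all hypothetical, and where such a check would plug into crawler.php is left open.

```php
<?php
// Hypothetical helper (not part of PHPCrawler): returns true when the
// path of $url ends in one of the given media extensions.
function isMediaLink(string $url, array $extensions = ['mpeg', 'mp3', 'mp4', 'png', 'jpg', 'jpeg', 'gif']): bool
{
    // Look only at the path so query strings such as "?v=2" are ignored.
    $path = parse_url($url, PHP_URL_PATH);
    if (!is_string($path)) {
        return false; // no path component, or a malformed URL
    }
    $extension = strtolower(pathinfo($path, PATHINFO_EXTENSION));
    return $extension !== '' && in_array($extension, $extensions, true);
}

// Sample links (assumed for illustration only).
$links = [
    'http://example.com/',
    'http://example.com/about.html',
    'http://example.com/media/song.MP3',
    'http://example.com/img/logo.png?v=2',
];

// Keep only the links that do not point at media files.
$pages = array_values(array_filter($links, function ($link) {
    return !isMediaLink($link);
}));
```

Matching on the parsed extension instead of a plain substring search avoids dropping pages whose URL merely contains a string such as "png" somewhere in the path.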
