Results 1 to 5 of 5

Thread: Creating a Search Spider anyone know how?

  1. #1
    Join Date
    Sep 2006
    Beans
    530

    Creating a Search Spider anyone know how?

    Hi

    I am interested in making my own search spider (just to say I made one). I tried doing google searches for the subject but I just get information about PageRank. I'm looking for information on how to create a search spider with c++ or c or whatever and how to run one.

    Does anyone either know where to start or have a good resource about this subject? I'm a self taught programmer so haven't been able to rely on schooling for this type of thing.

    Thanks,

    The Net Duck
    My personal website is www.thenetduck.com
    A great new design community is: www.FreakinAmazing.com

  2. #2
    Join Date
    Nov 2008
    Location
    Athens, Hellas
    Beans
    332
    Distro
    Ubuntu 9.04 Jaunty Jackalope

    Re: Creating a Search Spider anyone know how?

    Quote Originally Posted by thenetduck View Post
    Hi

    I am interested in making my own search spider (just to say I made one). I tried doing google searches for the subject but I just get information about PageRank. I'm looking for information on how to create a search spider with c++ or c or whatever and how to run one.

    Does anyone either know where to start or have a good resource about this subject? I'm a self taught programmer so haven't been able to rely on schooling for this type of thing.

    Thanks,

    The Net Duck
    I guess we are talking about a metacrawler ?

    http://en.wikipedia.org/wiki/Metacrawler

  3. #3
    Join Date
    Nov 2008
    Location
    Athens, Hellas
    Beans
    332
    Distro
    Ubuntu 9.04 Jaunty Jackalope

    Re: Creating a Search Spider anyone know how?

    maybe something like this ?

    http://www.cs.uiowa.edu/~asignori/helios/

    by the way it include the source in the download and is written in C. Should be enough.

  4. #4
    Join Date
    Aug 2008
    Location
    Istanbul, Turkey
    Beans
    17
    Distro
    Ubuntu 9.04 Jaunty Jackalope

    Re: Creating a Search Spider anyone know how?

    You can make a score system for searches.

    For example

    If title include searched keyword
    Add Score 40
    If domain
    Add Score 70
    If description
    Add Score 30
    If keyword
    Add Score 20
    If in HTML
    Add Score 25
    If In Headers like h1, h2 in HTML
    Add Score 15
    If in <p>
    Add Score 10

    and you can score all results and you can order by score

  5. #5
    Join Date
    Apr 2006
    Location
    Hamilton, New Zealand
    Beans
    198
    Distro
    Ubuntu 9.04 Jaunty Jackalope

    Re: Creating a Search Spider anyone know how?

    It might be worth having a look at nutch - which is an open source search engine - it has a crawler component that you can look at.

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •