Skip to content

Sample search engine with web crawler, built on Node.js + CouchDB + Limestone

Notifications You must be signed in to change notification settings

patrickhempel/node-search-engine

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This is a really simple search engine, built with:

  • node.js
  • Sphinx search server
  • CouchDB
  • Express web framework

Right now it uses Node.js v0.1.90 (process.mixin is the only thing that won't work in 0.1.91 though).

Usage

First, you need to setup host to crawl in settings.js. Right now it's finalfantasy.wikia.com. You also need CouchDB installed, and at least one database. Put database name and host into settings.js. You need to install Sphinx server from http://www.sphinxsearch.com .

Indexing

Run:

node spider.js

Spider starts crawling pages and putting them into DB.

To index crawled pages with Sphinx, enter node /path/to/xmlpipe2.js in sphinx.conf as xmlpipe2 source. To index newly created source right now, start indexer manually:

indexer <sourcename>

Searching

To start site, launch

node app.js

Site will be available at port 8000. Enter search terms in form and submit it.

About

Sample search engine with web crawler, built on Node.js + CouchDB + Limestone

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published