Information retrieval project at UB.
This project contains the code for the Information Retrieval project at the University at Buffalo.
This project aims to build a Wikipedia indexer with the following goals:
- Parse fairly involved Wikipedia markup.
- Index a decent-sized subset of the Wikipedia corpus.
- Create multiple indexes on the page data as well as metadata.
- Provide an index introspection mechanism that can later be built upon to support queries.
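
To make the indexing and introspection goals concrete, here is a minimal sketch of an inverted index with two introspection methods. It is an illustration only, not the project's actual code: Java is assumed because of the Eclipse setup, and the names `SimpleIndex`, `add`, `termCount`, `postingsFor`, and `IndexDemo` are hypothetical. The tokenizer is deliberately naive and does not handle Wikipedia markup.

```java
import java.util.Collections;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical sketch: these classes are not part of the project.
class SimpleIndex {
    // Maps each term to the sorted set of page IDs that contain it.
    private final Map<String, Set<Integer>> postings = new TreeMap<>();

    // Lowercase the text, split on non-alphanumeric characters,
    // and record every non-empty token for the given page.
    void add(int pageId, String text) {
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!token.isEmpty()) {
                postings.computeIfAbsent(token, k -> new TreeSet<>()).add(pageId);
            }
        }
    }

    // Introspection: number of distinct terms in the index.
    int termCount() {
        return postings.size();
    }

    // Introspection: page IDs for a term, or an empty set if the term is absent.
    Set<Integer> postingsFor(String term) {
        return postings.getOrDefault(term.toLowerCase(Locale.ROOT), Collections.emptySet());
    }
}

class IndexDemo {
    public static void main(String[] args) {
        // One index over page text and a separate one over metadata (here, authors).
        SimpleIndex textIndex = new SimpleIndex();
        SimpleIndex authorIndex = new SimpleIndex();

        textIndex.add(1, "Buffalo is a city");
        textIndex.add(2, "A city in New York");
        authorIndex.add(1, "alice");
        authorIndex.add(2, "bob");

        System.out.println("Distinct terms: " + textIndex.termCount());
        System.out.println("Pages with 'city': " + textIndex.postingsFor("city"));
        System.out.println("Pages by 'alice': " + authorIndex.postingsFor("alice"));
    }
}
```

Keeping page text and metadata in separate index instances mirrors the "multiple indexes" goal, and the read-only introspection methods are the kind of hook a later query layer could build on.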
Steps to clone the project in Eclipse:
- Install the GitHub plugin for Eclipse.
- Switch to the Git Repository Exploring view.
- Choose the option "Clone a Git repository and add the clone to this view".