Super Smash Search Engine

This is my most recent project, built as my senior project for my undergraduate degree in Computer Science.

The front end consists of HTML, CSS, JavaScript, and jQuery, which render the UI layout and visual design. On the back end, I use Spark (the Java web framework) to run a local web server. The project also uses libraries such as Snowball and Gson for processing text queries and parsing JSON files.

The search mechanism combines several techniques: a positional inverted index, ranked retrieval using the vector space model, word tokenization, and support for phrase queries. A Boolean query parser takes each query and looks up its terms in a HashMap of postings lists, so finding a term's postings list takes constant time on average.
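To make the positional index and phrase queries concrete, here is a minimal sketch in Java. It is not the project's actual code: the class name `PositionalIndex`, its methods, and the simple lowercase-and-split tokenizer (with no stemming) are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch of a positional inverted index; names are illustrative only.
class PositionalIndex {
    // term -> (docId -> positions of the term in that document, in ascending order)
    private final Map<String, TreeMap<Integer, List<Integer>>> postings = new HashMap<>();

    // Assumed tokenizer: lowercase, split on non-alphanumeric characters, no stemming.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase().split("[^a-z0-9]+")) {
            if (!t.isEmpty()) tokens.add(t);
        }
        return tokens;
    }

    // Records the position of every token of the document under its term.
    void addDocument(int docId, String text) {
        List<String> tokens = tokenize(text);
        for (int pos = 0; pos < tokens.size(); pos++) {
            postings.computeIfAbsent(tokens.get(pos), k -> new TreeMap<>())
                    .computeIfAbsent(docId, k -> new ArrayList<>())
                    .add(pos);
        }
    }

    // Returns the ids of documents that contain the phrase's terms at consecutive positions.
    List<Integer> phraseQuery(String phrase) {
        List<String> terms = tokenize(phrase);
        List<Integer> result = new ArrayList<>();
        if (terms.isEmpty()) return result;
        TreeMap<Integer, List<Integer>> first = postings.get(terms.get(0));
        if (first == null) return result;
        for (Map.Entry<Integer, List<Integer>> e : first.entrySet()) {
            int docId = e.getKey();
            for (int start : e.getValue()) {
                if (matchesAt(docId, start, terms)) {
                    result.add(docId);
                    break; // one match per document is enough
                }
            }
        }
        return result;
    }

    // Checks that term i of the phrase occurs at position start + i in the document.
    private boolean matchesAt(int docId, int start, List<String> terms) {
        for (int i = 1; i < terms.size(); i++) {
            TreeMap<Integer, List<Integer>> p = postings.get(terms.get(i));
            if (p == null) return false;
            List<Integer> positions = p.get(docId);
            if (positions == null || !positions.contains(start + i)) return false;
        }
        return true;
    }
}
```

Each term lookup is a single HashMap access. The positional check here scans the position lists linearly, which keeps the sketch short; a merge over the sorted lists would be the faster choice on a large corpus.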

The program works as follows. The user enters a corpus directory to index; the corpus can be large, even thousands of documents. Once the directory is indexed, the user can enter queries based on their information need. The search engine then returns the top matches for that specific query. Finally, the user can open the documents returned for that query.
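The "top matches" step can be sketched as a small vector space ranking in Java. This is an illustration under stated assumptions, not the project's implementation: the class `RankedRetrieval`, its methods, and the particular weighting (document weight 1 + ln(tf), query weight ln(1 + N/df), scores divided by the document vector's Euclidean length) are one common tf-idf variant chosen for the example.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of vector space ranking; names and weighting are assumptions.
class RankedRetrieval {
    // term -> (docId -> term frequency in that document)
    private final Map<String, Map<Integer, Integer>> tf = new HashMap<>();
    // docId -> Euclidean length of the document's weight vector
    private final Map<Integer, Double> docLength = new HashMap<>();
    private int docCount = 0;

    // Indexes an already-tokenized document.
    void addDocument(int docId, List<String> tokens) {
        Map<String, Integer> counts = new HashMap<>();
        for (String t : tokens) counts.merge(t, 1, Integer::sum);
        double sumSquares = 0.0;
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            tf.computeIfAbsent(e.getKey(), k -> new HashMap<>()).put(docId, e.getValue());
            double w = 1.0 + Math.log(e.getValue()); // document term weight
            sumSquares += w * w;
        }
        docLength.put(docId, Math.sqrt(sumSquares));
        docCount++;
    }

    // Returns up to k document ids, best score first.
    List<Integer> topK(List<String> queryTerms, int k) {
        Map<Integer, Double> scores = new HashMap<>();
        for (String term : queryTerms) {
            Map<Integer, Integer> postings = tf.get(term);
            if (postings == null) continue; // term not in the corpus
            double wq = Math.log(1.0 + (double) docCount / postings.size()); // query term weight
            for (Map.Entry<Integer, Integer> e : postings.entrySet()) {
                double wd = 1.0 + Math.log(e.getValue());
                scores.merge(e.getKey(), wq * wd, Double::sum);
            }
        }
        List<Integer> ranked = new ArrayList<>(scores.keySet());
        // Normalize each accumulated score by document length, then sort descending.
        ranked.sort((a, b) -> Double.compare(
                scores.get(b) / docLength.get(b),
                scores.get(a) / docLength.get(a)));
        return new ArrayList<>(ranked.subList(0, Math.min(k, ranked.size())));
    }
}
```

Dividing by document length keeps long documents from winning simply because they contain more words, so a short document focused on the query term can outrank a longer one that mentions it once.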

Technologies:
Link to Repository
Search Results
Document Opened