2004

CG125 search engine

A simple document indexing and retrieval engine

Polytechnic
Java/Scala


Building a search engine from scratch was one of the early projects I did during my bachelor's in computer engineering. Back in the early 2000s, search engines were not as complicated as they are today, and for the Information Storage and Retrieval course we were asked to develop one for a fairly large collection of texts.

The project was implemented using the newly released Java 1.4 and used a customized inverted index to quickly retrieve pieces of documents matching specific keyword formulations. It was a great learning experience and significantly influenced everything I did later!
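
For readers unfamiliar with the idea, here is a minimal sketch of what an inverted index does. It is not the original project code: the class name `InvertedIndex`, the methods `add` and `search`, the simple tokenizer, and the AND-only query semantics are all assumptions made for illustration, and it uses modern Java syntax (generics, lambdas) that Java 1.4 did not have.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeSet;

// Hypothetical illustration of an inverted index; not the original CG125 code.
class InvertedIndex {
    // Maps each term to the sorted set of ids of the documents that contain it.
    private final Map<String, TreeSet<Integer>> postings = new HashMap<>();

    // Assumed tokenizer: lowercase, then split on anything that is not a-z or 0-9.
    private static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!token.isEmpty()) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    // Indexes one document by recording its id under every term it contains.
    void add(int docId, String text) {
        for (String term : tokenize(text)) {
            postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docId);
        }
    }

    // Returns, in ascending order, the ids of documents containing every query term.
    List<Integer> search(String query) {
        TreeSet<Integer> result = null;
        for (String term : tokenize(query)) {
            TreeSet<Integer> docs = postings.get(term);
            if (docs == null) {
                return new ArrayList<>(); // an unknown term matches nothing
            }
            if (result == null) {
                result = new TreeSet<>(docs); // copy so the index is not modified
            } else {
                result.retainAll(docs); // intersect with the next posting list
            }
        }
        return result == null ? new ArrayList<>() : new ArrayList<>(result);
    }
}
```

The point of the structure is that a query only touches the posting lists of its own terms instead of scanning every document, which is what makes retrieval fast on a large collection. A real engine would typically also store term positions and rank the results, which this sketch leaves out.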