My research interests include Information Retrieval, Music Information Retrieval, recommender technology, pattern matching, compression computer music technology, audio processing, informetrics, genomic data-searching techniques.
Below are a list of suggested topics. If you have an idea that is not on this list but in one of the areas listed above, feel free to see me and discuss its potential as a thesis/project topic.
The prototype should provide a method of determining the user's level of language knowledge (using vocabulary lists based on word frequency), and return a ranked list of documents that match the user's knowledge.
For regular users, the application should track what they've read as
well as their knowledge, so that new documents can be presented.
Other aspects of the topic include implementing several solutions to indexing documents for retrieval based on reading difficulty. Reading difficulty can be approximated using vocabulary measures and sentence length. Documents that are easy to read are likely to be those that are largely made up of common words and short sentences. However, search engines usually don't index common words due to the resulting large index size.