Department of Informatics
The Strand, London WC2R 2LS, UK
  Undergraduate enquiries:
Postgraduate enquiries:
Fax:
+44 (0)20 7848 2005
+44 (0)20 7848 2588
+44 (0)20 7848 2851
 
 



2011/12 BSc/MSci individual projects

Supervisor: Maxime Crochemore
Email: @kcl   Office: Strand Building S6.33


Project titles

The project development platforms are to be discussed with the supervisor.


Detailed Descriptions

Runs in strings

The aim of this project is to develop a software to detect and display all the runs (repetitions) occurring in a text The search for runs is described in a recent article that will be available.

Maximal exponent

The aim of this project is to track repetitions in strings and to report their maximal exponent. For example "gaga" is twice "ga", i.e. has exponent 2; and "alfalfa" is a 7/3-repetition of "alf".

Genomic sequence comparison

The aim of this project is to design and implement an algorithm for comparing genomic sequences without alignment. The method is based on sets of absent words derived from each sequence. An algorithm for computing absent words is provided.

Similarities between two texts

The aim of this project is to develop a system that reports the largest segments appearing in two different texts written in natural language. Several parameters can be considered according to a threshold length of segments or to the similarity between segments. A preprocessing of the texts can transform them into sequences of integers (by hashing) in order to simplify further word comparisons. The project includes a user-friendly display of similarities.

Best ranking

The aim of this project is to test heuristics to find the center of three permutations, called the Best sorting. When we have three permutations, associated to ranking made by three persons for example, we want to have the median ranking defined according to some criteria. A paper presenting the problem is provided.

Your own project idea

You can submit your own project ideas in the following domains: algorithms and data structures, data compression, natural language analysis, bioinformatics, automata and formal languages, pattern matching, games, etc.


Informatics Homepage     Local Homepage