Problem

Sequence alignment. When a new gene is discovered, a standard approach to understanding it...

Sequence alignment. When a new gene is discovered, a standard approach to understanding its function is to look through a database of known genes and find close matches. The closeness of two genes is measured by the extent to which they are aligned. To formalize this, think of a gene as being a long string over an alphabet ∑ = {A, C, G, T}. Consider two genes (strings) x = ATGCC and y = TACGC A. An alignment of x and y is a way of matching up these two strings by writing them in columns, for instance:

— A T — G C C

T A — C G C A

Here the “-” indicates a “gap.” The characters of each string must appear in order, and each column must contain a character from at least one of the strings. The score of an alignment is specified by a scoring matrix δ of size (|∑| + 1) x (|∑| + 1), where the extra row and column are to accommodate gaps. For instance the preceding alignment has the following score:

Give a dynamic programming algorithm that takes as input two strings x[1...n] and y[1...m] and a scoring matrix δ, and returns the highest-scoring alignment. The running time should be O (mn).

Step-by-Step Solution

Request Professional Solution

Request Solution!

We need at least 10 more requests to produce the solution.

0 / 10 have requested this problem solution

The more requests, the faster the answer.

Request! (Login Required)

All students who have requested the solution will be notified once they are available.

Add your Solution

Textbook Solutions and Answers Search

Solutions For Problems in Chapter 6

Free Homework Help App

Download From Google Play

Scan Your Homework
to Get Instant Free Answers

Need Online Homework Help?

Ask a Question

Get Answers For Free
Most questions answered within 3 hours.

Recent Solutions