DNA最长的常见子序列

时间:2013-07-30 02:40:43

标签: python-3.x

所以这是一个家庭作业问题,它让我很困惑。我曾多次尝试编写此代码但仍然没有。问题是找到两条DNA链之间最长的共同子序列并打印序列。谁能在这里帮助我,也让我知道该怎么做?我已设置窗口并使用find函数查看字符串相等的位置,然后将max函数设置为仅显示最长的函数。

def main():

s1 = input("Enter first strand: ")
s2 = input("Enter second strand: ")
window = 0
if len(s1) < len(s2):
    window = len(s1)
elif len(s1) > len(s2):
    window = len(s2)
else:
    window = len(s1)
for i in range(window, 1, -1):
    for j in range(0, i, 1):
     if s1.find(s1[i:i + window]) == s2.find(s2[i:i + window]):
        max = a[0][0]
        for i in range(len(a)):
            for j in range(len(a[i]):
                if (a[i][j] > max):
                    max = a[i][j]
                    print ("Common Subsequence: ", max)
     else:
        print ("No Common Sequence Found")

main()

1 个答案:

答案 0 :(得分:0)

假设你被给予相同长度的股线(因为这是作业,我会让你调整它的长度不等)

from itertools import zip_longest, chain
def longestCommonSubstring(p1, p2):

    answer = [[0.0 for _ in range(len(p2)+1)] for _ in range(len(p1)+1)]

    for r, row in enumerate(answer[1:], 1):
        for c, _ in enumerate(row[1:], 1):
            if p1[r-1] == p2[c-1]:
                answer[r][c] = answer[r-1][c-1]+1
            else:
                answer[r][c] = 0.0

    return max(chain.from_iterable(matrix))/max(len(p1), len(p2))

希望这有帮助

编辑:实施find

def myFind(string, substring):
    for i in range(len(string) - len(substring)):
        if string[i] == substring[0]:
            if string[i:i+len(substring)] == substring:
                print substring, "found in", string, "at position", i

如果您不想使用字符串比较:

def myFind(string, substring):
    for i in range(len(string) - len(substring)):
        if string[i] == substring[0]:
            found = True
            for j in range(len(substring)):
                if string[i+j] != substring[j]:
                    found = False
            if found:
                print substring, "found in", string, "at position", i