使用 Python 获取 DNA 链的反向互补

当前位置：主页 > 学无止境 > 编程语言 > Python >

Python PHP Java Go TypeScript C++ Vba Node.js C语言 MATLAB

使用 Python 获取 DNA 链的反向互补

作者：迹忆客最近更新：2023/03/01 浏览次数：

在使用 Python 处理科学任务时，我们需要执行某些任务。本文讨论了使用 Python 获得 DNA 链反向互补的各种方法。

DNA 链主要由四种碱基表示，即腺嘌呤（A）、胸腺嘧啶（T）、鸟嘌呤（G）和胞嘧啶（C）。也可能有其他类型的碱基。

每条 DNA 链由一系列字母 A、T、G 和 C 表示。例如，ACGTAATTGGCC 可能是其中一条 DNA 链。

为了得到一条 DNA 链的互补链，我们将原始链中的 A 替换为 T，C 替换为 G，G 替换为 C，T 替换为 A。例如，ACGTAATTGGCC 的补码是 TGCATTAACCGG。

为了反转 DNA 链的补体，我们将反转 DNA 链的补体中的字符。因此，反向补码将是 GGCCAATTACGT。

现在让我们讨论使用 Python 获得 DNA 字符串的反向补码的方法。

我们将按照以下步骤在 Python 中使用 for 循环来获得 DNA 链的反向互补。

你可以在以下示例中观察到这一点。

input_strand = "ACGTAATTGGCC"
reversed_strand = ""
length = len(input_strand)
for i in range(length):
    character = input_strand[length - 1 - i]
    if character == "A":
        reversed_strand = reversed_strand + "T"
    elif character == "T":
        reversed_strand = reversed_strand + "A"
    elif character == "G":
        reversed_strand = reversed_strand + "C"
    elif character == "C":
        reversed_strand = reversed_strand + "G"
    else:
        reversed_strand = reversed_strand + character
print("The input DNA strand is:", input_strand)
print("The reverse complement is:", reversed_strand)

输出：

The input DNA strand is: ACGTAATTGGCC
The reverse complement is: GGCCAATTACGT

在上述方法中，在创建 reversed_strand 时，会为输入 DNA 链中的每个字符创建一个新字符串。如果输入的 DNA 链太长，这在时间和内存方面可能会很昂贵。

为了避免这种情况，我们可以使用列表来使用 Python 获取 DNA 链的反向互补。

我们将使用以下步骤使用 for 循环、列表和 join() 方法来反向互补 DNA 链。

你可以在以下示例中观察到这一点。

input_strand = "ACGTAATTGGCC"
reversed_strand = ""
complement_chars = []
length = len(input_strand)
for i in range(length):
    character = input_strand[length - 1 - i]
    if character == "A":
        complement_chars.append("T")
    elif character == "T":
        complement_chars.append("A")
    elif character == "G":
        complement_chars.append("C")
    elif character == "C":
        complement_chars.append("G")
    else:
        complement_chars.append(character)
reversed_strand = "".join(complement_chars)
print("The input DNA strand is:", input_strand)
print("The reverse complement is:", reversed_strand)

输出：

The input DNA strand is: ACGTAATTGGCC
The reverse complement is: GGCCAATTACGT

代替在 for 循环中使用 if-else 块，我们可以使用字典和 get() 方法使用 Python 获取 DNA 链的反向补码。对于此任务，我们将创建以下字典。

reverse_dict={"A":"T","T":"A","G":"C","C":"G"}

get() 方法检索与字典中的键关联的值。当在字典上调用时，get() 方法将键作为其第一个输入参数，将一个可选值作为其第二个输入参数。

如果键存在于字典中，则返回与其关联的值。否则，get() 方法返回作为第二个参数传递的可选值。

我们将使用 get() 方法和 reverse_dict 使用以下步骤来反转 DNA 链的互补链。

你可以在以下示例中观察到这一点。

input_strand = "ACGTAATTGGCC"
reversed_strand = ""
reverse_dict = {"A": "T", "T": "A", "G": "C", "C": "G"}
length = len(input_strand)
for i in range(length):
    character = input_strand[length - 1 - i]
    reversed_strand = reversed_strand + reverse_dict.get(character, character)

print("The input DNA strand is:", input_strand)
print("The reverse complement is:", reversed_strand)

输出：

The input DNA strand is: ACGTAATTGGCC
The reverse complement is: GGCCAATTACGT

如前所述，在 for 循环中创建字符串的方法成本很高。因此，我们可以使用列表和 join() 方法和 get() 方法使用 Python 获取 DNA 链的反向互补，如以下示例所示。

input_strand = "ACGTAATTGGCC"
reversed_strand = ""
reverse_dict = {"A": "T", "T": "A", "G": "C", "C": "G"}
complement_chars = []
length = len(input_strand)
for i in range(length):
    character = input_strand[length - 1 - i]
    complement_chars.append(reverse_dict.get(character, character))
reversed_strand = "".join(complement_chars)
print("The input DNA strand is:", input_strand)
print("The reverse complement is:", reversed_strand)

输出：

The input DNA strand is: ACGTAATTGGCC
The reverse complement is: GGCCAATTACGT

在这里，我们首先在迭代输入 DNA 链时创建了反向补码中的字符列表。之后，我们通过使用 join() 方法连接字符来创建反向补码。

除了使用 for 循环，你还可以使用列表推导来使用 Python 反向补充 DNA 链。

我们将首先使用索引来反转输入的 DNA 链，以使用列表理解来补充 DNA 链。之后，我们将使用列表推导和上一个示例中创建的 get() 方法和 reverse_dict 来获取反向补码的字符列表。

一旦我们得到字符列表，我们将使用 join() 方法来查找输入 DNA 链的反向补码，如下例所示。

input_strand = "ACGTAATTGGCC"
reversed_strand = ""
reverse_dict = {"A": "T", "T": "A", "G": "C", "C": "G"}
temp = input_strand[::-1]
complement_chars = [reverse_dict.get(character) for character in temp]
reversed_strand = "".join(complement_chars)
print("The input DNA strand is:", input_strand)
print("The reverse complement is:", reversed_strand)

输出：

The input DNA strand is: ACGTAATTGGCC
The reverse complement is: GGCCAATTACGT

我们还可以使用 translate() 方法找到 DNA 链的反向互补。为此，我们将使用以下步骤。

你可以在以下示例中观察到这一点。

input_strand = "ACGTAATTGGCC"
translation_table = input_strand.maketrans("ATCG", "TAGC")
temp = input_strand[::-1]
reversed_strand = temp.translate(translation_table)
print("The input DNA strand is:", input_strand)
print("The reverse complement is:", reversed_strand)

输出：

The input DNA strand is: ACGTAATTGGCC
The reverse complement is: GGCCAATTACGT

我们还可以使用 Python 中的 Biopython 模块来反向互补 DNA 链。使用以下语句，你可以使用 Python PIP3 的包安装程序安装 Biopython 模块。

pip3 install Bio

Biopython 模块提供了 reverse_complement() 方法来使用 Python 反向互补 DNA 链。当在 DNA 序列对象上调用时，reverse_complement() 方法返回 DNA 序列的反向补码。

我们将使用 Python 中的 reverse_complement() 方法使用以下步骤来获得 DNA 链的反向互补。

from Bio.Seq import Seq

input_strand = "ACGTAATTGGCC"
sequence = Seq(input_strand)
reversed_strand = sequence.reverse_complement()
print("The input DNA strand is:", input_strand)
print("The reverse complement is:", reversed_strand)

输出：

The input DNA strand is: ACGTAATTGGCC
The reverse complement is: GGCCAATTACGT

在本文中，我们讨论了使用 Python 反向互补 DNA 链的各种方法。在所有这些方法中，如果不允许使用外部库，你可以选择使用 translate() 方法的方法；否则，你可以使用 Biopython 模块在 Python 中反向互补 DNA 链。

我们希望你喜欢阅读这篇文章。请继续关注更多内容丰富的文章。

上一篇：从 Python 中的字符串中删除 HTML 标记

下一篇：Python 中最长的公共子字符串

转载请发邮件至 1244347461@qq.com 进行申请，经作者同意之后，转载请以链接形式注明出处

本文地址：

迹忆客专注技术分享

使用 Python 获取 DNA 链的反向互补

相关文章

Pandas DataFrame DataFrame.shift() 函数

Pandas pandas.melt() 函数

Python pandas.pivot_table() 函数

Pandas read_csv()函数

Pandas 追加数据到 CSV 中

Pandas 多列合并

用多个条件过滤 Pandas DataFrame

Pandas loc vs iloc

在 Python 中将 Pandas 系列的日期时间转换为字符串

扫一扫阅读全部技术教程

社交账号

 https://www.github.com/onmpw

 qq:1244347461



最新推荐

教程更新

热门标签

使用 Python 获取 DNA 链的反向互补

相关文章

扫一扫阅读全部技术教程

社交账号  https://www.github.com/onmpw  qq:1244347461 

最新推荐

教程更新

热门标签

社交账号

 https://www.github.com/onmpw

 qq:1244347461

