【问题标题】:ERROR in Biostrings while trying to MSA using ggmsa尝试使用 ggmsa 进行 MSA 时出现 Biostrings 中的错误
【发布时间】:2022-11-02 21:06:52
【问题描述】:

我想在 3 个物种(大鼠、斑马鱼和 pupfish)中做一个相同肽的 msa,并将它与我拥有的 2 个合成肽(M35 和 M871)匹配(发现相同的身份/差异),但我得到以下构建向量后的错误:

Library (ggmsa)
galanin_table <- c("MACSKHLVLFLTILLSLAETPDSAPAHRGRGGWTLNSAGYLLGPVLHLSSKANQGRKTDSALEILDLWKAIDGLPYSRSPRMTKRSMGETFVKPRTGDLRIVDKNVPDEEATLNL", "Rat", "MHRCVGGVCVSLIVCAFLTETLGMVIAAKEKRGWTLNSAGYLLGPRRIDHLIQIKDTPSARGREDLLGQYAIDSHRSLSDKHGLAGKREMPLDEDFKTGALRIADEDVVHTIIDFLSYLKLKEIGALDSLPSSLTSEEISQP", "Zebrafish", "MQRSFAVFCVSLIFCATLSETIGLVIAAKEKRGWTLNSAGYLLGPRRIDHLIQIKDSPSARGRDELVNQYGIDGHRTLGDKAGLAGKRDMAQEDDVRTGPLRIGDEDIIHTVIDFLSYLKLKEMGALDSLPSPLTSDELANP", "Pupfish", "GWTLNSAGYLLGPPPGFSPFR","M35", "WTLNSAGYLLGPEHPPPALALA","M871")

galanin_matrix <- matrix(galanin_table, byrow=T, nrow=5)
galanin_table <- as.data.frame(galanin_matrix, stringsAsFactors = F)
colnames(galanin_table) <- c("Sequences", "Species")
galanin_table <- as.data.frame(galanin_table)
galanin_list <- as.list(galanin_table)
galanin_asvector <- as.vector(galanin_list)
galanin_asvector_ss <- Biostrings::AAStringSet(x= galanin_asvector)

(函数(类,fdef,mtable)中的错误: 无法为签名“字符”找到函数“seqtype”的继承方法





可能我以错误的方式构建向量

【问题讨论】:

    标签: r bioinformatics


    【解决方案1】:

    您肯定已经开始使用一种有趣的方法将序列导入 R。ggmsa() 期望系统文件以识别格式(如 FASTA)或序列的 XStringSet 对象来识别序列。我不知道你是否真的将你的序列存储在一个字符串中,或​​者这只是在这个例子中包含它们的一个简单途径,但假设这就是你所拥有的,这应该让你开始:

    # load in decipher for the aligner
    suppressMessages(library(DECIPHER))
    # load in ggmsa
    library(ggmsa)
    
    # your sequences
    galanin_table <- c("MACSKHLVLFLTILLSLAETPDSAPAHRGRGGWTLNSAGYLLGPVLHLSSKANQGRKTDSALEILDLWKAIDGLPYSRSPRMTKRSMGETFVKPRTGDLRIVDKNVPDEEATLNL", "Rat", "MHRCVGGVCVSLIVCAFLTETLGMVIAAKEKRGWTLNSAGYLLGPRRIDHLIQIKDTPSARGREDLLGQYAIDSHRSLSDKHGLAGKREMPLDEDFKTGALRIADEDVVHTIIDFLSYLKLKEIGALDSLPSSLTSEEISQP", "Zebrafish", "MQRSFAVFCVSLIFCATLSETIGLVIAAKEKRGWTLNSAGYLLGPRRIDHLIQIKDSPSARGRDELVNQYGIDGHRTLGDKAGLAGKRDMAQEDDVRTGPLRIGDEDIIHTVIDFLSYLKLKEMGALDSLPSPLTSDELANP", "Pupfish", "GWTLNSAGYLLGPPPGFSPFR","M35", "WTLNSAGYLLGPEHPPPALALA","M871")
    
    # grab your sequnces, c(T,F) will recycle over the original vector to select
    # a 1,3,5,7,etc pattern
    # conversely c(F,T) can grab the names in the opposite pattern
    seqs <- AAStringSet(galanin_table[c(T,F)])
    names(seqs) <- galanin_table[c(F,T)]
    
    # align your sequences
    ali <- AlignSeqs(seqs)
    
    # call ggmsa
    ggmsa(msa = ali,
          color = "Clustal", 
          font = "DroidSansMono",
          char_width = 0.5,
          seq_name = TRUE)
    
    
    

    祝你好运!

    【讨论】:

      猜你喜欢
      • 2020-06-19
      • 2023-04-03
      • 2013-06-16
      • 1970-01-01
      • 1970-01-01
      • 1970-01-01
      • 2015-09-26
      • 1970-01-01
      • 2012-02-05
      相关资源
      最近更新 更多