【Question title】: Bioinformatics filtering/selecting based on multiple conditions
【Posted】: 2020-10-13 22:21:55
【Question description】:

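Hello, I am trying to select the distinct OTUs that have certain KEGG pathways, and I would like to know why the following approach does not work, or what you would suggest instead. I have also tried dplyr with the =, != and <> operators, without success. Any suggestions?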

Group1<-sqldf("SELECT DISTINCT OTU FROM 'retro.flux.avg.OTU'
WHERE Pathway IN ('ko00362','ko00625','ko00361','ko00623','ko00622','ko00633','ko00642','ko00626','ko00624')")
          AND Pathway IN ('ko02030')")

Group2<-sqldf("SELECT DISTINCT OTU FROM 'retro.flux.avg.OTU'
WHERE Pathway IN ('ko00362','ko00625','ko00361','ko00623','ko00622','ko00633','ko00642','ko00626','ko00624')")
          AND Pathway NOT IN ('ko02030')")

Group3<-sqldf("SELECT DISTINCT OTU FROM 'retro.flux.avg.OTU'
WHERE Pathway NOT IN ('ko00362','ko00625','ko00361','ko00623','ko00622','ko00633','ko00642','ko00626','ko00624')")
          AND Pathway IN ('ko02030')")


Group4<-sqldf("SELECT DISTINCT OTU FROM 'retro.flux.avg.OTU'
WHERE Pathway NOT IN ('ko02030','ko00362','ko00625','ko00361','ko00623','ko00622','ko00633','ko00642','ko00626','ko00624')")

【Question discussion】:

    Tags: sql r bioinformatics sqldf


    【Solution 1】:

    Here is my take.

    Instead of

    Group1<-sqldf("SELECT DISTINCT OTU FROM 'retro.flux.avg.OTU'
    WHERE Pathway IN ('ko00362','ko00625','ko00361','ko00623','ko00622','ko00633','ko00642','ko00626','ko00624')") 
    AND Pathway IN ('ko02030')")
    

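    simply use the following.

    I removed the extra ") at the end of the first WHERE line; I also noticed one more error in the query and fixed it.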

    Group1<-sqldf("SELECT DISTINCT OTU FROM 'retro.flux.avg.OTU'
    WHERE Pathway IN ('ko00362','ko00625','ko00361','ko00623','ko00622','ko00633','ko00642','ko00626','ko00624')")
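
Note that the query above keeps only the first condition. If the goal is to sort OTUs by whether they have any of the listed pathways and whether they also have ko02030, a row-level WHERE cannot express it, assuming the table holds one row per (OTU, Pathway) pair: a single row carries one Pathway value, so `Pathway IN (list) AND Pathway IN ('ko02030')` never matches, and a row-level `NOT IN` only tells you that an OTU has some other pathway, not that it lacks the listed ones. A minimal sketch of one way around this is to aggregate per OTU first. The data frame `otu_pathways`, its contents, and the column names `has_list` and `has_target` are hypothetical stand-ins, and the reading of the four groups is my assumption about what the question intends.

```r
library(sqldf)

# Hypothetical toy data standing in for retro.flux.avg.OTU:
# one row per (OTU, Pathway) pair.
otu_pathways <- data.frame(
  OTU     = c("otu1", "otu1", "otu2", "otu3", "otu4"),
  Pathway = c("ko00362", "ko02030", "ko00625", "ko02030", "ko00010"),
  stringsAsFactors = FALSE
)

# Collapse to one row per OTU. In SQLite (sqldf's default backend) a
# comparison evaluates to 0 or 1, so MAX() acts as "true for any row".
#   has_list   = 1 if the OTU has at least one pathway from the list
#   has_target = 1 if the OTU has ko02030
flags <- sqldf("
  SELECT OTU,
         MAX(Pathway IN ('ko00362','ko00625','ko00361','ko00623','ko00622',
                         'ko00633','ko00642','ko00626','ko00624')) AS has_list,
         MAX(Pathway = 'ko02030') AS has_target
  FROM otu_pathways
  GROUP BY OTU
")

# Assumed meaning of the four groups from the question.
Group1 <- flags$OTU[flags$has_list == 1 & flags$has_target == 1]  # both
Group2 <- flags$OTU[flags$has_list == 1 & flags$has_target == 0]  # list only
Group3 <- flags$OTU[flags$has_list == 0 & flags$has_target == 1]  # ko02030 only
Group4 <- flags$OTU[flags$has_list == 0 & flags$has_target == 0]  # neither
```

To run this against the real data, replace `otu_pathways` in the FROM clause with the quoted table name used in the question ('retro.flux.avg.OTU'). Computing both flags in one grouped query keeps the four groups mutually exclusive by construction, which four separate WHERE queries do not guarantee.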
    

    【Discussion】:
