【发布时间】:2019-10-12 19:59:27
【问题描述】:
我需要在集群中运行一个 snakemake 规则,因此对于某些规则,我需要一些工具和库来加载,而这些工具是独立于/专有于其他规则的。在这种情况下,我如何在我的蛇形规则中指定这些。例如,对于rule score,我目前需要module load r/3.5.1 和export R_lib =/user/tools/software,我在运行snakemake 之前在命令行中分别运行这些行。但是,如果有办法在 env 的规则内做到这一点,那就太好了。
- 问题,
我有如下规则,
rule score:
input:
count=os.path.join(config['general']['paths']['outdir'], 'count_expression', '{sample}.tsv'),
libsize=os.path.join(config['general']['paths']['outdir'], 'count_expression', '{sample}.size_tsv')
params:
result_dir=os.path.join(config['general']['paths']['outdir'], 'score'),
cancertype=config['general']['paths']['cancertype'],
sample_id=expand('{sample}',sample=samples['sample'].unique())
output:
files=os.path.join(config['general']['paths']['outdir'], 'score', '{sample}_bg_scores.tsv', '{sample}_tp_scores.tsv')
shell:
'mkdir -p {params.result_dir};Rscript {config[general][paths][tool]} {params.result_dir} {params.cancertype} {params.sample_id} {input.count} {input.libsize}'
我对上述代码 sn-p 的实际行为是:
shell:
mkdir -p /cluster/user/snakemake_test/results_april30/score;Rscript /cluster/home/user/Projects/R_scripts/scoretool.R /cluster/user/snakemake_test/results_april30/score DMC GNMS4 MRT5T /cluster/projects/test/results/exp/MRT5T.tsv /cluster/projects/test/results/Exp/MRT5T.size.tsv
然而,预期的行为是:
shell:
mkdir -p /cluster/user/snakemake_test/results_april30/score;Rscript /cluster/home/user/Projects/R_scripts/scoretool.R /cluster/user/snakemake_test/results_april30/score DMC MRT5T /cluster/projects/test/results/exp/MRT5T.tsv /cluster/projects/test/results/Exp/MRT5T.size.tsv
对于第二个样本,
shell:
mkdir -p /cluster/user/snakemake_test/results_april30/score;Rscript /cluster/home/user/Projects/R_scripts/scoretool.R /cluster/user/snakemake_test/results_april30/score DMC GNMS4 /cluster/projects/test/results/exp/GNMS4.tsv /cluster/projects/test/results/Exp/GNMS4.ize.tsv
我需要变量sample_d ['GNMS4', 'MRT5T'] 应该单独使用,而不是放在一个shell命令行中。
【问题讨论】:
标签: python function wildcard expand snakemake