Hi, thanks so much for making this program it looks really promising. I had a couple of issues I was hoping to get help with.
My specifications are:
OS is ubuntu 20.04, linux 5.4
conda version 23.9.0
mamba version 1.5.3
python version 3.10.13.final.0
I downloaded cellsnake with no issues
installed the R packages and got the 'all packages are OK' message
and ran cellsnake standard data on the practice fetal-brain dataset provided in the workflow tutorial.
This is the path to the data files (cellsnake) unam@IP:~/Downloads/data$ ls
10X_17_028 10X_17_029
When I run 'cellsnake standard data'
(cellsnake) unam@IP:~/Downloads$ cellsnake standard data
It runs and I get some of the output files but then it terminates due to this following error. Any suggestions on how I can resolve this? Thanks!
Execution halted
[Fri Nov 3 15:52:34 2023]
Error in rule normalization_pca_rds:
jobid: 9
input: analyses/raw/percent_mt10/resolution0.8/10X_17_028.rds
output: analyses/processed/percent_mt10/resolution0.8/10X_17_028.rds
shell:
/home/unam@IP/miniforge3/envs/cellsnake/lib/python3.9/site-packages/cellsnake/scrna/workflow/scripts/scrna-normalization-pca.R --rds analyses/raw/percent_mt10/resolution0.8/10X_17_028.rds --doublet.filter --normalization.method LogNormalize --cpu 1 --scale.factor 10000 --reference BlueprintEncodeData --variable.selection.method vst --nfeature 2000 --resolution 0.8 --output.rds analyses/processed/percent_mt10/resolution0.8/10X_17_028.rds --umap --tsne
(one of the commands exited with non-zero exit code; note that snakemake uses bash strict mode!)
Exiting because a job execution failed. Look above for error message
Complete log: .snakemake/log/2023-11-03T153327.979328.snakemake.log
Traceback (most recent call last):
File "/home/#####/#####/miniforge3/envs/cellsnake/bin/cellsnake", line 10, in
sys.exit(main())
File "/home//#####/#####//miniforge3/envs/cellsnake/lib/python3.9/site-packages/cellsnake/command_line.py", line 379, in main
run_workflow(cli_arguments)
File "/home//#####/#####//miniforge3/envs/cellsnake/lib/python3.9/site-packages/cellsnake/command_line.py", line 351, in run_workflow
subprocess.check_call(str(snakemake_argument),shell=True)
File "/home//#####/#####//miniforge3/envs/cellsnake/lib/python3.9/subprocess.py", line 373, in check_call
raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command 'snakemake --retries 5 --rerun-incomplete -k -j 1 -s /home//#####/#####//miniforge3/envs/cellsnake/lib/python3.9/site-packages/cellsnake/scrna/workflow/Snakefile --config datafolder=data cellsnake_path=/home//#####/#####//miniforge3/envs/cellsnake/lib/python3.9/site-packages/cellsnake/scrna/ show_labels=True marker_plots_per_cluster_n=20 min_percentage_to_plot=5 reduction=cca highly_variable_features=2000 umap_markers_plot=True test_use=wilcox resolution=0.8 singler_ref=BlueprintEncodeData celltypist_model=Immune_All_Low.pkl mapping=org.Hs.eg.db min_features=200 percent_mt=10 max_molecules=Inf min_molecules=0 normalization_method=LogNormalize scale_factor=10000 metadata_column=condition confidence=0.05 organism=hsa percent_rp=0 variable_selection_method=vst doublet_filter=True species=human microbiome_min_features=3 metadata=None microbiome_min_cells=1 max_features=Inf logfc_threshold=0.25 tsne_markers_plot=False min_cells=3 min_hit_groups=4 taxa=genus dims=30 runid=haz29875 option=standard' returned non-zero exit status 1
I then attempted to integrate the data to see if it would run but I get the following output:
(cellsnake) unam:IP:~/Downloads$ cellsnake integrate data
{'10X_17_029', '10X_17_028'}
Building DAG of jobs...
Using shell: /usr/bin/bash
Provided cores: 1 (use --cores to define parallelism)
Rules claiming more threads will be scaled down.
Job stats:
job count
all 1
total 1
Select jobs to execute...
[Fri Nov 3 15:53:54 2023]
localrule all:
jobid: 0
reason: Rules with neither input nor output files are always executed.
resources: tmpdir=/tmp
Hi, thanks so much for making this program it looks really promising. I had a couple of issues I was hoping to get help with.
My specifications are:
OS is ubuntu 20.04, linux 5.4
conda version 23.9.0
mamba version 1.5.3
python version 3.10.13.final.0
I downloaded cellsnake with no issues
installed the R packages and got the 'all packages are OK' message
and ran cellsnake standard data on the practice fetal-brain dataset provided in the workflow tutorial.
This is the path to the data files (cellsnake) unam@IP:~/Downloads/data$ ls
10X_17_028 10X_17_029
When I run 'cellsnake standard data'
(cellsnake) unam@IP:~/Downloads$ cellsnake standard data
It runs and I get some of the output files but then it terminates due to this following error. Any suggestions on how I can resolve this? Thanks!
Execution halted
[Fri Nov 3 15:52:34 2023]
Error in rule normalization_pca_rds:
jobid: 9
input: analyses/raw/percent_mt
10/resolution0.8/10X_17_028.rdsoutput: analyses/processed/percent_mt
10/resolution0.8/10X_17_028.rdsshell:
/home/unam@IP/miniforge3/envs/cellsnake/lib/python3.9/site-packages/cellsnake/scrna/workflow/scripts/scrna-normalization-pca.R --rds analyses/raw/percent_mt
10/resolution0.8/10X_17_028.rds --doublet.filter --normalization.method LogNormalize --cpu 1 --scale.factor 10000 --reference BlueprintEncodeData --variable.selection.method vst --nfeature 2000 --resolution 0.8 --output.rds analyses/processed/percent_mt10/resolution0.8/10X_17_028.rds --umap --tsne(one of the commands exited with non-zero exit code; note that snakemake uses bash strict mode!)
Exiting because a job execution failed. Look above for error message
Complete log: .snakemake/log/2023-11-03T153327.979328.snakemake.log
Traceback (most recent call last):
File "/home/#####/#####/miniforge3/envs/cellsnake/bin/cellsnake", line 10, in
sys.exit(main())
File "/home//#####/#####//miniforge3/envs/cellsnake/lib/python3.9/site-packages/cellsnake/command_line.py", line 379, in main
run_workflow(cli_arguments)
File "/home//#####/#####//miniforge3/envs/cellsnake/lib/python3.9/site-packages/cellsnake/command_line.py", line 351, in run_workflow
subprocess.check_call(str(snakemake_argument),shell=True)
File "/home//#####/#####//miniforge3/envs/cellsnake/lib/python3.9/subprocess.py", line 373, in check_call
raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command 'snakemake --retries 5 --rerun-incomplete -k -j 1 -s /home//#####/#####//miniforge3/envs/cellsnake/lib/python3.9/site-packages/cellsnake/scrna/workflow/Snakefile --config datafolder=data cellsnake_path=/home//#####/#####//miniforge3/envs/cellsnake/lib/python3.9/site-packages/cellsnake/scrna/ show_labels=True marker_plots_per_cluster_n=20 min_percentage_to_plot=5 reduction=cca highly_variable_features=2000 umap_markers_plot=True test_use=wilcox resolution=0.8 singler_ref=BlueprintEncodeData celltypist_model=Immune_All_Low.pkl mapping=org.Hs.eg.db min_features=200 percent_mt=10 max_molecules=Inf min_molecules=0 normalization_method=LogNormalize scale_factor=10000 metadata_column=condition confidence=0.05 organism=hsa percent_rp=0 variable_selection_method=vst doublet_filter=True species=human microbiome_min_features=3 metadata=None microbiome_min_cells=1 max_features=Inf logfc_threshold=0.25 tsne_markers_plot=False min_cells=3 min_hit_groups=4 taxa=genus dims=30 runid=haz29875 option=standard' returned non-zero exit status 1
I then attempted to integrate the data to see if it would run but I get the following output:
(cellsnake) unam:IP:~/Downloads$ cellsnake integrate data
{'10X_17_029', '10X_17_028'}
Building DAG of jobs...
Using shell: /usr/bin/bash
Provided cores: 1 (use --cores to define parallelism)
Rules claiming more threads will be scaled down.
Job stats:
job count
all 1
total 1
Select jobs to execute...
[Fri Nov 3 15:53:54 2023]
localrule all:
jobid: 0
reason: Rules with neither input nor output files are always executed.
resources: tmpdir=/tmp