Added geNomad tool for viral classification of assemblies#364
Added geNomad tool for viral classification of assemblies#364CarsonJM merged 52 commits intonf-core:devfrom
Conversation
|
|
Ill let Daniel and Sabrina decide on inclusion, but updating the workflow diagram is just done in inkscape. Feel free to ping me if you don't know how/don't have time and I can do it 👍 |
|
Great, thanks @jfy133, I can try updating the workflow diagram. Should I update the figure that includes CheckM? Also, I'm not sure why the virus classification CI test fails when running here during the |
Co-authored-by: Carson J Miller <[email protected]>
Co-authored-by: Carson J Miller <[email protected]>
Changed GENOMAD subworkflow to VIRUS_IDENTIFICATION, and added updated schematic
|
@PhilPalmer would you please replace the images in the docs/images directory with the attached files? I am thinking this might be easier than me opening another PR! If there is a way I can directly edit the files, please let me know! |
jfy133
left a comment
There was a problem hiding this comment.
Just need the image updated, then it's na OK from me
Co-authored-by: James A. Fellows Yates <[email protected]>
|
@prototaxites @d4straub if we get a ✔️ from you, then I guess we are good to go for a merge? |
| ================================================================================ | ||
| */ | ||
|
|
||
| if (params.run_genomad){ |
There was a problem hiding this comment.
One last little one - should we now call this params.run_virus_identification?
There was a problem hiding this comment.
Yep, that makes sense to me. Sorry for missing that on the last pass!
prototaxites
left a comment
There was a problem hiding this comment.
✅ from me in general! One last little comment, and I think the output docs still need updating to make sure they list the correct info.
|
Submitted another PR to @PhilPalmer ! |
|
@CarsonJM once the PR to the fork is in, and tests pass here, you're welcome to merge :) |
changed run_genomad to run_virus_identification, and updated output docs
|
🚀🚀🚀 |
Hi,
Thanks for developing a great pipeline!
I have been using the nf-core/mag pipeline for a project where I want to investigate what percentage of the assemblies/reads are viral. I noticed that the pipeline is currently more focused on bacteria (eg using the GTD-Tk for taxonomic classification which only supports bacteria/archaea). I, therefore, decided to add a new (optional) process using a recently developed tool geNomad (see paper) which performs viral classification i.e. predicts which assemblies are viral and assigns them to taxa.
Would you be interested in including this tool within the pipeline?
Hopefully, the PR should be most of the way there. I have followed the nf-core contributing guidelines, tested the changes myself on a full-scale dataset and added CI tests. Some things that I have not done are:
PR checklist
- [ ] If necessary, also make a PR on the nf-core/mag branch on the nf-core/test-datasets repository.nf-core lint).nextflow run . -profile test,docker --outdir <OUTDIR>).docs/usage.mdis updated.docs/output.mdis updated.CHANGELOG.mdis updated.README.mdis updated (including new tool citations and authors/contributors).