Skip to content

Added geNomad tool for viral classification of assemblies#364

Merged
CarsonJM merged 52 commits intonf-core:devfrom
naobservatory:genomad
Jul 21, 2023
Merged

Added geNomad tool for viral classification of assemblies#364
CarsonJM merged 52 commits intonf-core:devfrom
naobservatory:genomad

Conversation

@PhilPalmer
Copy link
Copy Markdown
Contributor

@PhilPalmer PhilPalmer commented Dec 5, 2022

Hi,

Thanks for developing a great pipeline!

I have been using the nf-core/mag pipeline for a project where I want to investigate what percentage of the assemblies/reads are viral. I noticed that the pipeline is currently more focused on bacteria (eg using the GTD-Tk for taxonomic classification which only supports bacteria/archaea). I, therefore, decided to add a new (optional) process using a recently developed tool geNomad (see paper) which performs viral classification i.e. predicts which assemblies are viral and assigns them to taxa.

Would you be interested in including this tool within the pipeline?

Hopefully, the PR should be most of the way there. I have followed the nf-core contributing guidelines, tested the changes myself on a full-scale dataset and added CI tests. Some things that I have not done are:

  • Used a BioContainers Docker/Singularity container (because the tool is not yet available in BioContainers)
  • Updated the workflow schema/image (because I am not sure how you usually do this)

PR checklist

  • This comment contains a description of changes (with reason).
  • If you've fixed a bug or added code that should be tested, add tests!
    • If you've added a new tool - have you followed the pipeline conventions in the contribution docs
      - [ ] If necessary, also make a PR on the nf-core/mag branch on the nf-core/test-datasets repository.
  • Make sure your code lints (nf-core lint).
  • Ensure the test suite passes (nextflow run . -profile test,docker --outdir <OUTDIR>).
  • Usage Documentation in docs/usage.md is updated.
  • Output Documentation in docs/output.md is updated.
  • CHANGELOG.md is updated.
  • README.md is updated (including new tool citations and authors/contributors).

@github-actions
Copy link
Copy Markdown

github-actions Bot commented Dec 5, 2022

nf-core lint overall result: Passed ✅ ⚠️

Posted for pipeline commit d57ae9a

+| ✅ 156 tests passed       |+
#| ❔   1 tests were ignored |#
!| ❗   1 tests had warnings |!
Details

❗ Test warnings:

  • pipeline_todos - TODO string in methods_description_template.yml: #Update the HTML below to your preferred methods description, e.g. add publication citation for this pipeline

❔ Tests ignored:

  • files_unchanged - File ignored due to lint config: lib/NfcoreTemplate.groovy

✅ Tests passed:

Run details

  • nf-core/tools version 2.9
  • Run at 2023-07-20 15:21:09

@jfy133
Copy link
Copy Markdown
Member

jfy133 commented Dec 5, 2022

Ill let Daniel and Sabrina decide on inclusion, but updating the workflow diagram is just done in inkscape. Feel free to ping me if you don't know how/don't have time and I can do it 👍

@PhilPalmer
Copy link
Copy Markdown
Contributor Author

Great, thanks @jfy133,

I can try updating the workflow diagram. Should I update the figure that includes CheckM?

Also, I'm not sure why the virus classification CI test fails when running here during the mmseqs subprocess. The same command works locally so maybe it's to do with computational resources? I will look into this also

Comment thread docs/images/mag_workflow.png
Comment thread workflows/mag.nf Outdated
Comment thread workflows/mag.nf Outdated
@CarsonJM
Copy link
Copy Markdown
Contributor

CarsonJM commented Jul 10, 2023

@PhilPalmer would you please replace the images in the docs/images directory with the attached files? I am thinking this might be easier than me opening another PR! If there is a way I can directly edit the files, please let me know!

mag_workflow
mag_workflow

Copy link
Copy Markdown
Member

@jfy133 jfy133 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just need the image updated, then it's na OK from me

Comment thread README.md Outdated
@jfy133
Copy link
Copy Markdown
Member

jfy133 commented Jul 14, 2023

@prototaxites @d4straub if we get a ✔️ from you, then I guess we are good to go for a merge?

Comment thread workflows/mag.nf Outdated
================================================================================
*/

if (params.run_genomad){
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One last little one - should we now call this params.run_virus_identification?

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yep, that makes sense to me. Sorry for missing that on the last pass!

Copy link
Copy Markdown
Contributor

@prototaxites prototaxites left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

✅ from me in general! One last little comment, and I think the output docs still need updating to make sure they list the correct info.

Copy link
Copy Markdown
Collaborator

@d4straub d4straub left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@CarsonJM
Copy link
Copy Markdown
Contributor

Submitted another PR to @PhilPalmer !

@jfy133
Copy link
Copy Markdown
Member

jfy133 commented Jul 20, 2023

@CarsonJM once the PR to the fork is in, and tests pass here, you're welcome to merge :)

changed run_genomad to run_virus_identification, and updated output docs
@CarsonJM
Copy link
Copy Markdown
Contributor

🚀🚀🚀
Thanks for all the help with this PR!

@CarsonJM CarsonJM merged commit a8674de into nf-core:dev Jul 21, 2023
@CarsonJM CarsonJM mentioned this pull request Sep 7, 2023
9 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants