Mercurial > repos > fubar > egapx_runner
diff egapx_runner.xml @ 8:1680e72e27be draft default tip
planemo upload for repository https://github.com/ncbi/egapx commit bdbe05027c2c40e217a2ff0c9e0556450c443e54
author | fubar |
---|---|
date | Mon, 05 Aug 2024 03:56:41 +0000 |
parents | 9c778770514f |
children |
line wrap: on
line diff
--- a/egapx_runner.xml Sun Aug 04 13:21:59 2024 +0000 +++ b/egapx_runner.xml Mon Aug 05 03:56:41 2024 +0000 @@ -1,11 +1,14 @@ -<tool name="egapx_runner" id="egapx_runner" version="6.0.1" profile="22.05"> +<tool name="egapx_runner" id="egapx_runner" version="@TOOL_VERSION@" profile="22.05"> <description>Runs egapx</description> + <macros> + <token name="@TOOL_VERSION@">0.02-alpha</token> + </macros> <requirements> <requirement version="3.12.3" type="package">python</requirement> <requirement version="24.04.4-0" type="package">nextflow</requirement> <requirement version="6.0.1" type="package">pyyaml</requirement> </requirements> - <version_command><![CDATA[echo "6.0.1"]]></version_command> + <version_command><![CDATA[echo "@TOOL_VERSION@"]]></version_command> <command><![CDATA[mkdir -p ./egapx_config && #set econfigfile = $econfig + '.config' cp '$__tool_directory__/ui/assets/config/executor/$econfigfile' ./egapx_config/ && @@ -73,8 +76,10 @@ The simplest possible example is shown below - can be cut/paste into a history dataset in the upload tool. -*./examples/input_D_farinae_small.yaml* is included in the examples linked above. RNA-seq data is provided as URI to the reads FASTA files. -These FASTA files are a sampling of the reads from the complete SRA read files to expedite testing. +*./examples/input_D_farinae_small.yaml* is shown below and can be cut and pasted into the upload form to create a yaml file. +RNA-seq data is provided as URI to the reads FASTA files. + +input_D_farinae_small.yaml :: @@ -87,7 +92,22 @@ - https://ftp.ncbi.nlm.nih.gov/genomes/TOOLS/EGAP/data/Dermatophagoides_farinae_small/SRR9005248.2 +input_Gavia_stellata.yaml +:: + + genome: https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/030/936/135/GCF_030936135.1_bGavSte3.hap2/GCF_030936135.1_bGavSte3.hap2_genomic.fna.gz + reads: txid37040[Organism] AND biomol_transcript[properties] NOT SRS024887[Accession] + taxid: 37040 + +input_C_longicornis.yaml + +:: + + genome: https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/029//603/195/GCF_029603195.1_ASM2960319v2/GCF_029603195.1_ASM2960319v2_genomic.fna.gz + reads: txid2530218[Organism] AND biomol_transcript[properties] NOT SRS024887[Accession] + taxid: 2530218 + Purpose ======== @@ -109,7 +129,8 @@ EGAPx is the publicly accessible version of the updated NCBI [Eukaryotic Genome Annotation Pipeline](https://www.ncbi.nlm.nih.gov/genome/annotation_euk/process/). -EGAPx takes an assembly fasta file, a taxid of the organism, and RNA-seq data. Based on the taxid, EGAPx will pick protein sets and HMM models. The pipeline runs `miniprot` to align protein sequences, and `STAR` to align RNA-seq to the assembly. Protein alignments and RNA-seq read alignments are then passed to `Gnomon` for gene prediction. In the first step of `Gnomon`, the short alignments are chained together into putative gene models. In the second step, these predictions are further supplemented by _ab-initio_ predictions based on HMM models. The final annotation for the input assembly is produced as a `gff` file. +EGAPx takes an assembly fasta file, a taxid of the organism, and RNA-seq data. Based on the taxid, EGAPx will pick protein sets and HMM models. The pipeline runs `miniprot` to align protein sequences, and `STAR` to align RNA-seq to the assembly. Protein alignments and RNA-seq read alignments are then passed to `Gnomon` for gene prediction. In the first step of `Gnomon`, the short alignments are chained together into putative gene models. +In the second step, these predictions are further supplemented by *ab-initio* predictions based on HMM models. The final annotation for the input assembly is produced as a `gff` file. **Security Notice:**