# HG changeset patch # User petr-novak # Date 1646747931 0 # Node ID 4ea506b39297ffe6b5f92201a73b749b5f93ec66 # Parent 7f1032da7a0aed9c509cdcd97bc2fdcb168655c7 "planemo upload" diff -r 7f1032da7a0a -r 4ea506b39297 README.org --- a/README.org Mon Feb 21 10:35:13 2022 +0000 +++ b/README.org Tue Mar 08 13:58:51 2022 +0000 @@ -29,7 +29,11 @@ #+begin_comment -create tarball for toolshed: +# create tarball for toolshed: tar -czvf ../repeat_annotation_pipeline.tar.gz --exclude test_data \ --exclude .git --exclude tmp --exclude hg_repository --exclude .idea --exclude .gitignore . #+end_comment + + + +- TODO RM take only short name of sequences - validate name / adjust \ No newline at end of file diff -r 7f1032da7a0a -r 4ea506b39297 repeat_annotate_custom.xml --- a/repeat_annotate_custom.xml Mon Feb 21 10:35:13 2022 +0000 +++ b/repeat_annotate_custom.xml Tue Mar 08 13:58:51 2022 +0000 @@ -1,10 +1,15 @@ - + repeatmasker bioconductor-rtracklayer &2 && cp `basename $input`.out $output2 @@ -17,16 +22,23 @@ + + + + + sequence_id#classification_level1/classification_level2/...** + This tools uses RepeatMasker to annotate repetitive sequences in the genome assemblies using custom library of repeats created either from RepeatExplorer output or from DANTE_LTR transposable element library. + Library of repeats can be created from RepeatExplorer output from contigs and TAREAN consensus sequences. + Fasta formatted library of repeats must contain header containing information about classification of repeats as **>sequence_id#classification_level1/classification_level2/...** - Classification in RepeatExplorer based library follows predetermined classification levels. User can however specify additional classification levels or completelly custom classifications. Conflicts in annotations are resolved based on classification hierarchy. + Classification in RepeatExplorer based library follows predetermined classification levels. User can however specify additional classification levels or custom classifications. + Conflicts in annotations are resolved based on classification hierarchy. ]]>