view test-data/predict_augustus/Genus_species.discrepency.report.txt @ 0:b5ec3983deda draft

"planemo upload commit 9613152729099079c7465c3d5d42005ef22ca91e"
author iuc
date Thu, 26 Aug 2021 06:57:03 +0000

Discrepancy Report Results

Summary
DISC_PROTEIN_NAMES:All proteins have same name "hypothetical protein"
DISC_SOURCE_QUALS_ASNDISC:taxname (all present, all same)
DISC_FEATURE_COUNT:gene: 18 present
DISC_FEATURE_COUNT:CDS: 18 present
DISC_FEATURE_COUNT:mRNA: 18 present
DISC_COUNT_NUCLEOTIDES:4 nucleotide Bioseqs are present
JOINED_FEATURES:32 features have joined locations.
NO_ANNOTATION:2 bioseqs have no features
DISC_QUALITY_SCORES:Quality scores are missing on all sequences.
ONCALLER_COMMENT_PRESENT:4 comment descriptors were found (all same)
MISSING_GENOMEASSEMBLY_COMMENTS:4 bioseqs are missing GenomeAssembly structured comments
MOLTYPE_NOT_MRNA:4 molecule types are not set as mRNA.
TECHNIQUE_NOT_TSA:4 technique are not set as TSA
MISSING_STRUCTURED_COMMENT:4 sequences do not include structured comments.
MISSING_PROJECT:22 sequences do not include project.
DISC_INCONSISTENT_MOLINFO_TECH:Molinfo Technique Report (some missing, all same)


Detailed Report

DiscRep_ALL:DISC_PROTEIN_NAMES::All proteins have same name "hypothetical protein"

DiscRep_ALL:DISC_SOURCE_QUALS_ASNDISC::taxname (all present, all same)
DiscRep_SUB:DISC_SOURCE_QUALS_ASNDISC::4 sources have 'Genus species' for taxname
DiscRep_ALL:DISC_FEATURE_COUNT::gene: 18 present
DiscRep_ALL:DISC_FEATURE_COUNT::CDS: 18 present
DiscRep_ALL:DISC_FEATURE_COUNT::mRNA: 18 present
DiscRep_ALL:DISC_COUNT_NUCLEOTIDES::4 nucleotide Bioseqs are present
genome:sample (length 215740)
genome:sample2 (length 2030)
genome:sample3 (length 2100)
genome:sample4 (length 7560)

DiscRep_ALL:JOINED_FEATURES::32 features have joined locations.
DiscRep_SUB:JOINED_FEATURES::32 features have joined location but no exception
genome:CDS	hypothetical protein	(sample:2126-2199, 2258-3224, 3284-3490, 3549-3863)	FUN_000002
genome:mRNA	hypothetical protein	(sample4:2126-2199, 2258-3224, 3284-3490, 3549-3863)	FUN_000017
genome:CDS	hypothetical protein	(sample4:2126-2199, 2258-3224, 3284-3490, 3549-3863)	FUN_000017
genome:mRNA	hypothetical protein	(sample:2126-2199, 2258-3224, 3284-3490, 3549-3863)	FUN_000002
genome:CDS	hypothetical protein	(sample4:c5494-4930, c4759-4248)	FUN_000018
genome:mRNA	hypothetical protein	(sample4:c5494-4930, c4759-4248)	FUN_000018
genome:mRNA	hypothetical protein	(sample:c5802-5797, c5539-4883)	FUN_000003
genome:CDS	hypothetical protein	(sample:c5802-5797, c5539-4883)	FUN_000003
genome:CDS	hypothetical protein	(sample:c10557-10549, c10462-8696)	FUN_000004
genome:mRNA	hypothetical protein	(sample:c10557-10549, c10462-8696)	FUN_000004
genome:mRNA	hypothetical protein	(sample:c15214-15209, c14648-14247)	FUN_000005
genome:CDS	hypothetical protein	(sample:c15214-15209, c14648-14247)	FUN_000005
genome:CDS	hypothetical protein	(sample:c21705-21700, c21515-19533)	FUN_000006
genome:mRNA	hypothetical protein	(sample:c21705-21700, c21515-19533)	FUN_000006
genome:CDS	hypothetical protein	(sample:c35679-35675, c35655-35648, c35594-34843)	FUN_000007
genome:mRNA	hypothetical protein	(sample:c35679-35675, c35655-35648, c35594-34843)	FUN_000007
genome:CDS	hypothetical protein	(sample:40223-40396, 40659-41234)	FUN_000008
genome:mRNA	hypothetical protein	(sample:40223-40396, 40659-41234)	FUN_000008
genome:mRNA	hypothetical protein	(sample:41267-41274, 41437-41444, 41707-42107)	FUN_000009
genome:CDS	hypothetical protein	(sample:41267-41274, 41437-41444, 41707-42107)	FUN_000009
genome:CDS	hypothetical protein	(sample:87202-87207, 88054-88320)	FUN_000010
genome:mRNA	hypothetical protein	(sample:87202-87207, 88054-88320)	FUN_000010
genome:CDS	hypothetical protein	(sample:94727-94732, 94873-95016, 95449-95583)	FUN_000011
genome:mRNA	hypothetical protein	(sample:94727-94732, 94873-95016, 95449-95583)	FUN_000011
genome:CDS	hypothetical protein	(sample:133134-133142, 133209-134539, 134668-135510, 135569-136346)	FUN_000012
genome:mRNA	hypothetical protein	(sample:133134-133142, 133209-134539, 134668-135510, 135569-136346)	FUN_000012
genome:CDS	hypothetical protein	(sample:144294-144551, 149012-149244, 149367-149588, 149654-149897, 149952-150112, 150174-150248, 151966-152072, 152314-152429, 152496-152751, 153651-159010, 159150-164491, 167135-168360, 168722-169208, 169350-169416)	FUN_000013
genome:mRNA	hypothetical protein	(sample:144294-144551, 149012-149244, 149367-149588, 149654-149897, 149952-150112, 150174-150248, 151966-152072, 152314-152429, 152496-152751, 153651-159010, 159150-164491, 167135-168360, 168722-169208, 169350-169416)	FUN_000013
genome:CDS	hypothetical protein	(sample:192049-192067, 193549-193658, 194041-194455, 194518-194669)	FUN_000014
genome:mRNA	hypothetical protein	(sample:192049-192067, 193549-193658, 194041-194455, 194518-194669)	FUN_000014
genome:CDS	hypothetical protein	(sample:c210553-210548, c210474-209044)	FUN_000015
genome:mRNA	hypothetical protein	(sample:c210553-210548, c210474-209044)	FUN_000015

DiscRep_ALL:NO_ANNOTATION::2 bioseqs have no features
genome:sample2 (length 2030)
genome:sample3 (length 2100)

DiscRep_ALL:DISC_QUALITY_SCORES::Quality scores are missing on all sequences.

DiscRep_ALL:ONCALLER_COMMENT_PRESENT::4 comment descriptors were found (all same)
genome:sample:"Annotated using 1.8.7"
genome:sample2:"Annotated using 1.8.7"
genome:sample3:"Annotated using 1.8.7"
genome:sample4:"Annotated using 1.8.7"

DiscRep_ALL:MISSING_GENOMEASSEMBLY_COMMENTS::4 bioseqs are missing GenomeAssembly structured comments
genome:sample (length 215740)
genome:sample2 (length 2030)
genome:sample3 (length 2100)
genome:sample4 (length 7560)

DiscRep_ALL:MOLTYPE_NOT_MRNA::4 molecule types are not set as mRNA.
genome:sample (length 215740)
genome:sample2 (length 2030)
genome:sample3 (length 2100)
genome:sample4 (length 7560)

DiscRep_ALL:TECHNIQUE_NOT_TSA::4 technique are not set as TSA
genome:sample (length 215740)
genome:sample2 (length 2030)
genome:sample3 (length 2100)
genome:sample4 (length 7560)

DiscRep_ALL:MISSING_STRUCTURED_COMMENT::4 sequences do not include structured comments.
genome:sample (length 215740)
genome:sample2 (length 2030)
genome:sample3 (length 2100)
genome:sample4 (length 7560)

DiscRep_ALL:MISSING_PROJECT::22 sequences do not include project.
genome:sample (length 215740)
genome:ncbi:FUN_000001-T1 (length 124)
genome:ncbi:FUN_000002-T1 (length 520)
genome:ncbi:FUN_000003-T1 (length 220)
genome:ncbi:FUN_000004-T1 (length 591)
genome:ncbi:FUN_000005-T1 (length 135)
genome:ncbi:FUN_000006-T1 (length 662)
genome:ncbi:FUN_000007-T1 (length 254)
genome:ncbi:FUN_000008-T1 (length 249)
genome:ncbi:FUN_000009-T1 (length 138)
genome:ncbi:FUN_000010-T1 (length 90)
genome:ncbi:FUN_000011-T1 (length 94)
genome:ncbi:FUN_000012-T1 (length 986)
genome:ncbi:FUN_000013-T1 (length 4717)
genome:ncbi:FUN_000014-T1 (length 231)
genome:ncbi:FUN_000015-T1 (length 478)
genome:sample2 (length 2030)
genome:sample3 (length 2100)
genome:sample4 (length 7560)
genome:ncbi:FUN_000016-T1 (length 124)
genome:ncbi:FUN_000017-T1 (length 520)
genome:ncbi:FUN_000018-T1 (length 358)

DiscRep_ALL:DISC_INCONSISTENT_MOLINFO_TECH::Molinfo Technique Report (some missing, all same)
DiscRep_SUB:DISC_INCONSISTENT_MOLINFO_TECH::technique (all missing)
DiscRep_SUB:DISC_INCONSISTENT_MOLINFO_TECH::4 Molinfos are missing field technique
genome:sample (length 215740)
genome:sample2 (length 2030)
genome:sample3 (length 2100)
genome:sample4 (length 7560)