Repository 'fml_gff3togtf'
hg clone https://toolshed.g2.bx.psu.edu/repos/vipints/fml_gff3togtf

Changeset 2:db3c67b03d55 (2014-06-10)
Previous changeset 1:c6cf723c2ed4 (2014-06-10) Next changeset 3:ff2c2e6f4ab3 (2014-06-11)
Commit message:
Deleted selected files
removed:
GFFtools-GX
fml_gff_converter_programs/README
fml_gff_converter_programs/galaxy/bed_to_gff3.xml
fml_gff_converter_programs/galaxy/gff3_to_bed.xml
fml_gff_converter_programs/galaxy/gff3_to_gtf.xml
fml_gff_converter_programs/galaxy/gtf_to_gff3.xml
fml_gff_converter_programs/scripts/bed_to_gff3_converter.py
fml_gff_converter_programs/scripts/gff3_to_bed_converter.py
fml_gff_converter_programs/scripts/gff3_to_gtf_converter.pl
fml_gff_converter_programs/scripts/gtf_to_gff3_converter.py
fml_gff_converter_programs/t/AceView_gff3_to_gtf.gtf
fml_gff_converter_programs/t/AceView_ncbi_37.gff3
fml_gff_converter_programs/t/AceView_ncbi_37.gtf
fml_gff_converter_programs/t/Aly_JGI.bed
fml_gff_converter_programs/t/Aly_JGI.gff3
fml_gff_converter_programs/t/ENSEMBL_mm9.gff3
fml_gff_converter_programs/t/ENSEMBL_mm9.gtf
fml_gff_converter_programs/t/ENSEMBL_mm9_gff3_to_gtf.gtf
fml_gff_converter_programs/t/JGI_genes.gff3
fml_gff_converter_programs/t/JGI_genes.gtf
fml_gff_converter_programs/t/MB7_3R.bed
fml_gff_converter_programs/t/MB7_3R.gff3
fml_gff_converter_programs/t/UCSC_transcripts.gff3
fml_gff_converter_programs/t/UCSC_transcripts.gtf
fml_gff_converter_programs/t/ccds_genes.bed
fml_gff_converter_programs/t/ccds_genes.gff3
fml_gff_converter_programs/t/hs_2009.bed
fml_gff_converter_programs/t/hs_2009.gff3
b
diff -r c6cf723c2ed4 -r db3c67b03d55 GFFtools-GX
--- a/GFFtools-GX Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
[
b'@@ -1,906 +0,0 @@\n-\n-\n-\n-\n-\n-<!DOCTYPE html>\n-<html class="   ">\n-  <head prefix="og: http://ogp.me/ns# fb: http://ogp.me/ns/fb# object: http://ogp.me/ns/object# article: http://ogp.me/ns/article# profile: http://ogp.me/ns/profile#">\n-    <meta charset=\'utf-8\'>\n-    <meta http-equiv="X-UA-Compatible" content="IE=edge">\n-    \n-    \n-    <title>vipints/GFFtools-GX \xc2\xb7 GitHub</title>\n-    <link rel="search" type="application/opensearchdescription+xml" href="/opensearch.xml" title="GitHub" />\n-    <link rel="fluid-icon" href="https://github.com/fluidicon.png" title="GitHub" />\n-    <link rel="apple-touch-icon" sizes="57x57" href="/apple-touch-icon-114.png" />\n-    <link rel="apple-touch-icon" sizes="114x114" href="/apple-touch-icon-114.png" />\n-    <link rel="apple-touch-icon" sizes="72x72" href="/apple-touch-icon-144.png" />\n-    <link rel="apple-touch-icon" sizes="144x144" href="/apple-touch-icon-144.png" />\n-    <meta property="fb:app_id" content="1401488693436528"/>\n-\n-      <meta content="@github" name="twitter:site" /><meta content="summary" name="twitter:card" /><meta content="vipints/GFFtools-GX" name="twitter:title" /><meta content="GFFtools-GX - gfftools - Galaxy toolshed repository" name="twitter:description" /><meta content="https://avatars3.githubusercontent.com/u/1714549?s=400" name="twitter:image:src" />\n-<meta content="GitHub" property="og:site_name" /><meta content="object" property="og:type" /><meta content="https://avatars3.githubusercontent.com/u/1714549?s=400" property="og:image" /><meta content="vipints/GFFtools-GX" property="og:title" /><meta content="https://github.com/vipints/GFFtools-GX" property="og:url" /><meta content="GFFtools-GX - gfftools - Galaxy toolshed repository" property="og:description" />\n-\n-    <link rel="assets" href="https://assets-cdn.github.com/">\n-    <link rel="conduit-xhr" href="https://ghconduit.com:25035">\n-    \n-\n-    <meta name="msapplication-TileImage" content="/windows-tile.png" />\n-    <meta name="msapplication-TileColor" content="#ffffff" />\n-    <meta name="selected-link" value="repo_source" data-pjax-transient />\n-      <meta name="google-analytics" content="UA-3769691-2">\n-\n-    <meta content="collector-new.githubapp.com" name="octolytics-host" /><meta content="collector-cdn.github.com" name="octolytics-script-host" /><meta content="github" name="octolytics-app-id" /><meta content="8076FA06:4760:623B615:53979B84" name="octolytics-dimension-request_id" />\n-    \n-\n-    \n-    \n-    <link rel="icon" type="image/x-icon" href="https://assets-cdn.github.com/favicon.ico" />\n-\n-\n-    <meta content="authenticity_token" name="csrf-param" />\n-<meta content="8RFayg0epQkPCssjmCYy7wh4dPucfoF71qZu96AGi6wPKKHwIkpQgMBnz5kb6Q/LmWGaXKr5cfsR3UrgWGhQjA==" name="csrf-token" />\n-\n-    <link href="https://assets-cdn.github.com/assets/github-2a5fe476db9a69ca883f024853575d588b2bd187.css" media="all" rel="stylesheet" type="text/css" />\n-    <link href="https://assets-cdn.github.com/assets/github2-a1fd2161b2d8ea3aef1c3d15c79772ff80eb5523.css" media="all" rel="stylesheet" type="text/css" />\n-    \n-\n-\n-    <meta http-equiv="x-pjax-version" content="cc9167eb91e25e1e59deba1003934246">\n-\n-      \n-  <meta name="description" content="GFFtools-GX - gfftools - Galaxy toolshed repository" />\n-\n-  <meta content="1714549" name="octolytics-dimension-user_id" /><meta content="vipints" name="octolytics-dimension-user_login" /><meta content="19711908" name="octolytics-dimension-repository_id" /><meta content="vipints/GFFtools-GX" name="octolytics-dimension-repository_nwo" /><meta content="true" name="octolytics-dimension-repository_public" /><meta content="false" name="octolytics-dimension-repository_is_fork" /><meta content="19711908" name="octolytics-dimension-repository_network_root_id" /><meta content="vipints/GFFtools-GX" name="octolytics-dimension-repository_network_root_nwo" />\n-  <link href="https://github.com/vipints/GFFtools-GX/commits/master.atom" rel="alternate" title="Recent Commits to GFFtools-GX:master'..b'rom GTF to valid GFF3.\n-    gff_to_gtf.py: convert data from GFF3 to GTF.\n-    bed_to_gff.py: convert data from a 12 column UCSC wiggle BED format to GFF3.\n-    gff_to_bed.py: convert gene transcript annotation from GFF3 to UCSC wiggle 12 column BED format.\n-    gbk_to_gff.py: convert data from genbank format to GFF. \n-    GFFParser.py: Parse GFF/GTF files.  \n-    helper.py: Utility functions.\n-\n-test-data: Test data set. (move to your galaxy_root_folder/test-data/)\n-    \n-    You may need to move the test files into your test-data directory so galaxy can find them. \n-    If you want to run the functional tests eg as: \n-\n-    exmaple: \n-    sh run_functional_tests.sh -id fml_gtf2gff\n-\n-REQUIREMENTS\n-\n-    python \n-\n-COMMENTS/QUESTIONS \n-\n-I can be reached at vipin [at] cbio.mskcc.org \n-\n-LICENSE\n-\n-Copyright (C) 2009-2012 Friedrich Miescher Laboratory of the Max Planck Society\n-              2013-2014 Memorial Sloan Kettering Cancer Center\n-\n-This program is free software; you can redistribute it and/or modify\n-it under the terms of the GNU General Public License as published by\n-the Free Software Foundation; either version 3 of the License, or\n-(at your option) any later version.\n-\n-COURTESY\n-\n-To the Galaxy Team.\n-</pre></div>\n-  </div>\n-\n-\n-        </div>\n-\n-      </div><!-- /.repo-container -->\n-      <div class="modal-backdrop"></div>\n-    </div><!-- /.container -->\n-  </div><!-- /.site -->\n-\n-\n-    </div><!-- /.wrapper -->\n-\n-      <div class="container">\n-  <div class="site-footer">\n-    <ul class="site-footer-links right">\n-      <li><a href="https://status.github.com/">Status</a></li>\n-      <li><a href="http://developer.github.com">API</a></li>\n-      <li><a href="http://training.github.com">Training</a></li>\n-      <li><a href="http://shop.github.com">Shop</a></li>\n-      <li><a href="/blog">Blog</a></li>\n-      <li><a href="/about">About</a></li>\n-\n-    </ul>\n-\n-    <a href="/">\n-      <span class="mega-octicon octicon-mark-github" title="GitHub"></span>\n-    </a>\n-\n-    <ul class="site-footer-links">\n-      <li>&copy; 2014 <span title="0.04832s from github-fe123-cp1-prd.iad.github.net">GitHub</span>, Inc.</li>\n-        <li><a href="/site/terms">Terms</a></li>\n-        <li><a href="/site/privacy">Privacy</a></li>\n-        <li><a href="/security">Security</a></li>\n-        <li><a href="/contact">Contact</a></li>\n-    </ul>\n-  </div><!-- /.site-footer -->\n-</div><!-- /.container -->\n-\n-\n-    <div class="fullscreen-overlay js-fullscreen-overlay" id="fullscreen_overlay">\n-  <div class="fullscreen-container js-fullscreen-container">\n-    <div class="textarea-wrap">\n-      <textarea name="fullscreen-contents" id="fullscreen-contents" class="fullscreen-contents js-fullscreen-contents" placeholder="" data-suggester="fullscreen_suggester"></textarea>\n-    </div>\n-  </div>\n-  <div class="fullscreen-sidebar">\n-    <a href="#" class="exit-fullscreen js-exit-fullscreen tooltipped tooltipped-w" aria-label="Exit Zen Mode">\n-      <span class="mega-octicon octicon-screen-normal"></span>\n-    </a>\n-    <a href="#" class="theme-switcher js-theme-switcher tooltipped tooltipped-w"\n-      aria-label="Switch themes">\n-      <span class="octicon octicon-color-mode"></span>\n-    </a>\n-  </div>\n-</div>\n-\n-\n-\n-    <div id="ajax-error-message" class="flash flash-error">\n-      <span class="octicon octicon-alert"></span>\n-      <a href="#" class="octicon octicon-x close js-ajax-error-dismiss"></a>\n-      Something went wrong with that request. Please try again.\n-    </div>\n-\n-\n-      <script crossorigin="anonymous" src="https://assets-cdn.github.com/assets/frameworks-c044173f1e759e8299dbe414ec8ed23e4405bdc5.js" type="text/javascript"></script>\n-      <script async="async" crossorigin="anonymous" src="https://assets-cdn.github.com/assets/github-abafbb1b61823e40063e9f65128096bcfef3b025.js" type="text/javascript"></script>\n-      \n-      \n-        <script async src="https://www.google-analytics.com/analytics.js"></script>\n-  </body>\n-</html>\n-\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/README
--- a/fml_gff_converter_programs/README Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
@@ -1,60 +0,0 @@
-A collection of tools for converting genome annotation between GTF (Gene Transfer Format), BED (Browser Extensible Data) and GFF (Generic Feature Format)
-
-INTRODUCTION
-
-Several genome annotation centers provide their data in GTF, BED, GFF3 etc. I have few programs 
-they mainly deals with converting between GTF, BED and GFF3 formats. They are extensively tested 
-with files from different centers like ENSEMBL, UCSC, JGI and NCBI AceView. Please follow the 
-instructions below to clone these tools into your galaxy instance.
-
-CONTENTS
-
-galaxy: Contains tool configuration files in *.xml format. 
-
-    Tool configuration files. Please add right path to locate the scripts.
-
-    gtf_to_gff3.xml
-
-    gff3_to_gtf.xml
-
-    bed_to_gff3.xml
-
-    gff3_to_bed.xml
-
-t: Test data set. (move to your galaxy root folder/test-data/)
-    
-    You may need to move the test files into your test-data directory so galaxy can find them. 
-    If you want to run the functional tests eg as: 
-
-    sh run_functional_tests.sh -id fml_gtf2gff3
-
-scripts: Python based and Perl based scripts. 
-
-    gtf_to_gff3_converter.py: This tool converts data from GTF format to valid GFF3 format.
-    
-    gff3_to_gtf_converter.pl: This tool converts data from GFF3 format to GTF format.
-
-    bed_to_gff3_converter.py: This tool converts data from a 12 column UCSC wiggle BED format to GFF3 format.
-
-    gff3_to_bed_converter.py: This tool converts gene transcript annotation from GFF3 format to UCSC wiggle 12 column BED format.
-
-REQUIREMENTS
-
-    Python, Perl and if you are interested to use gff3_to_gtf_converter.pl please install Bio::Perl module.
-
-COMMENTS/QUESTIONS 
-
-I can be reached at vipin.ts@tuebingen.mpg.de
-
-LICENSE
-
-Copyright (C) 2010 Friedrich Miescher Laboratory of the Max Planck Society
-
-This program is free software; you can redistribute it and/or modify
-it under the terms of the GNU General Public License as published by
-the Free Software Foundation; either version 3 of the License, or
-(at your option) any later version.
-
-COURTESY
-
-To the Galaxy Team.
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/galaxy/bed_to_gff3.xml
--- a/fml_gff_converter_programs/galaxy/bed_to_gff3.xml Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
@@ -1,84 +0,0 @@
-<tool id="fml_bed2gff" name="BED_to_GFF3" version="1.0.0">
- <description>converter</description>
- <command interpreter="python">bed_to_gff3_converter.py $inf_bed > $gff_format 
- </command> 
- <inputs>
-   <param format="bed" name="inf_bed" type="data" label="Convert this query" help="Please look at About format section."/>
-    </inputs>
-   <outputs>
-   <data format="gff3" name="gff_format" /> 
-   </outputs>
- <tests>
-        <test>
-                <param name="inf_bed" value="ccds_genes.bed" />
-                <output name="gff_format" file="ccds_genes.gff3" />
-        </test>
-        <test>
-                <param name="inf_bed" value="hs_2009.bed" />
-                <output name="gff_format" file="hs_2009.gff3" />
-        </test>
-        </tests>
-   <help>
-
-**What it does**
-
-This tool converts data from a 12 column UCSC wiggle BED format to GFF3 format.
-
---------
-
-**Example**
-
-- The following data in UCSC Wiggle BED format::
-
- chr1    11873   14409   uc001aaa.3      0       +       11873   11873   0       3       354,109,1189,   0,739,1347,
-
-- Will be converted to GFF3 format::
-
- ##gff-version 3
- chr1    bed2gff gene    11874   14409   0       +       .       ID=Gene:uc001aaa.3;Name=Gene:uc001aaa.3
- chr1    bed2gff transcript      11874   14409   0       +       .       ID=uc001aaa.3;Name=uc001aaa.3;Parent=Gene:uc001aaa.3
- chr1    bed2gff exon    11874   12227   0       +       .       Parent=uc001aaa.3
- chr1    bed2gff exon    12613   12721   0       +       .       Parent=uc001aaa.3
- chr1    bed2gff exon    13221   14409   0       +       .       Parent=uc001aaa.3
-
---------
-
-**About formats**
-
-**BED format** Browser Extensible Data format was designed at UCSC for displaying data tracks in the Genome Browser. It has three required fields and several additional optional ones:
-
-The first three BED fields (required) are::
-
-    1. chrom - The name of the chromosome (e.g. chr1, chrY_random).
-    2. chromStart - The starting position in the chromosome. (The first base in a chromosome is numbered 0.)
-    3. chromEnd - The ending position in the chromosome, plus 1 (i.e., a half-open interval).
-
-The additional BED fields (optional) are::
-
-    4. name - The name of the BED line.
-    5. score - A score between 0 and 1000.
-    6. strand - Defines the strand - either '+' or '-'.
-    7. thickStart - The starting position where the feature is drawn thickly at the Genome Browser.
-    8. thickEnd - The ending position where the feature is drawn thickly at the Genome Browser.
-    9. reserved - This should always be set to zero.
-   10. blockCount - The number of blocks (exons) in the BED line.
-   11. blockSizes - A comma-separated list of the block sizes. The number of items in this list should correspond to blockCount.
-   12. blockStarts - A comma-separated list of block starts. All of the blockStart positions should be calculated relative to chromStart. The number of items in this list should correspond to blockCount.
-
-**GFF3 format** General Feature Format is a format for describing genes and other features associated with DNA, RNA and Protein sequences. GFF3 lines have nine tab-separated fields:
-
-    1. seqid - Must be a chromosome or scaffold or contig.
-    2. source - The program that generated this feature.
-    3. type - The name of this type of feature. Some examples of standard feature types are "gene", "CDS", "protein", "mRNA", and "exon". 
-    4. start - The starting position of the feature in the sequence. The first base is numbered 1.
-    5. stop - The ending position of the feature (inclusive).
-    6. score - A score between 0 and 1000. If there is no score value, enter ".".
-    7. strand - Valid entries include '+', '-', or '.' (for don't know/care).
-    8. phase - If the feature is a coding exon, frame should be a number between 0-2 that represents the reading frame of the first base. If the feature is not a coding exon, the value should be '.'.
-    9. attributes - All lines with the same group are linked together into a single item.
-
---------
-
-This tool is a part of the **MLB Group at Friedrich Miescher Laboratory of the Max Planck Society**. Copyright (C) 2010 Vipin T. Sreedharan (vipin.ts@tuebingen.mpg.de)
- </help>
-</tool>
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/galaxy/gff3_to_bed.xml
--- a/fml_gff_converter_programs/galaxy/gff3_to_bed.xml Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
@@ -1,84 +0,0 @@
-<tool id="fml_gff2bed" name="GFF3_to_BED" version="1.0.0">
- <description>converter</description> 
- <command interpreter="python">gff3_to_bed_converter.py $inf_gff > $bed_format 
- </command> 
- <inputs>
-   <param format="gff3" name="inf_gff" type="data" label="Genome annotation file in GFF3 format" help="Please look at About format section."/>
-    </inputs>
-   <outputs>
-   <data format="bed" name="bed_format"/> 
-   </outputs>
- <tests>
-        <test>
-                <param name="inf_gff" value="Aly_JGI.gff3" />
-                <output name="bed_format" file="Aly_JGI.bed" />
-        </test>
-        <test>
-                <param name="inf_gff" value="MB7_3R.gff3" />
-                <output name="bed_format" file="MB7_3R.bed" />
-        </test>
-        </tests>
-   <help>
-
-**What it does**
-
-This tool converts gene transcript annotation from GFF3 format to UCSC wiggle 12 column BED format.
-
---------
-
-**Example**
-
-- The following data in GFF3 format::
-
- ##gff-version 3
- chr1    protein_coding  gene    11874   14409   0       +       .       ID=Gene:uc001aaa.3;Name=Gene:uc001aaa.3
- chr1    protein_coding  transcript      11874   14409   0       +       .       ID=uc001aaa.3;Name=uc001aaa.3;Parent=Gene:uc001aaa.3
- chr1    protein_coding  exon    11874   12227   0       +       .       Parent=uc001aaa.3
- chr1    protein_coding  exon    12613   12721   0       +       .       Parent=uc001aaa.3
- chr1    protein_coding  exon    13221   14409   0       +       .       Parent=uc001aaa.3
-
-- Will be converted to UCSC Wiggle BED format::
-
- chr1    11874   14409   uc001aaa.3      0       +       11874   14409   0       3       354,109,1189,   0,739,1347,
-
---------
-
-**About formats**
-
-**GFF3 format** General Feature Format is a format for describing genes and other features associated with DNA, RNA and Protein sequences. GFF3 lines have nine tab-separated fields:
-
- 1. seqid - Must be a chromosome or scaffold or contig.
- 2. source - The program that generated this feature.
- 3. type - The name of this type of feature. Some examples of standard feature types are "gene", "CDS", "protein", "mRNA", and "exon". 
- 4. start - The starting position of the feature in the sequence. The first base is numbered 1.
- 5. stop - The ending position of the feature (inclusive).
- 6. score - A score between 0 and 1000. If there is no score value, enter ".".
- 7. strand - Valid entries include '+', '-', or '.' (for don't know/care).
- 8. phase - If the feature is a coding exon, frame should be a number between 0-2 that represents the reading frame of the first base. If the feature is not a coding exon, the value should be '.'.
- 9. attributes - All lines with the same group are linked together into a single item.
-
-**BED format** Browser Extensible Data format was designed at UCSC for displaying data tracks in the Genome Browser. It has three required fields and several additional optional ones:
-
-The first three BED fields (required) are::
-
-    1. chrom - The name of the chromosome (e.g. chr1, chrY_random).
-    2. chromStart - The starting position in the chromosome. (The first base in a chromosome is numbered 0.)
-    3. chromEnd - The ending position in the chromosome, plus 1 (i.e., a half-open interval).
-
-The additional BED fields (optional) are::
-
-    4. name - The name of the BED line.
-    5. score - A score between 0 and 1000.
-    6. strand - Defines the strand - either '+' or '-'.
-    7. thickStart - The starting position where the feature is drawn thickly at the Genome Browser.
-    8. thickEnd - The ending position where the feature is drawn thickly at the Genome Browser.
-    9. reserved - This should always be set to zero.
-   10. blockCount - The number of blocks (exons) in the BED line.
-   11. blockSizes - A comma-separated list of the block sizes. The number of items in this list should correspond to blockCount.
-   12. blockStarts - A comma-separated list of block starts. All of the blockStart positions should be calculated relative to chromStart. The number of items in this list should correspond to blockCount.
-
---------
-
-This tool is a part of the **MLB Group at Friedrich Miescher Laboratory of the Max Planck Society**. Copyright (C) 2010 Vipin T. Sreedharan (vipin.ts@tuebingen.mpg.de)
- </help>
-</tool>
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/galaxy/gff3_to_gtf.xml
--- a/fml_gff_converter_programs/galaxy/gff3_to_gtf.xml Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
@@ -1,83 +0,0 @@
-<tool id="fml_gff3togtf" name="GFF3_to_GTF" version="1.0.0">
- <description>converter</description> 
- <command interpreter="perl">gff3_to_gtf_converter.pl $inf_gff3 $gtf_format
- </command> 
- <inputs>
-   <param format="gff3" name="inf_gff3" type="data" label="File in GFF3 format" help="GFF3 format file for converting to GTF."/>
-        </inputs>
-   <outputs>
-   <data format="gtf" name="gtf_format" /> 
-   </outputs>
- <tests>
-        <test>
-                <param name="inf_gff3" value="AceView_ncbi_37.gff3" />
-                <output name="gtf_format" file="AceView_gff3_to_gtf.gtf" />
-        </test>
-        <test>
-                <param name="inf_gff3" value="ENSEMBL_mm9.gff3" />
-                <output name="gtf_format" file="ENSEMBL_mm9_gff3_to_gtf.gtf" />
-        </test>
-    </tests>
-   <help>
-
-**What it does**
-
-This tool converts data from GFF3 format to GTF format.
-
---------
-
-**Example**
-
-- The following data in GFF3 format::
-
- ##gff-version 3
- 17      protein_coding  gene    7255208 7258258 .       +       .       ID=ENSG00000213859;Name=KCTD11
- 17      protein_coding  mRNA    7255208 7258258 .       +       .       ID=ENST00000333751;Name=KCTD11-001;Parent=ENSG00000213859
- 17      protein_coding  protein 7256262 7256960 .       +       .       ID=ENSP00000328352;Name=KCTD11-001;Parent=ENST00000333751
- 17      protein_coding  five_prime_UTR  7255208 7256261 .       +       .       Parent=ENST00000333751
- 17      protein_coding  CDS     7256262 7256960 .       +       0       Name=CDS:KCTD11;Parent=ENST00000333751,ENSP00000328352
- 17      protein_coding  three_prime_UTR 7256961 7258258 .       +       .       Parent=ENST00000333751
- 17      protein_coding  exon    7255208 7258258 .       +       .       Parent=ENST00000333751
-
-- Will be converted to GTF format::
-
- 17      protein_coding  exon    7255208 7258258 .       +       .        gene_id "ENSG00000213859"; transcript_id "ENST00000333751"; exon_number "1"; gene_name "KCTD11"; transcript_name "KCTD11-001";
- 17      protein_coding  CDS     7256262 7256957 .       +       0        gene_id "ENSG00000213859"; transcript_id "ENST00000333751"; exon_number "1"; gene_name "KCTD11"; transcript_name "KCTD11-001"; protein_id "ENSP00000328352";
- 17      protein_coding  start_codon     7256262 7256264 .       +       0        gene_id "ENSG00000213859"; transcript_id "ENST00000333751"; exon_number "1"; gene_name "KCTD11"; transcript_name "KCTD11-001";
- 17      protein_coding  stop_codon      7256958 7256960 .       +       0        gene_id "ENSG00000213859"; transcript_id "ENST00000333751"; exon_number "1"; gene_name "KCTD11"; transcript_name "KCTD11-001";
-
---------
-
-**About formats**
-
-
-**GFF3 format** General Feature Format is a format for describing genes and other features associated with DNA, RNA and Protein sequences. GFF3 lines have nine tab-separated fields:
-
-    1. seqid - Must be a chromosome or scaffold.
-    2. source - The program that generated this feature.
-    3. type - The name of this type of feature. Some examples of standard feature types are "gene", "CDS", "protein", "mRNA", and "exon". 
-    4. start - The starting position of the feature in the sequence. The first base is numbered 1.
-    5. stop - The ending position of the feature (inclusive).
-    6. score - A score between 0 and 1000. If there is no score value, enter ".".
-    7. strand - Valid entries include '+', '-', or '.' (for don't know/care).
-    8. phase - If the feature is a coding exon, frame should be a number between 0-2 that represents the reading frame of the first base. If the feature is not a coding exon, the value should be '.'.
-    9. attributes - All lines with the same group are linked together into a single item.
-
-
-**GTF format** Gene Transfer Format, it borrows from GFF, but has additional structure that warrants a separate definition and format name. GTF lines have nine tab-seaparated fields:
-
-    1. seqname - The name of the sequence.
-    2. source - This indicating where the annotation came from.
-    3. feature - The name of the feature types. The following feature types are required: 'CDS', 'start_codon' and 'stop_codon'
-    4. start - The starting position of the feature in the sequence. The first base is numbered 1.
-    5. end - The ending position of the feature (inclusive).
-    6. score - The score field indicates a degree of confidence in the feature's existence and coordinates.
-    7. strand - Valid entries include '+', '-', or '.'
-    8. frame - If the feature is a coding exon, frame should be a number between 0-2 that represents the reading frame of the first base.
-    9. attributes - These attributes are designed for handling multiple transcripts from the same genomic region.
-
---------
-
-This tool is a part of the **MLB Group at Friedrich Miescher Laboratory of the Max Planck Society**. Copyright (C) 2010 Vipin T. Sreedharan (vipin.ts@tuebingen.mpg.de)
- </help>
-</tool>
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/galaxy/gtf_to_gff3.xml
--- a/fml_gff_converter_programs/galaxy/gtf_to_gff3.xml Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
@@ -1,89 +0,0 @@
-<tool id="fml_gtf2gff3" name="GTF_to_GFF3" version="1.0.0">
- <description>converter</description> 
- <command interpreter="python">gtf_to_gff3_converter.py $inf_gtf > $gff3_format 
- </command> 
- <inputs>
-   <param format="gtf" name="inf_gtf" type="data" label="File in GTF format" help="GTF file for converting to GFF3 format."/>
-        </inputs>
-   <outputs>
-   <data format="gff3" name="gff3_format" /> 
-   </outputs>
-     <tests>
-         <test>
-                <param name="inf_gtf" value="UCSC_transcripts.gtf" />
-                <output name="gff3_format" file="UCSC_transcripts.gff3" />
-         </test>
-         <test>
-                <param name="inf_gtf" value="JGI_genes.gtf" />
-                <output name="gff3_format" file="JGI_genes.gff3" />
-         </test>
-         <test>
-                <param name="inf_gtf" value="ENSEMBL_mm9.gtf" />
-                <output name="gff3_format" file="ENSEMBL_mm9.gff3" />
-         </test>
-         <test>
-                <param name="inf_gtf" value="AceView_ncbi_37.gtf" />
-                <output name="gff3_format" file="AceView_ncbi_37.gff3" />
-         </test>
-        </tests>
-   <help>
-
-**What it does**
-
-This tool converts data from GTF format to valid GFF3 format.
-
---------
-
-**Example**
-
-- The following data in GTF format::
-
- 17      protein_coding  exon    7255208 7258258 .       +       .        gene_id "ENSG00000213859"; transcript_id "ENST00000333751"; exon_number "1"; gene_name "KCTD11"; transcript_name "KCTD11-001";
- 17      protein_coding  CDS     7256262 7256957 .       +       0        gene_id "ENSG00000213859"; transcript_id "ENST00000333751"; exon_number "1"; gene_name "KCTD11"; transcript_name "KCTD11-001"; protein_id "ENSP00000328352";
- 17      protein_coding  start_codon     7256262 7256264 .       +       0        gene_id "ENSG00000213859"; transcript_id "ENST00000333751"; exon_number "1"; gene_name "KCTD11"; transcript_name "KCTD11-001";
- 17      protein_coding  stop_codon      7256958 7256960 .       +       0        gene_id "ENSG00000213859"; transcript_id "ENST00000333751"; exon_number "1"; gene_name "KCTD11"; transcript_name "KCTD11-001";
-
-- Will be converted to GFF3 format::
-
- ##gff-version 3
- 17      protein_coding  gene    7255208 7258258 .       +       .       ID=ENSG00000213859;Name=KCTD11
- 17      protein_coding  mRNA    7255208 7258258 .       +       .       ID=ENST00000333751;Name=KCTD11-001;Parent=ENSG00000213859
- 17      protein_coding  protein 7256262 7256960 .       +       .       ID=ENSP00000328352;Name=KCTD11-001;Parent=ENST00000333751
- 17      protein_coding  five_prime_UTR  7255208 7256261 .       +       .       Parent=ENST00000333751
- 17      protein_coding  CDS     7256262 7256960 .       +       0       Name=CDS:KCTD11;Parent=ENST00000333751,ENSP00000328352
- 17      protein_coding  three_prime_UTR 7256961 7258258 .       +       .       Parent=ENST00000333751
- 17      protein_coding  exon    7255208 7258258 .       +       .       Parent=ENST00000333751
-
---------
-
-**About formats**
-
-**GTF format** Gene Transfer Format, it borrows from GFF, but has additional structure that warrants a separate definition and format name. GTF lines have nine tab-seaparated fields:
-
-    1. seqname - The name of the sequence.
-    2. source - This indicating where the annotation came from.
-    3. feature - The name of the feature types. The following feature types are required: 'CDS', 'start_codon' and 'stop_codon'
-    4. start - The starting position of the feature in the sequence. The first base is numbered 1.
-    5. end - The ending position of the feature (inclusive).
-    6. score - The score field indicates a degree of confidence in the feature's existence and coordinates.
-    7. strand - Valid entries include '+', '-', or '.'
-    8. frame - If the feature is a coding exon, frame should be a number between 0-2 that represents the reading frame of the first base.
-    9. attributes - These attributes are designed for handling multiple transcripts from the same genomic region.
-
-**GFF3 format** General Feature Format is a format for describing genes and other features associated with DNA, RNA and Protein sequences. GFF3 lines have nine tab-separated fields:
-
-    1. seqid - Must be a chromosome or scaffold.
-    2. source - The program that generated this feature.
-    3. type - The name of this type of feature. Some examples of standard feature types are "gene", "CDS", "protein", "mRNA", and "exon". 
-    4. start - The starting position of the feature in the sequence. The first base is numbered 1.
-    5. stop - The ending position of the feature (inclusive).
-    6. score - A score between 0 and 1000. If there is no score value, enter ".".
-    7. strand - Valid entries include '+', '-', or '.' (for don't know/care).
-    8. phase - If the feature is a coding exon, frame should be a number between 0-2 that represents the reading frame of the first base. If the feature is not a coding exon, the value should be '.'.
-    9. attributes - All lines with the same group are linked together into a single item.
-
---------
-
-This tool is a part of the **MLB Group at Friedrich Miescher Laboratory of the Max Planck Society**. Copyright (C) 2010 Vipin T. Sreedharan (vipin.ts@tuebingen.mpg.de)
- </help>
-</tool>
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/scripts/bed_to_gff3_converter.py
--- a/fml_gff_converter_programs/scripts/bed_to_gff3_converter.py Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
[
@@ -1,48 +0,0 @@
-#!/usr/bin/env python
-#
-# This program is free software; you can redistribute it and/or modify
-# it under the terms of the GNU General Public License as published by
-# the Free Software Foundation; either version 3 of the License, or
-# (at your option) any later version.
-#
-# Written (W) 2010 Vipin T Sreedharan, Friedrich Miescher Laboratory of the Max Planck Society
-# Copyright (C) 2010 Max Planck Society
-#
-# Description : Convert a BED format file to GFF3 format
-
-import re, sys
-
-def __main__():
-
-    try:
-        bed_fh = open(sys.argv[1], 'rU')
-    except:
-        sys.stderr.write('BED format file fail to open, Cannot continue...\n')
-        sys.stderr.write('USAGE: bed_to_gff3_converter.py <bed file> > *.gff3\n')
-        sys.exit(-1)
-    print '##gff-version 3'
-    for line in bed_fh: 
-        line = line.strip( '\n\r' ).split( '\t' )
-        if re.match('#', line[0]):continue
-        if len(line) != 12: # considering BED lines with 12 fields
-            line = '\t'.join(line)
-            sys.stdout.write('Warning: Invalid BED line found- ' + line + '\n') 
-            continue
-        if len(line[-1].split(',')) != len(line[-2].split(',')):continue # checking the consistency b/w relative start of exon and its length
-        rstart = line[-1].split(',')
-        if rstart[-1] == '': rstart.pop()
-        exon_len = line[-2].split(',')
-        if exon_len[-1] == '': exon_len.pop()
-        if len(rstart) != int(line[-3]): continue # checking the number of exons and block count are same
-        if line[5] != '+' and line[5] != '-':line[5] = '.' # replace the unknown starnd with '.' 
-        # write feature lines to the result file 
-        print line[0] + '\tbed2gff\tgene\t' + str(int(line[1]) + 1) + '\t' + line[2] + '\t' + line[4] + '\t' + line[5] + '\t.\t' + 'ID=Gene:' + line[3] + ';Name=Gene:' + line[3] 
-        print line[0] + '\tbed2gff\ttranscript\t' + str(int(line[1]) + 1) + '\t' + line[2] + '\t' + line[4] + '\t' + line[5] + '\t.\t' + 'ID=' + line[3] + ';Name=' + line[3] + ';Parent=Gene:' + line[3]
-        st = int(line[1])
-        for ex_cnt in range(int(line[-3])):
-            start = st + int(rstart[ex_cnt]) + 1
-            stop = start + int(exon_len[ex_cnt]) - 1
-            print line[0] + '\tbed2gff\texon\t' + str(start) + '\t' + str(stop) + '\t' + line[4] + '\t' + line[5] + '\t.\t' + 'Parent=' + line[3]
-    bed_fh.close()
-
-if __name__ == "__main__": __main__()
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/scripts/gff3_to_bed_converter.py
--- a/fml_gff_converter_programs/scripts/gff3_to_bed_converter.py Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
[
@@ -1,79 +0,0 @@
-#!/usr/bin/env python
-#
-# This program is free software; you can redistribute it and/or modify
-# it under the terms of the GNU General Public License as published by
-# the Free Software Foundation; either version 3 of the License, or
-# (at your option) any later version.
-#
-# Written (W) 2010 Vipin T Sreedharan, Friedrich Miescher Laboratory of the Max Planck Society
-# Copyright (C) 2010 Max Planck Society
-#
-# Description : Convert a genome annotation in GFF3 format to UCSC 12 column Wiggle BED format. BED format typically represents the transcript models. 
-
-import re, sys
-
-def WriteBED(tinfo, einfo):
-    
-    for contig_id, features in tinfo.items():
-        for tid, tloc in features.items():
-            if tid in einfo: # get corresponding exon info 
-                exon_cnt, exon_len, exon_cod, fex, rstart = 0, '', '', 0, None
-                if tloc[-1] == '-':
-                    if einfo[tid][0][1] > einfo[tid][-1][1]:einfo[tid].sort()
-                for ex_ele in einfo[tid]:
-                    if ex_ele[0] != contig_id:continue
-                    exon_cnt += 1
-                    exon_len += str(int(ex_ele[2])-int(ex_ele[1])+1) + ','
-                    if fex == 0: # calculate the relative exon start 
-                        exon_cod += '0,'
-                        fex = 1
-                        rstart = int(ex_ele[1])
-                    else:
-                        exon_cod += str(int(ex_ele[1])-rstart) + ','
-                if exon_len: # display bed line
-                    print contig_id + '\t' + tloc[0] + '\t' + tloc[1] + '\t' + tid + '\t' + tloc[2] + '\t' + tloc[-1] + '\t' + tloc[0] + '\t' + tloc[1] + '\t0\t' + str(exon_cnt) + '\t' + exon_len + '\t' + exon_cod 
-
-def ParseAnno(gff_fh):
-   
-    tinfo, einfo = dict(), dict()
-    for gff_line in gff_fh:
-        gff_line = gff_line.strip('\n\r').split('\t')
-        if re.match(r'#', gff_line[0]):continue
-        if re.match(r'>', gff_line[0]):continue
-        if len(gff_line) == 1:
-            if re.search(r'\w+', gff_line[0]):continue## GFF files with FASTA sequence together 
-        if len(gff_line) != 9:sys.stderr.write('Warning: Found invalid GFF line\n' + '\t'.join(gff_line) + '\n');continue
-        if gff_line[3] == '' and gff_line[4] == '':sys.stderr.write('Warning: Found invalid coordinates in GFF line: ' + '\t'.join(gff_line) + '\n');continue
-        if gff_line[2] == 'transcript' or gff_line[2] == 'scRNA' or gff_line[2] == "mRNA" or gff_line[2] == 'ncRNA' or gff_line[2] == 'miRNA' or gff_line[2] == 'rRNA' or gff_line[2] == 'snoRNA' or gff_line[2] == 'snRNA' or gff_line[2] == 'tRNA' or gff_line[2] == 'pseudogenic_transcript':
-            col9 = gff_line[-1].split(';')
-            tid = None
-            for ele in col9:
-                if re.search(r'ID=', ele):tid = re.search(r'ID=(.+)', ele).group(1);break
-            if gff_line[0] in tinfo:
-                tinfo[gff_line[0]][tid] = (gff_line[3], gff_line[4], gff_line[5], gff_line[6])
-            else:
-                tinfo[gff_line[0]] = {tid:(gff_line[3], gff_line[4], gff_line[5], gff_line[6])}
-        if gff_line[2] == 'exon':
-            col9 = gff_line[-1].split(';')
-            pid = None
-            for ele in col9:
-                if re.search(r'Parent=', ele):pid = re.search(r'Parent=(.+)', ele).group(1);break
-            if pid in einfo:
-                einfo[pid].append((gff_line[0], int(gff_line[3]), int(gff_line[4])))
-            else:
-                einfo[pid] = [(gff_line[0], int(gff_line[3]), int(gff_line[4]))]
-    gff_fh.close()
-    return tinfo, einfo
-
-if __name__ == "__main__": 
-
-    try:
-        gff_fh = open(sys.argv[1], 'rU')
-    except:
-        sys.stderr.write('GFF format file fail to open, Cannot continue...\n')
-        sys.stderr.write('USAGE: gff3_to_bed_converter.py <gff file> > *.bed\n')
-        sys.exit(-1)
-    ## get transcript annotation
-    tinfo, einfo = ParseAnno(gff_fh)
-    ## write into bed format 
-    WriteBED(tinfo, einfo)
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/scripts/gff3_to_gtf_converter.pl
--- a/fml_gff_converter_programs/scripts/gff3_to_gtf_converter.pl Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
[
@@ -1,83 +0,0 @@
-#!/usr/bin/env perl
-#
-# This program is free software; you can redistribute it and/or modify
-# it under the terms of the GNU General Public License as published by
-# the Free Software Foundation; either version 3 of the License, or
-# (at your option) any later version.
-#
-# Written (W) 2010 Vipin T Sreedharan, Friedrich Miescher Laboratory of the Max Planck Society
-# Copyright (C) 2010 Max Planck Society
-#
-# Description : Convert a GFF3 format file to GTF format.

-use strict;
-use warnings;
-
-use lib '/home/galaxy/perl5/share/perl/5.8.8/';
-use Bio::FeatureIO;
-
-my $usage = q(
-gff3_to_gtf_converter.pl  - Program to convert a valid GFF3 format file to GTF format.
-USAGE: gff3_to_gtf_converter.pl <GFF3 file name> <output file name>
-);
-
-# requirement check
-if (scalar(@ARGV) != 2) {
-    print $usage;
-    exit
-}
-
-my $inFile = $ARGV[0]; 
-my $outFile = $ARGV[1];

-my $inGFF = Bio::FeatureIO->new( '-file' => "$inFile",
- '-format' => 'GFF',
- '-version' => 3 );
-my $outGTF = Bio::FeatureIO->new( '-file' => ">$outFile",
- '-format' => 'GFF',
- '-version' => 2.5);
-
-my ($gene, $feature_type, $exon_exon_cnt, $cds_exon_cnt) = ('', '', 0, 0);
-
-while (my $feature = $inGFF->next_feature() ) {
-    # from a valid GFF3 file try to get transcript information.
-    if ($feature->type->name eq 'mRNA' || $feature->type->name eq 'miRNA' || $feature->type->name eq 'ncRNA' || $feature->type->name eq 'rRNA' || $feature->type->name eq 'snoRNA' || $feature->type->name eq 'snRNA' || $feature->type->name eq 'tRNA' || $feature->type->name eq 'misc_RNA' || $feature->type->name eq 'processed_transcript' || $feature->type->name eq 'transcript' || $feature->type->name eq 'scRNA') {
-        my $parent = ($feature->get_Annotations('Parent'))[0]; 
-        $gene = $parent->value;
-        $cds_exon_cnt = 1; 
-        $exon_exon_cnt = 1;
-        $feature_type = $feature->type->name;
-    }
-    if ($feature->type->name eq 'exon' || $feature->type->name eq 'CDS' ||$feature->type->name eq 'stop_codon' || $feature->type->name eq 'start_codon') {
-        my $parent = ($feature->get_Annotations('Parent'))[0];
-        my $transcript = $parent->value;
-       
-        my ($col_exon_number, $protein_id) = ('', '');
-        if ($feature->type->name eq 'exon') {
-            $col_exon_number = Bio::Annotation::SimpleValue->new( '-value' => $exon_exon_cnt, '-tagname' => 'exon_number');
-            $exon_exon_cnt++;
-        } elsif ($feature->type->name eq 'CDS') {
-            $col_exon_number = Bio::Annotation::SimpleValue->new( '-value' => $cds_exon_cnt, '-tagname' => 'exon_number');
-            $cds_exon_cnt++;
-            my $pid = $transcript;
-            $pid =~s/Transcript/Protein/;
-            $protein_id = Bio::Annotation::SimpleValue->new( '-value' => $pid, '-tagname' => 'protein_id');
-        } elsif ($feature->type->name eq 'start_codon'){
-            $col_exon_number = Bio::Annotation::SimpleValue->new( '-value' => $cds_exon_cnt, '-tagname' => 'exon_number');
-        } else {
-            $col_exon_number = Bio::Annotation::SimpleValue->new( '-value' => $cds_exon_cnt-1, '-tagname' => 'exon_number');
-        }
-
-        my $transcript_id = Bio::Annotation::SimpleValue->new( '-value' => $transcript, '-tagname' => 'transcript_id');
-        my $gene_id = Bio::Annotation::SimpleValue->new( '-value' => $gene, '-tagname' => 'gene_id');
-        my $source_identifer = Bio::Annotation::SimpleValue->new( '-value' => $feature_type, '-tagname' => 'source_id');
-
-        $feature->add_Annotation($source_identifer);
-        $feature->add_Annotation($transcript_id);
-        $feature->add_Annotation($gene_id);
-        $feature->add_Annotation($col_exon_number);
-        $feature->add_Annotation($protein_id) if ($feature->type->name eq 'CDS');
-    }
-    $outGTF->write_feature($feature);
-}
-exit;
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/scripts/gtf_to_gff3_converter.py
--- a/fml_gff_converter_programs/scripts/gtf_to_gff3_converter.py Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
[
b'@@ -1,233 +0,0 @@\n-#!/usr/bin/env python\n-#\n-# This program is free software; you can redistribute it and/or modify\n-# it under the terms of the GNU General Public License as published by\n-# the Free Software Foundation; either version 3 of the License, or\n-# (at your option) any later version.\n-#\n-# Written (W) 2010 Vipin T Sreedharan, Friedrich Miescher Laboratory of the Max Planck Society\n-# Copyright (C) 2010 Max Planck Society\n-#\n-# Description : Convert a GTF format file to GFF3 format, tested extensively on files from following centers NCBI AceView, ENSEMBL, Joint Genome Institute and UCSC Genome Center\n-\n-import re, sys\n-from operator import itemgetter\n-\n-def addCDSphase(strand, cds):\n-    """Add CDS phase to the CDS exons"""\n-    \n-    cds_region, cds_flag = [], 0 \n-    if strand == \'+\':\n-        for cdspos in cds:\n-            if cds_flag == 0:\n-                cdspos = (cdspos[0], cdspos[1], 0)\n-                diff = (cdspos[1]-(cdspos[0]-1))%3\n-            else:\n-                xy = 0\n-                if diff == 0: \n-                    cdspos = (cdspos[0], cdspos[1], 0)\n-                elif diff == 1: \n-                    cdspos = (cdspos[0], cdspos[1], 2)\n-                    xy = 2\n-                elif diff == 2: \n-                    cdspos = (cdspos[0], cdspos[1], 1)\n-                    xy = 1\n-                diff = ((cdspos[1]-(cdspos[0]-1))-xy)%3\n-            cds_region.append(cdspos)\n-            cds_flag = 1 \n-    elif strand == \'-\':\n-        cds.reverse()\n-        for cdspos in cds: \n-            if cds_flag == 0:\n-                cdspos = (cdspos[0], cdspos[1], 0)\n-                diff = (cdspos[1]-(cdspos[0]-1))%3\n-            else:  \n-                xy = 0 \n-                if diff == 0: \n-                    cdspos = (cdspos[0], cdspos[1], 0)\n-                elif diff == 1:\n-                    cdspos = (cdspos[0], cdspos[1], 2)\n-                    xy = 2\n-                elif diff == 2: \n-                    cdspos = (cdspos[0], cdspos[1], 1)\n-                    xy = 1\n-                diff = ((cdspos[1]-(cdspos[0]-1))-xy)%3\n-            cds_region.append(cdspos)\n-            cds_flag = 1\n-        cds_region.reverse()\n-    return cds_region\n-\n-def buildUTR(cc, ec, strand):\n-    """Build UTR regions from CDS and exon coordiantes"""\n-    \n-    utr5, utr3 = [], []\n-    if strand == \'+\':\n-        cds_s = cc[0][0]\n-        for ex in ec:\n-            if ex[0] <= cds_s and cds_s <= ex[1]:\n-                if ex[0] != cds_s:utr5.append((ex[0], cds_s-1))\n-                break\n-            else:\n-                utr5.append(ex)\n-        cds_e = cc[-1][1]\n-        for i in range(len(ec)):\n-            i += 1\n-            if ec[-i][0] <= cds_e and cds_e <= ec[-i][1]:\n-                if ec[-i][1] != cds_e:utr3.append((cds_e +1, ec[-i][1]))\n-                break\n-            else:\n-                utr3.append(ec[-i]) \n-        utr3.reverse()\n-    elif strand == \'-\':\n-        cds_s = cc[-1][1]\n-        for i in range(len(ec)):\n-            i += 1\n-            if ec[-i][0] <= cds_s and cds_s <= ec[-i][1]:\n-                if ec[-i][1] != cds_s:utr5.append((cds_s+1, ec[-i][1]))\n-                break\n-            else:\n-                utr5.append(ec[-i])\n-        utr5.reverse()\n-        cds_e = cc[0][0] \n-        for ex in ec:\n-            if ex[0] <= cds_e and cds_e <= ex[1]:\n-                if ex[0] != cds_e:utr3.append((ex[0], cds_e-1))\n-                break\n-            else:\n-                utr3.append(ex)\n-    return utr5, utr3\n-\n-def GFFWriter(gtf_file_content):\n-    """Write into GFF3 format"""\n-\n-    for contig, contig_info in sorted(gtf_file_content.items()): # first level, chromosome\n-        for feature, details in contig_info.items(): # second level, gene\n-            source, gene_start, gene_stop, transcript_details, orient, gname = dict(), [], [], dict(), None, None\n-            for ftid, tinfo in details.items(): # third level, transcripts \n-                tinfo[\'exon\'].sort() # ge'..b'gname= name;continue\n-            if re.search(r\'^(transcript_name|transcriptname)$\', tag, re.IGNORECASE):tname= name;continue\n-            if re.search(r\'^(exonNumber|exon_number)$\', tag, re.IGNORECASE):ex_cnt = name;continue\n-\n-        if tid == None:\n-            if gtf_line[2] == \'CDS\':tid = recall # JGI Joint Genome Institute GTF files dont have transcript ID for CDS line\n-        if tid == pid == None:continue # stop_codon icluded in CDS coordinates of JGI GTF files, moreover stop_codon lines dont have any transcript identifications.\n-        if gid == tid:gid = \'Gene:\' + str(gid);tid = \'Transcript:\' + str(tid) # UCSC gene and transcript ID are similar, differentaiting with Gene: and Transcript: tag in the begning \n-        \n-        if gtf_line[0] in gtf_content: # existing chromosome\n-            if gid in gtf_content[gtf_line[0]].keys(): # existing gene \n-                if tid in gtf_content[gtf_line[0]][gid].keys(): # existing transcript\n-                    if gtf_line[2] == \'exon\':gtf_content[gtf_line[0]][gid][tid][\'exon\'].append((int(gtf_line[3]), int(gtf_line[4])))\n-                    elif gtf_line[2] == \'CDS\':gtf_content[gtf_line[0]][gid][tid][\'CDS\'].append((int(gtf_line[3]), int(gtf_line[4])))\n-                    elif gtf_line[2] == \'stop_codon\':gtf_content[gtf_line[0]][gid][tid][\'stop_codon\'].append((int(gtf_line[3]), int(gtf_line[4])))\n-                else: # new transcript \n-                    if gtf_line[2] == \'exon\':gtf_content[gtf_line[0]][gid][tid] = dict(exon = [(int(gtf_line[3]), int(gtf_line[4]))], CDS = [], stop_codon = [], info = [gtf_line[1], gtf_line[6], gtf_line[5], gname, tname])\n-                    elif gtf_line[2] == \'CDS\':gtf_content[gtf_line[0]][gid][tid] = dict(exon = [], CDS = [(int(gtf_line[3]), int(gtf_line[4]))], stop_codon = [], info = [gtf_line[1], gtf_line[6], gtf_line[5], gname, tname])\n-                    elif gtf_line[2] == \'stop_codon\':gtf_content[gtf_line[0]][gid][tid] = dict(exon = [], CDS = [], stop_codon = [(int(gtf_line[3]), int(gtf_line[4]))], info = [gtf_line[1], gtf_line[6], gtf_line[5], gname, tname])\n-            else: # new gene  \n-                if gtf_line[2] == \'exon\':gtf_content[gtf_line[0]][gid] = {tid : dict(exon = [(int(gtf_line[3]), int(gtf_line[4]))], CDS = [], stop_codon = [], info = [gtf_line[1], gtf_line[6], gtf_line[5], gname, tname])}\n-                elif gtf_line[2] == \'CDS\':gtf_content[gtf_line[0]][gid] = {tid : dict(exon = [], CDS = [(int(gtf_line[3]), int(gtf_line[4]))], stop_codon = [], info = [gtf_line[1], gtf_line[6], gtf_line[5], gname, tname])}\n-                elif gtf_line[2] == \'stop_codon\':gtf_content[gtf_line[0]][gid] = {tid : dict(exon = [], CDS = [], stop_codon = [(int(gtf_line[3]), int(gtf_line[4]))], info = [gtf_line[1], gtf_line[6], gtf_line[5], gname, tname])}\n-        else: # new chromosome\n-            if gtf_line[2] == \'exon\':gtf_content[gtf_line[0]] = {gid : {tid : dict(exon = [(int(gtf_line[3]), int(gtf_line[4]))], CDS = [], stop_codon = [], info = [gtf_line[1], gtf_line[6], gtf_line[5], gname, tname])}}\n-            elif gtf_line[2] == \'CDS\':gtf_content[gtf_line[0]] = {gid : {tid : dict(exon = [], CDS = [(int(gtf_line[3]), int(gtf_line[4]))], stop_codon = [], info = [gtf_line[1], gtf_line[6], gtf_line[5], gname, tname])}}\n-            elif gtf_line[2] == \'stop_codon\':gtf_content[gtf_line[0]] = {gid : {tid : dict(exon = [], CDS = [], stop_codon = [(int(gtf_line[3]), int(gtf_line[4]))], info = [gtf_line[1], gtf_line[6], gtf_line[5], gname, tname])}}\n-        recall = tid\n-\n-    gtf_fh.close()\n-    return gtf_content\n-\n-def __main__():\n-\n-    try:\n-        gtf_file = sys.argv[1]\n-    except:\n-        sys.stderr.write(\'GTF format file required, Cannot continue...\\n\')\n-        sys.stderr.write(\'USAGE: gtf_to_gff3_converter.py <gtf file> > *.gff3\\n\')\n-        sys.exit(-1)\n-    \n-    gtf_file_content = get_GTF_info(gtf_file)\n-    print \'##gff-version 3\'\n-    GFFWriter(gtf_file_content)\n-\n-if __name__ == "__main__": __main__()\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/AceView_gff3_to_gtf.gtf
--- a/fml_gff_converter_programs/t/AceView_gff3_to_gtf.gtf Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,2544 +0,0 @@\n-##gff-version 2.5\n-1\tprotein_coding\tCDS\t12704566\t12704733\t.\t+\t0\tgene_id "AADACL4"; transcript_id "AADACL4.aAug10"; exon_number "1"; protein_id "AADACL4.aAug10";\n-1\tprotein_coding\tCDS\t12711142\t12711358\t.\t+\t0\tgene_id "AADACL4"; transcript_id "AADACL4.aAug10"; exon_number "2"; protein_id "AADACL4.aAug10";\n-1\tprotein_coding\tCDS\t12721802\t12721865\t.\t+\t0\tgene_id "AADACL4"; transcript_id "AADACL4.aAug10"; exon_number "3"; protein_id "AADACL4.aAug10";\n-1\tprotein_coding\tCDS\t12725972\t12726746\t.\t+\t0\tgene_id "AADACL4"; transcript_id "AADACL4.aAug10"; exon_number "4"; protein_id "AADACL4.aAug10";\n-1\tprotein_coding\texon\t12704566\t12704733\t.\t+\t0\tgene_id "AADACL4"; transcript_id "AADACL4.aAug10"; exon_number "1";\n-1\tprotein_coding\texon\t12711142\t12711358\t.\t+\t0\tgene_id "AADACL4"; transcript_id "AADACL4.aAug10"; exon_number "2";\n-1\tprotein_coding\texon\t12721802\t12721865\t.\t+\t0\tgene_id "AADACL4"; transcript_id "AADACL4.aAug10"; exon_number "3";\n-1\tprotein_coding\texon\t12725972\t12727097\t.\t+\t0\tgene_id "AADACL4"; transcript_id "AADACL4.aAug10"; exon_number "4";\n-1\tprotein_coding\tCDS\t12776344\t12776347\t.\t+\t0\tgene_id "AADACL3"; transcript_id "AADACL3.bAug10"; exon_number "1"; protein_id "AADACL3.bAug10";\n-1\tprotein_coding\tCDS\t12780885\t12780948\t.\t+\t0\tgene_id "AADACL3"; transcript_id "AADACL3.bAug10"; exon_number "2"; protein_id "AADACL3.bAug10";\n-1\tprotein_coding\tCDS\t12785189\t12785963\t.\t+\t0\tgene_id "AADACL3"; transcript_id "AADACL3.bAug10"; exon_number "3"; protein_id "AADACL3.bAug10";\n-1\tprotein_coding\texon\t12776119\t12776347\t.\t+\t0\tgene_id "AADACL3"; transcript_id "AADACL3.bAug10"; exon_number "1";\n-1\tprotein_coding\texon\t12780885\t12780948\t.\t+\t0\tgene_id "AADACL3"; transcript_id "AADACL3.bAug10"; exon_number "2";\n-1\tprotein_coding\texon\t12785189\t12788726\t.\t+\t0\tgene_id "AADACL3"; transcript_id "AADACL3.bAug10"; exon_number "3";\n-1\tprotein_coding\tCDS\t12779480\t12779693\t.\t+\t0\tgene_id "AADACL3"; transcript_id "AADACL3.aAug10"; exon_number "1"; protein_id "AADACL3.aAug10";\n-1\tprotein_coding\tCDS\t12780885\t12780948\t.\t+\t0\tgene_id "AADACL3"; transcript_id "AADACL3.aAug10"; exon_number "2"; protein_id "AADACL3.aAug10";\n-1\tprotein_coding\tCDS\t12785189\t12785963\t.\t+\t0\tgene_id "AADACL3"; transcript_id "AADACL3.aAug10"; exon_number "3"; protein_id "AADACL3.aAug10";\n-1\tprotein_coding\texon\t12776119\t12776347\t.\t+\t0\tgene_id "AADACL3"; transcript_id "AADACL3.aAug10"; exon_number "1";\n-1\tprotein_coding\texon\t12779477\t12779693\t.\t+\t0\tgene_id "AADACL3"; transcript_id "AADACL3.aAug10"; exon_number "2";\n-1\tprotein_coding\texon\t12780885\t12780948\t.\t+\t0\tgene_id "AADACL3"; transcript_id "AADACL3.aAug10"; exon_number "3";\n-1\tprotein_coding\texon\t12785189\t12788726\t.\t+\t0\tgene_id "AADACL3"; transcript_id "AADACL3.aAug10"; exon_number "4";\n-10\tprotein_coding\tCDS\t52566489\t52566640\t.\t-\t0\tgene_id "A1CF"; transcript_id "A1CF.fAug10"; exon_number "1"; protein_id "A1CF.fAug10";\n-10\tprotein_coding\tCDS\t52569654\t52569802\t.\t-\t0\tgene_id "A1CF"; transcript_id "A1CF.fAug10"; exon_number "2"; protein_id "A1CF.fAug10";\n-10\tprotein_coding\tCDS\t52570800\t52570936\t.\t-\t0\tgene_id "A1CF"; transcript_id "A1CF.fAug10"; exon_number "3"; protein_id "A1CF.fAug10";\n-10\tprotein_coding\tCDS\t52573617\t52573798\t.\t-\t0\tgene_id "A1CF"; transcript_id "A1CF.fAug10"; exon_number "4"; protein_id "A1CF.fAug10";\n-10\tprotein_coding\tCDS\t52575766\t52576039\t.\t-\t0\tgene_id "A1CF"; transcript_id "A1CF.fAug10"; exon_number "5"; protein_id "A1CF.fAug10";\n-10\tprotein_coding\tCDS\t52580312\t52580409\t.\t-\t0\tgene_id "A1CF"; transcript_id "A1CF.fAug10"; exon_number "6"; protein_id "A1CF.fAug10";\n-10\tprotein_coding\tCDS\t52587891\t52588055\t.\t-\t0\tgene_id "A1CF"; transcript_id "A1CF.fAug10"; exon_number "7"; protein_id "A1CF.fAug10";\n-10\tprotein_coding\tCDS\t52595834\t52596072\t.\t-\t0\tgene_id "A1CF"; transcript_id "A1CF.fAug10"; exon_number "8"; protein_id "A1CF.fAug10";\n-10\tprotein_coding\tCDS\t52601622\t52601752\t.\t-\t0\tgene_id "A1CF"; transcript_id "A1CF.fAug10"; exon_number "9"; protein_id "A1CF.fAug10";\n-10\tprotein_coding\tCDS\t52603748\t52603882\t.\t-\t0\tgene'..b'10"; exon_number "1"; protein_id "AAA1.aAug10";\n-7\tprotein_coding\tCDS\t34768349\t34768428\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.aAug10"; exon_number "2"; protein_id "AAA1.aAug10";\n-7\tprotein_coding\tCDS\t34797686\t34797710\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.aAug10"; exon_number "3"; protein_id "AAA1.aAug10";\n-7\tprotein_coding\texon\t34607864\t34607984\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.aAug10"; exon_number "1";\n-7\tprotein_coding\texon\t34609324\t34609473\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.aAug10"; exon_number "2";\n-7\tprotein_coding\texon\t34768349\t34768428\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.aAug10"; exon_number "3";\n-7\tprotein_coding\texon\t34797686\t34797884\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.aAug10"; exon_number "4";\n-7\tprotein_coding\tCDS\t34682958\t34682963\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.dAug10"; exon_number "1"; protein_id "AAA1.dAug10";\n-7\tprotein_coding\tCDS\t34768349\t34768428\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.dAug10"; exon_number "2"; protein_id "AAA1.dAug10";\n-7\tprotein_coding\tCDS\t34800724\t34800802\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.dAug10"; exon_number "3"; protein_id "AAA1.dAug10";\n-7\tprotein_coding\texon\t34682839\t34682963\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.dAug10"; exon_number "1";\n-7\tprotein_coding\texon\t34768349\t34768428\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.dAug10"; exon_number "2";\n-7\tprotein_coding\texon\t34800724\t34800803\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.dAug10"; exon_number "3";\n-7\ttranscript\texon\t34758479\t34759420\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.gAug10"; exon_number "1";\n-7\ttranscript\texon\t34760254\t34760397\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.gAug10"; exon_number "2";\n-7\ttranscript\texon\t34762896\t34763007\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.gAug10"; exon_number "3";\n-7\ttranscript\texon\t34768349\t34768428\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.gAug10"; exon_number "4";\n-7\ttranscript\texon\t34800724\t34800803\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.gAug10"; exon_number "5";\n-7\ttranscript\texon\t34873773\t34873948\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.gAug10"; exon_number "6";\n-7\ttranscript\texon\t34758474\t34759420\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.hAug10"; exon_number "1";\n-7\ttranscript\texon\t34762896\t34763007\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.hAug10"; exon_number "2";\n-7\ttranscript\texon\t34768349\t34768428\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.hAug10"; exon_number "3";\n-7\ttranscript\texon\t34807954\t34808052\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.hAug10"; exon_number "4";\n-7\ttranscript\texon\t34873773\t34873943\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.hAug10"; exon_number "5";\n-7\ttranscript\texon\t34390034\t34390459\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.iAug10"; exon_number "1";\n-7\ttranscript\texon\t34457191\t34457284\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.iAug10"; exon_number "2";\n-7\ttranscript\texon\t34609324\t34609473\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.iAug10"; exon_number "3";\n-7\ttranscript\texon\t34768349\t34768428\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.iAug10"; exon_number "4";\n-7\ttranscript\texon\t34800724\t34800803\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.iAug10"; exon_number "5";\n-7\tprotein_coding\tCDS\t34457198\t34457284\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.bAug10"; exon_number "1"; protein_id "AAA1.bAug10";\n-7\tprotein_coding\tCDS\t34768349\t34768428\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.bAug10"; exon_number "2"; protein_id "AAA1.bAug10";\n-7\tprotein_coding\tCDS\t34797686\t34797710\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.bAug10"; exon_number "3"; protein_id "AAA1.bAug10";\n-7\tprotein_coding\texon\t34386126\t34390459\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.bAug10"; exon_number "1";\n-7\tprotein_coding\texon\t34457191\t34457284\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.bAug10"; exon_number "2";\n-7\tprotein_coding\texon\t34768349\t34768428\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.bAug10"; exon_number "3";\n-7\tprotein_coding\texon\t34797686\t34797884\t.\t-\t0\tgene_id "AAA1"; transcript_id "AAA1.bAug10"; exon_number "4";\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/AceView_ncbi_37.gff3
--- a/fml_gff_converter_programs/t/AceView_ncbi_37.gff3 Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,3164 +0,0 @@\n-##gff-version 3\n-1\tAceView\tgene\t12704566\t12727097\t.\t+\t.\tID=AADACL4;Name=AADACL4\n-1\tAceView\tmRNA\t12704566\t12727097\t.\t+\t.\tID=AADACL4.aAug10;Parent=AADACL4\n-1\tAceView\tCDS\t12704566\t12704733\t.\t+\t0\tParent=AADACL4.aAug10\n-1\tAceView\tCDS\t12711142\t12711358\t.\t+\t0\tParent=AADACL4.aAug10\n-1\tAceView\tCDS\t12721802\t12721865\t.\t+\t2\tParent=AADACL4.aAug10\n-1\tAceView\tCDS\t12725972\t12726746\t.\t+\t1\tParent=AADACL4.aAug10\n-1\tAceView\tthree_prime_UTR\t12726747\t12727097\t.\t+\t.\tParent=AADACL4.aAug10\n-1\tAceView\texon\t12704566\t12704733\t.\t+\t.\tParent=AADACL4.aAug10\n-1\tAceView\texon\t12711142\t12711358\t.\t+\t.\tParent=AADACL4.aAug10\n-1\tAceView\texon\t12721802\t12721865\t.\t+\t.\tParent=AADACL4.aAug10\n-1\tAceView\texon\t12725972\t12727097\t.\t+\t.\tParent=AADACL4.aAug10\n-1\tAceView\tgene\t12776119\t12788726\t.\t+\t.\tID=AADACL3;Name=AADACL3\n-1\tAceView\tmRNA\t12776119\t12788726\t.\t+\t.\tID=AADACL3.bAug10;Parent=AADACL3\n-1\tAceView\tfive_prime_UTR\t12776119\t12776343\t.\t+\t.\tParent=AADACL3.bAug10\n-1\tAceView\tCDS\t12776344\t12776347\t.\t+\t0\tParent=AADACL3.bAug10\n-1\tAceView\tCDS\t12780885\t12780948\t.\t+\t2\tParent=AADACL3.bAug10\n-1\tAceView\tCDS\t12785189\t12785963\t.\t+\t1\tParent=AADACL3.bAug10\n-1\tAceView\tthree_prime_UTR\t12785964\t12788726\t.\t+\t.\tParent=AADACL3.bAug10\n-1\tAceView\texon\t12776119\t12776347\t.\t+\t.\tParent=AADACL3.bAug10\n-1\tAceView\texon\t12780885\t12780948\t.\t+\t.\tParent=AADACL3.bAug10\n-1\tAceView\texon\t12785189\t12788726\t.\t+\t.\tParent=AADACL3.bAug10\n-1\tAceView\tmRNA\t12776119\t12788726\t.\t+\t.\tID=AADACL3.aAug10;Parent=AADACL3\n-1\tAceView\tfive_prime_UTR\t12776119\t12776347\t.\t+\t.\tParent=AADACL3.aAug10\n-1\tAceView\tfive_prime_UTR\t12779477\t12779479\t.\t+\t.\tParent=AADACL3.aAug10\n-1\tAceView\tCDS\t12779480\t12779693\t.\t+\t0\tParent=AADACL3.aAug10\n-1\tAceView\tCDS\t12780885\t12780948\t.\t+\t2\tParent=AADACL3.aAug10\n-1\tAceView\tCDS\t12785189\t12785963\t.\t+\t1\tParent=AADACL3.aAug10\n-1\tAceView\tthree_prime_UTR\t12785964\t12788726\t.\t+\t.\tParent=AADACL3.aAug10\n-1\tAceView\texon\t12776119\t12776347\t.\t+\t.\tParent=AADACL3.aAug10\n-1\tAceView\texon\t12779477\t12779693\t.\t+\t.\tParent=AADACL3.aAug10\n-1\tAceView\texon\t12780885\t12780948\t.\t+\t.\tParent=AADACL3.aAug10\n-1\tAceView\texon\t12785189\t12788726\t.\t+\t.\tParent=AADACL3.aAug10\n-10\tAceView\tgene\t52566307\t52588060\t.\t-\t.\tID=A1CF;Name=A1CF\n-10\tAceView\tmRNA\t52566307\t52645387\t.\t-\t.\tID=A1CF.fAug10;Parent=A1CF\n-10\tAceView\tfive_prime_UTR\t52619701\t52619745\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\tfive_prime_UTR\t52622649\t52622741\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\tfive_prime_UTR\t52623793\t52623840\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\tfive_prime_UTR\t52645341\t52645387\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\tCDS\t52566489\t52566640\t.\t-\t2\tParent=A1CF.fAug10\n-10\tAceView\tCDS\t52569654\t52569802\t.\t-\t1\tParent=A1CF.fAug10\n-10\tAceView\tCDS\t52570800\t52570936\t.\t-\t0\tParent=A1CF.fAug10\n-10\tAceView\tCDS\t52573617\t52573798\t.\t-\t2\tParent=A1CF.fAug10\n-10\tAceView\tCDS\t52575766\t52576039\t.\t-\t0\tParent=A1CF.fAug10\n-10\tAceView\tCDS\t52580312\t52580409\t.\t-\t2\tParent=A1CF.fAug10\n-10\tAceView\tCDS\t52587891\t52588055\t.\t-\t2\tParent=A1CF.fAug10\n-10\tAceView\tCDS\t52595834\t52596072\t.\t-\t1\tParent=A1CF.fAug10\n-10\tAceView\tCDS\t52601622\t52601752\t.\t-\t0\tParent=A1CF.fAug10\n-10\tAceView\tCDS\t52603748\t52603882\t.\t-\t0\tParent=A1CF.fAug10\n-10\tAceView\tCDS\t52619602\t52619700\t.\t-\t0\tParent=A1CF.fAug10\n-10\tAceView\tthree_prime_UTR\t52566307\t52566488\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\texon\t52566307\t52566640\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\texon\t52569654\t52569802\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\texon\t52570800\t52570936\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\texon\t52573617\t52573798\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\texon\t52575766\t52576039\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\texon\t52580312\t52580409\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\texon\t52587891\t52588055\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\texon\t52595834\t52596072\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\texon\t52601622\t52601752\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\texon\t52603748\t52603882\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\texon\t52619602\t52619745\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\texon\t52622649\t52622741\t.\t-\t.\tParent=A1CF.fAug10\n-10\tAceView\texon\t52623793\t52623840\t.\t-\t.\tP'..b'743462\t34800803\t.\t-\t.\tID=AAA1.cAug10;Parent=AAA1\n-7\tAceView\tfive_prime_UTR\t34800803\t34800803\t.\t-\t.\tParent=AAA1.cAug10\n-7\tAceView\tCDS\t34743797\t34743811\t.\t-\t0\tParent=AAA1.cAug10\n-7\tAceView\tCDS\t34768349\t34768428\t.\t-\t2\tParent=AAA1.cAug10\n-7\tAceView\tCDS\t34800724\t34800802\t.\t-\t0\tParent=AAA1.cAug10\n-7\tAceView\tthree_prime_UTR\t34743462\t34743796\t.\t-\t.\tParent=AAA1.cAug10\n-7\tAceView\texon\t34743462\t34743811\t.\t-\t.\tParent=AAA1.cAug10\n-7\tAceView\texon\t34768349\t34768428\t.\t-\t.\tParent=AAA1.cAug10\n-7\tAceView\texon\t34800724\t34800803\t.\t-\t.\tParent=AAA1.cAug10\n-7\tAceView\ttranscript\t34386126\t34797884\t.\t-\t.\tID=AAA1.eAug10;Parent=AAA1\n-7\tAceView\texon\t34386126\t34390459\t.\t-\t.\tParent=AAA1.eAug10\n-7\tAceView\texon\t34457191\t34457284\t.\t-\t.\tParent=AAA1.eAug10\n-7\tAceView\texon\t34609324\t34609473\t.\t-\t.\tParent=AAA1.eAug10\n-7\tAceView\texon\t34768349\t34768428\t.\t-\t.\tParent=AAA1.eAug10\n-7\tAceView\texon\t34797686\t34797884\t.\t-\t.\tParent=AAA1.eAug10\n-7\tAceView\tmRNA\t34607864\t34797884\t.\t-\t.\tID=AAA1.aAug10;Parent=AAA1\n-7\tAceView\tfive_prime_UTR\t34797711\t34797884\t.\t-\t.\tParent=AAA1.aAug10\n-7\tAceView\tCDS\t34609384\t34609473\t.\t-\t0\tParent=AAA1.aAug10\n-7\tAceView\tCDS\t34768349\t34768428\t.\t-\t2\tParent=AAA1.aAug10\n-7\tAceView\tCDS\t34797686\t34797710\t.\t-\t0\tParent=AAA1.aAug10\n-7\tAceView\tthree_prime_UTR\t34607864\t34607984\t.\t-\t.\tParent=AAA1.aAug10\n-7\tAceView\tthree_prime_UTR\t34609324\t34609383\t.\t-\t.\tParent=AAA1.aAug10\n-7\tAceView\texon\t34607864\t34607984\t.\t-\t.\tParent=AAA1.aAug10\n-7\tAceView\texon\t34609324\t34609473\t.\t-\t.\tParent=AAA1.aAug10\n-7\tAceView\texon\t34768349\t34768428\t.\t-\t.\tParent=AAA1.aAug10\n-7\tAceView\texon\t34797686\t34797884\t.\t-\t.\tParent=AAA1.aAug10\n-7\tAceView\tmRNA\t34682839\t34800803\t.\t-\t.\tID=AAA1.dAug10;Parent=AAA1\n-7\tAceView\tfive_prime_UTR\t34800803\t34800803\t.\t-\t.\tParent=AAA1.dAug10\n-7\tAceView\tCDS\t34682958\t34682963\t.\t-\t0\tParent=AAA1.dAug10\n-7\tAceView\tCDS\t34768349\t34768428\t.\t-\t2\tParent=AAA1.dAug10\n-7\tAceView\tCDS\t34800724\t34800802\t.\t-\t0\tParent=AAA1.dAug10\n-7\tAceView\tthree_prime_UTR\t34682839\t34682957\t.\t-\t.\tParent=AAA1.dAug10\n-7\tAceView\texon\t34682839\t34682963\t.\t-\t.\tParent=AAA1.dAug10\n-7\tAceView\texon\t34768349\t34768428\t.\t-\t.\tParent=AAA1.dAug10\n-7\tAceView\texon\t34800724\t34800803\t.\t-\t.\tParent=AAA1.dAug10\n-7\tAceView\ttranscript\t34758479\t34873948\t.\t-\t.\tID=AAA1.gAug10;Parent=AAA1\n-7\tAceView\texon\t34758479\t34759420\t.\t-\t.\tParent=AAA1.gAug10\n-7\tAceView\texon\t34760254\t34760397\t.\t-\t.\tParent=AAA1.gAug10\n-7\tAceView\texon\t34762896\t34763007\t.\t-\t.\tParent=AAA1.gAug10\n-7\tAceView\texon\t34768349\t34768428\t.\t-\t.\tParent=AAA1.gAug10\n-7\tAceView\texon\t34800724\t34800803\t.\t-\t.\tParent=AAA1.gAug10\n-7\tAceView\texon\t34873773\t34873948\t.\t-\t.\tParent=AAA1.gAug10\n-7\tAceView\ttranscript\t34758474\t34873943\t.\t-\t.\tID=AAA1.hAug10;Parent=AAA1\n-7\tAceView\texon\t34758474\t34759420\t.\t-\t.\tParent=AAA1.hAug10\n-7\tAceView\texon\t34762896\t34763007\t.\t-\t.\tParent=AAA1.hAug10\n-7\tAceView\texon\t34768349\t34768428\t.\t-\t.\tParent=AAA1.hAug10\n-7\tAceView\texon\t34807954\t34808052\t.\t-\t.\tParent=AAA1.hAug10\n-7\tAceView\texon\t34873773\t34873943\t.\t-\t.\tParent=AAA1.hAug10\n-7\tAceView\ttranscript\t34390034\t34800803\t.\t-\t.\tID=AAA1.iAug10;Parent=AAA1\n-7\tAceView\texon\t34390034\t34390459\t.\t-\t.\tParent=AAA1.iAug10\n-7\tAceView\texon\t34457191\t34457284\t.\t-\t.\tParent=AAA1.iAug10\n-7\tAceView\texon\t34609324\t34609473\t.\t-\t.\tParent=AAA1.iAug10\n-7\tAceView\texon\t34768349\t34768428\t.\t-\t.\tParent=AAA1.iAug10\n-7\tAceView\texon\t34800724\t34800803\t.\t-\t.\tParent=AAA1.iAug10\n-7\tAceView\tmRNA\t34386126\t34797884\t.\t-\t.\tID=AAA1.bAug10;Parent=AAA1\n-7\tAceView\tfive_prime_UTR\t34797711\t34797884\t.\t-\t.\tParent=AAA1.bAug10\n-7\tAceView\tCDS\t34457198\t34457284\t.\t-\t0\tParent=AAA1.bAug10\n-7\tAceView\tCDS\t34768349\t34768428\t.\t-\t2\tParent=AAA1.bAug10\n-7\tAceView\tCDS\t34797686\t34797710\t.\t-\t0\tParent=AAA1.bAug10\n-7\tAceView\tthree_prime_UTR\t34386126\t34390459\t.\t-\t.\tParent=AAA1.bAug10\n-7\tAceView\tthree_prime_UTR\t34457191\t34457197\t.\t-\t.\tParent=AAA1.bAug10\n-7\tAceView\texon\t34386126\t34390459\t.\t-\t.\tParent=AAA1.bAug10\n-7\tAceView\texon\t34457191\t34457284\t.\t-\t.\tParent=AAA1.bAug10\n-7\tAceView\texon\t34768349\t34768428\t.\t-\t.\tParent=AAA1.bAug10\n-7\tAceView\texon\t34797686\t34797884\t.\t-\t.\tParent=AAA1.bAug10\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/AceView_ncbi_37.gtf
--- a/fml_gff_converter_programs/t/AceView_ncbi_37.gtf Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,3989 +0,0 @@\n-11\tAceView\texon\t111933358\t111934981\t.\t-\t0\tgene_id 2-oxoacid_dh; Gene_type cDNA_supported; transcript_id 2-oxoacid_dh.aAug10-unspliced; exon_number 1\n-19\tAceView\tCDS\t58859154\t58859210\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.aAug10; product_id A1BGAS.aAug10; exon_number 1\n-19\tAceView\texon\t58859153\t58859210\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.aAug10; exon_number 1\n-19\tAceView\tintron\t58859211\t58864686\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.aAug10; type gt_ag\n-19\tAceView\tCDS\t58864687\t58864840\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.aAug10; product_id A1BGAS.aAug10; exon_number 2\n-19\tAceView\texon\t58864687\t58864840\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.aAug10; exon_number 2\n-19\tAceView\tintron\t58864841\t58865079\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.aAug10; type gt_ag\n-19\tAceView\tCDS\t58865080\t58865114\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.aAug10; product_id A1BGAS.aAug10; exon_number 3\n-19\tAceView\texon\t58865080\t58865223\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.aAug10; exon_number 3\n-19\tAceView\tstop_codon\t58865115\t58865117\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.aAug10; product_id A1BGAS.aAug10;\n-19\tAceView\tintron\t58865224\t58865734\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.aAug10; type gt_ag\n-19\tAceView\texon\t58865735\t58866090\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.aAug10; exon_number 4\n-19\tAceView\tstart_codon\t58864404\t58864406\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.bAug10; product_id A1BGAS.bAug10;\n-19\tAceView\tCDS\t58864404\t58864410\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.bAug10; product_id A1BGAS.bAug10; exon_number 1\n-19\tAceView\texon\t58862110\t58864410\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.bAug10; exon_number 1\n-19\tAceView\tintron\t58864411\t58864744\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.bAug10; type gt_ag\n-19\tAceView\tCDS\t58864745\t58864840\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.bAug10; product_id A1BGAS.bAug10; exon_number 2\n-19\tAceView\texon\t58864745\t58864840\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.bAug10; exon_number 2\n-19\tAceView\tintron\t58864841\t58865079\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.bAug10; type gt_ag\n-19\tAceView\tCDS\t58865080\t58865114\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.bAug10; product_id A1BGAS.bAug10; exon_number 3\n-19\tAceView\texon\t58865080\t58865223\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.bAug10; exon_number 3\n-19\tAceView\tstop_codon\t58865115\t58865117\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.bAug10; product_id A1BGAS.bAug10;\n-19\tAceView\tintron\t58865224\t58865734\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.bAug10; type gt_ag\n-19\tAceView\texon\t58865735\t58866548\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.bAug10; exon_number 4\n-19\tAceView\texon\t58859122\t58859210\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.cAug10; exon_number 1\n-19\tAceView\tintron\t58859211\t58864686\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.cAug10; type gt_ag\n-19\tAceView\texon\t58864687\t58864840\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.cAug10; exon_number 2\n-19\tAceView\tintron\t58864841\t58865079\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.cAug10; type gt_ag\n-19\tAceView\tstart_codon\t58865831\t58865833\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; transcript_id A1BGAS.cAug10; product_id A1BGAS.cAug10;\n-19\tAceView\tCDS\t58865831\t58866547\t.\t+\t0\tgene_id A1BGAS; Gene_type cDNA_supported; tran'..b'codon\t219129739\t219129741\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.gAug10; product_id AAMP.gAug10;\n-2\tAceView\tintron\t219129332\t219129738\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.gAug10; type gt_ag\n-2\tAceView\texon\t219128853\t219129331\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.gAug10; exon_number 11\n-2\tAceView\tstart_codon\t219134807\t219134809\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; product_id AAMP.hAug10;\n-2\tAceView\tCDS\t219134689\t219134809\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; product_id AAMP.hAug10; exon_number 1\n-2\tAceView\texon\t219134689\t219134843\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; exon_number 1\n-2\tAceView\tintron\t219134258\t219134688\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; type gt_ag\n-2\tAceView\tCDS\t219134105\t219134257\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; product_id AAMP.hAug10; exon_number 2\n-2\tAceView\texon\t219134105\t219134257\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; exon_number 2\n-2\tAceView\tintron\t219132337\t219134104\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; type gt_ag\n-2\tAceView\tCDS\t219132217\t219132336\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; product_id AAMP.hAug10; exon_number 3\n-2\tAceView\texon\t219132217\t219132336\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; exon_number 3\n-2\tAceView\tintron\t219131710\t219132216\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; type gt_ag\n-2\tAceView\tCDS\t219131570\t219131709\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; product_id AAMP.hAug10; exon_number 4\n-2\tAceView\texon\t219131570\t219131709\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; exon_number 4\n-2\tAceView\tintron\t219131311\t219131569\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; type gt_ag\n-2\tAceView\tCDS\t219131166\t219131310\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; product_id AAMP.hAug10; exon_number 5\n-2\tAceView\texon\t219131166\t219131310\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; exon_number 5\n-2\tAceView\tintron\t219130871\t219131165\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; type gt_ag\n-2\tAceView\tCDS\t219130787\t219130870\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; product_id AAMP.hAug10; exon_number 6\n-2\tAceView\texon\t219130787\t219130870\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; exon_number 6\n-2\tAceView\tintron\t219130670\t219130786\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; type gt_ag\n-2\tAceView\tCDS\t219130392\t219130669\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; product_id AAMP.hAug10; exon_number 7\n-2\tAceView\texon\t219130302\t219130669\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; exon_number 7\n-2\tAceView\tstop_codon\t219130389\t219130391\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; product_id AAMP.hAug10;\n-2\tAceView\tintron\t219130185\t219130301\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; type gt_ag\n-2\tAceView\texon\t219130094\t219130184\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; exon_number 8\n-2\tAceView\tintron\t219129898\t219130093\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; type gt_ag\n-2\tAceView\texon\t219129743\t219129897\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; exon_number 9\n-2\tAceView\tintron\t219129332\t219129742\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; type gt_ag\n-2\tAceView\texon\t219128853\t219129331\t.\t-\t0\tgene_id AAMP; Gene_type cDNA_supported; transcript_id AAMP.hAug10; exon_number 10\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/Aly_JGI.bed
--- a/fml_gff_converter_programs/t/Aly_JGI.bed Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,252 +0,0 @@\n-scaffold_1\t10652\t11944\tTranscript:909750\t.\t-\t10652\t11944\t0\t5\t60,225,159,360,132,\t0,130,392,608,1161,\n-scaffold_1\t464358\t467635\tTranscript:470161\t.\t+\t464358\t467635\t0\t18\t202,63,72,108,81,54,126,90,126,108,117,57,81,5,174,10,20,31,\t0,599,744,929,1130,1329,1606,1827,2037,2253,2453,2649,2797,2915,2978,3174,3209,3247,\n-scaffold_1\t548765\t550572\tTranscript:918872\t.\t-\t548765\t550572\t0\t4\t170,274,174,28,\t0,593,965,1780,\n-scaffold_1\t80941\t82330\tTranscript:470071\t.\t+\t80941\t82330\t0\t2\t277,1014,\t0,376,\n-scaffold_1\t483425\t484126\tTranscript:311324\t.\t+\t483425\t484126\t0\t3\t72,216,54,\t0,343,648,\n-scaffold_1\t124594\t126229\tTranscript:470077\t.\t-\t124594\t126229\t0\t2\t792,700,\t0,936,\n-scaffold_1\t122468\t124310\tTranscript:470076\t.\t+\t122468\t124310\t0\t4\t1626,84,4,79,\t0,1653,1749,1764,\n-scaffold_1\t118008\t119066\tTranscript:470075\t.\t+\t118008\t119066\t0\t6\t49,732,57,44,56,29,\t0,66,819,899,960,1030,\n-scaffold_1\t90714\t113497\tTranscript:470074\t.\t-\t90714\t113497\t0\t22\t358,90,91,197,78,117,96,126,97,71,99,84,171,129,33,97,51,79,82,56,90,75,\t0,18818,19001,19212,19496,19665,19860,20042,20257,20536,20684,20856,21022,21382,21651,21776,21947,22094,22283,22441,22603,22709,\n-scaffold_1\t621551\t622441\tTranscript:909905\t.\t-\t621551\t622441\t0\t5\t85,140,120,77,151,\t0,175,386,578,740,\n-scaffold_1\t134679\t139817\tTranscript:470079\t.\t-\t134679\t139817\t0\t20\t19,354,129,144,126,126,93,133,132,135,108,204,89,150,154,222,186,123,101,172,\t0,42,474,701,922,1186,1410,1591,2068,2300,2516,2754,3045,3252,3555,3824,4128,4399,4621,4967,\n-scaffold_1\t127652\t134697\tTranscript:470078\t.\t+\t127652\t134697\t0\t11\t1166,318,355,423,293,274,500,322,358,650,825,\t0,1371,1874,2318,2863,3443,4043,4627,5028,5477,6221,\n-scaffold_1\t798586\t799140\tTranscript:470261\t.\t-\t798586\t799140\t0\t1\t555,\t0,\n-scaffold_1\t684011\t686672\tTranscript:909919\t.\t-\t684011\t686672\t0\t7\t81,96,78,139,68,72,129,\t0,250,454,1428,1680,2360,2533,\n-scaffold_1\t156948\t159348\tTranscript:311256\t.\t+\t156948\t159348\t0\t8\t195,177,171,182,212,120,479,237,\t0,384,637,888,1149,1393,1594,2164,\n-scaffold_1\t560362\t562206\tTranscript:470187\t.\t-\t560362\t562206\t0\t5\t42,10,290,190,1058,\t0,68,97,508,787,\n-scaffold_1\t860953\t874764\tTranscript:311407\t.\t-\t860953\t874764\t0\t19\t1236,147,307,309,3006,1651,1115,103,700,1166,115,66,192,174,57,136,86,73,182,\t0,1347,1588,2052,2707,5983,7729,8952,9162,10011,11333,11619,11827,12166,12484,12621,12933,13180,13630,\n-scaffold_1\t771817\t774951\tTranscript:909946\t.\t+\t771817\t774951\t0\t11\t108,198,76,72,88,256,116,221,19,72,256,\t0,231,535,682,855,1025,1350,1548,2129,2732,2879,\n-scaffold_1\t479138\t481385\tTranscript:333544\t.\t-\t479138\t481385\t0\t5\t792,141,246,297,108,\t0,874,1318,1748,2140,\n-scaffold_1\t765430\t766468\tTranscript:918940\t.\t-\t765430\t766468\t0\t2\t529,96,\t0,943,\n-scaffold_1\t766738\t768326\tTranscript:918941\t.\t-\t766738\t768326\t0\t3\t211,573,73,\t0,578,1516,\n-scaffold_1\t849099\t851591\tTranscript:470280\t.\t+\t849099\t851591\t0\t15\t76,3,107,44,78,193,31,59,74,87,62,29,136,100,30,\t0,111,179,357,767,954,1237,1349,1498,1683,1859,2016,2146,2311,2463,\n-scaffold_1\t57579\t57871\tTranscript:918741\t.\t+\t57579\t57871\t0\t1\t293,\t0,\n-scaffold_1\t58865\t72177\tTranscript:918742\t.\t+\t58865\t72177\t0\t49\t113,298,229,340,114,129,192,195,138,123,254,104,95,225,162,73,68,146,109,147,48,129,243,138,86,106,96,153,183,230,103,126,45,195,127,101,135,84,147,132,48,147,62,154,192,105,135,201,280,\t0,209,584,904,1435,1635,1838,2119,2442,2671,2877,3232,3571,3753,4317,4563,4746,4949,5205,5406,5644,5766,6016,6374,6597,6778,7127,7380,7687,7945,8270,8456,8667,8886,9339,9568,9964,10175,10675,10910,11151,11293,11507,11767,12025,12306,12503,12726,13033,\n-scaffold_1\t356318\t357400\tTranscript:470139\t.\t-\t356318\t357400\t0\t2\t727,329,\t0,754,\n-scaffold_1\t786367\t786721\tTranscript:918948\t.\t+\t786367\t786721\t0\t1\t355,\t0,\n-scaffold_1\t787193\t787397\tTranscript:918949\t.\t+\t787193\t787397\t0\t1\t205,\t0,\n-scaffold_1\t511272\t518844\tTranscript:333551\t.\t-\t511272\t518844\t0\t21\t178,407,165,906,191,98,180,629,96,162,183,151,234,161,123,173,220,395,211,114,912,\t0,277,781,1027,2017,2290,2433,2661,3383,3564,3812,4083,4319,4644,4887,5097,5359,5667,'..b'66,3239,3467,3671,3861,4147,4363,4706,4918,5333,\n-scaffold_1\t3311\t6198\tTranscript:470048\t.\t-\t3311\t6198\t0\t9\t180,196,273,201,108,7,378,220,568,\t0,202,499,853,1172,1419,1540,2008,2320,\n-scaffold_1\t9512\t10567\tTranscript:470049\t.\t+\t9512\t10567\t0\t3\t53,82,695,\t0,64,361,\n-scaffold_1\t605536\t607891\tTranscript:470198\t.\t+\t605536\t607891\t0\t2\t765,1020,\t0,1336,\n-scaffold_1\t652374\t653539\tTranscript:470212\t.\t-\t652374\t653539\t0\t2\t302,779,\t0,387,\n-scaffold_1\t650407\t652252\tTranscript:470210\t.\t+\t650407\t652252\t0\t4\t44,59,162,1193,\t0,71,447,653,\n-scaffold_1\t436171\t436800\tTranscript:311313\t.\t-\t436171\t436800\t0\t3\t125,91,105,\t0,320,525,\n-scaffold_1\t428322\t429820\tTranscript:311310\t.\t-\t428322\t429820\t0\t7\t264,33,90,44,64,152,100,\t0,304,434,682,800,1084,1399,\n-scaffold_1\t152834\t155670\tTranscript:909785\t.\t-\t152834\t155670\t0\t2\t1675,650,\t0,2187,\n-scaffold_1\t266834\t270418\tTranscript:470119\t.\t-\t266834\t270418\t0\t8\t237,716,244,328,235,195,103,217,\t0,337,1181,1516,2329,2822,3099,3368,\n-scaffold_1\t759661\t760663\tTranscript:470240\t.\t+\t759661\t760663\t0\t6\t141,48,531,103,15,69,\t0,157,223,768,896,934,\n-scaffold_1\t441904\t443720\tTranscript:311315\t.\t+\t441904\t443720\t0\t2\t993,627,\t0,1190,\n-scaffold_1\t246920\t248384\tTranscript:470115\t.\t+\t246920\t248384\t0\t9\t62,353,296,141,98,45,17,18,35,\t0,93,533,905,1131,1248,1320,1372,1430,\n-scaffold_1\t501318\t503489\tTranscript:918864\t.\t+\t501318\t503489\t0\t6\t224,63,117,400,152,171,\t0,353,605,1211,1738,2001,\n-scaffold_1\t236778\t241721\tTranscript:470110\t.\t+\t236778\t241721\t0\t16\t103,180,159,112,120,152,183,35,163,54,93,143,71,84,294,60,\t0,114,768,1022,1245,1446,1715,1992,2454,2779,3173,3418,4243,4395,4571,4884,\n-scaffold_1\t243695\t245459\tTranscript:470113\t.\t-\t243695\t245459\t0\t6\t44,166,111,106,296,330,\t0,68,317,838,1049,1435,\n-scaffold_1\t330470\t334264\tTranscript:909830\t.\t+\t330470\t334264\t0\t3\t785,1961,167,\t0,1600,3628,\n-scaffold_1\t313347\t315496\tTranscript:918815\t.\t-\t313347\t315496\t0\t4\t1048,161,132,191,\t0,1315,1707,1959,\n-scaffold_1\t46396\t48761\tTranscript:470065\t.\t+\t46396\t48761\t0\t9\t164,76,89,90,45,70,140,89,172,\t0,297,454,641,850,991,1267,2009,2194,\n-scaffold_1\t365041\t365523\tTranscript:909838\t.\t-\t365041\t365523\t0\t3\t97,88,67,\t0,207,416,\n-scaffold_1\t367696\t369417\tTranscript:909839\t.\t-\t367696\t369417\t0\t8\t56,76,78,105,35,46,98,52,\t0,195,411,905,1212,1368,1514,1670,\n-scaffold_1\t779234\t780531\tTranscript:470252\t.\t-\t779234\t780531\t0\t5\t39,196,122,348,72,\t0,68,345,860,1226,\n-scaffold_1\t768374\t770507\tTranscript:311385\t.\t+\t768374\t770507\t0\t10\t87,66,75,145,47,58,50,57,248,61,\t0,187,401,562,791,924,1055,1197,1336,2073,\n-scaffold_1\t328181\t328380\tTranscript:918819\t.\t+\t328181\t328380\t0\t1\t200,\t0,\n-scaffold_1\t816662\t816932\tTranscript:918958\t.\t+\t816662\t816932\t0\t1\t271,\t0,\n-scaffold_1\t928527\t930332\tTranscript:470297\t.\t+\t928527\t930332\t0\t8\t70,161,58,309,49,95,80,115,\t0,91,692,829,1236,1365,1538,1691,\n-scaffold_1\t113969\t115315\tTranscript:918756\t.\t-\t113969\t115315\t0\t1\t1347,\t0,\n-scaffold_1\t915290\t920350\tTranscript:470295\t.\t+\t915290\t920350\t0\t11\t515,539,69,198,132,87,441,168,231,116,401,\t0,612,1269,1908,2192,2873,3050,3591,3934,4388,4660,\n-scaffold_1\t731564\t731809\tTranscript:918928\t.\t-\t731564\t731809\t0\t1\t246,\t0,\n-scaffold_1\t835648\t840282\tTranscript:470277\t.\t-\t835648\t840282\t0\t15\t32,47,96,81,145,272,123,159,282,189,184,142,244,636,320,\t0,68,133,312,500,728,1086,1294,1572,2023,2474,2846,3211,3532,4315,\n-scaffold_1\t628623\t630888\tTranscript:918899\t.\t-\t628623\t630888\t0\t6\t531,85,208,48,168,476,\t0,616,930,1217,1528,1790,\n-scaffold_1\t800563\t803042\tTranscript:918952\t.\t-\t800563\t803042\t0\t9\t95,97,101,189,158,232,136,104,291,\t0,235,433,623,894,1134,1449,1701,2189,\n-scaffold_1\t617112\t618613\tTranscript:918895\t.\t+\t617112\t618613\t0\t7\t100,316,53,73,105,81,176,\t0,218,634,797,960,1155,1326,\n-scaffold_1\t601736\t602094\tTranscript:918890\t.\t-\t601736\t602094\t0\t1\t359,\t0,\n-scaffold_1\t602514\t603658\tTranscript:918891\t.\t+\t602514\t603658\t0\t3\t510,145,81,\t0,792,1064,\n-scaffold_1\t930976\t935010\tTranscript:470299\t.\t+\t930976\t935010\t0\t13\t548,281,83,404,194,181,125,90,79,77,107,157,111,\t0,688,1271,1460,1997,2278,2610,2903,3099,3272,3549,3754,3924,\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/Aly_JGI.gff3
--- a/fml_gff_converter_programs/t/Aly_JGI.gff3 Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,3996 +0,0 @@\n-##gff-version 3\n-##sequence-region scaffold_1 1 33132539\n-scaffold_1\tGenomic_canonical\tregion\t1\t33132539\t.\t+\t.\tID=scaffold_1;Name=scaffold_1\n-scaffold_1\tJGI_Filtered\tgene\t47\t2523\t.\t-\t.\tID=scaffold_100001.1;Name=scaffold_100001.1\n-scaffold_1\tJGI_Filtered\tmRNA\t47\t2523\t.\t-\t.\tID=Transcript:918720;Name=Transcript:918720;Parent=scaffold_100001.1\n-scaffold_1\tJGI_Filtered\tthree_prime_UTR\t47\t66\t.\t-\t.\tID=three_prime_UTR:918720;Parent=Transcript:918720\n-scaffold_1\tJGI_Filtered\texon\t47\t252\t.\t-\t.\tID=exon:918720;Parent=Transcript:918720\n-scaffold_1\tJGI_Filtered\tCDS\t67\t252\t.\t-\t0\tID=CDS:918720;Parent=Transcript:918720;Name=scaffold_100001.1\n-scaffold_1\tJGI_Filtered\texon\t407\t782\t.\t-\t.\tID=exon:918720;Parent=Transcript:918720\n-scaffold_1\tJGI_Filtered\tCDS\t407\t782\t.\t-\t0\tID=CDS:918720;Parent=Transcript:918720;Name=scaffold_100001.1\n-scaffold_1\tJGI_Filtered\texon\t1423\t1642\t.\t-\t.\tID=exon:918720;Parent=Transcript:918720\n-scaffold_1\tJGI_Filtered\tCDS\t1423\t1642\t.\t-\t1\tID=CDS:918720;Parent=Transcript:918720;Name=scaffold_100001.1\n-scaffold_1\tJGI_Filtered\texon\t1803\t2035\t.\t-\t.\tID=exon:918720;Parent=Transcript:918720\n-scaffold_1\tJGI_Filtered\tCDS\t1803\t2035\t.\t-\t2\tID=CDS:918720;Parent=Transcript:918720;Name=scaffold_100001.1\n-scaffold_1\tJGI_Filtered\texon\t2124\t2347\t.\t-\t.\tID=exon:918720;Parent=Transcript:918720\n-scaffold_1\tJGI_Filtered\tCDS\t2124\t2347\t.\t-\t1\tID=CDS:918720;Parent=Transcript:918720;Name=scaffold_100001.1\n-scaffold_1\tJGI_Filtered\texon\t2444\t2523\t.\t-\t.\tID=exon:918720;Parent=Transcript:918720\n-scaffold_1\tJGI_Filtered\tCDS\t2444\t2503\t.\t-\t0\tID=CDS:918720;Parent=Transcript:918720;Name=scaffold_100001.1\n-scaffold_1\tJGI_Filtered\tfive_prime_UTR\t2504\t2523\t.\t-\t.\tID=five_prime_UTR:918720;Parent=Transcript:918720\n-scaffold_1\tJGI_Filtered\tgene\t3311\t6198\t.\t-\t.\tID=fgenesh2_kg.1__2__AT1G02190.2;Name=fgenesh2_kg.1__2__AT1G02190.2\n-scaffold_1\tJGI_Filtered\tmRNA\t3311\t6198\t.\t-\t.\tID=Transcript:470048;Name=Transcript:470048;Parent=fgenesh2_kg.1__2__AT1G02190.2\n-scaffold_1\tJGI_Filtered\tthree_prime_UTR\t3311\t3490\t.\t-\t.\tID=three_prime_UTR:470048;Parent=Transcript:470048\n-scaffold_1\tJGI_Filtered\tthree_prime_UTR\t3513\t3528\t.\t-\t.\tID=three_prime_UTR:470048;Parent=Transcript:470048\n-scaffold_1\tJGI_Filtered\texon\t3311\t3490\t.\t-\t.\tID=exon:470048;Parent=Transcript:470048\n-scaffold_1\tJGI_Filtered\texon\t3513\t3708\t.\t-\t.\tID=exon:470048;Parent=Transcript:470048\n-scaffold_1\tJGI_Filtered\tCDS\t3529\t3708\t.\t-\t0\tID=CDS:470048;Parent=Transcript:470048;Name=fgenesh2_kg.1__2__AT1G02190.2\n-scaffold_1\tJGI_Filtered\texon\t3810\t4082\t.\t-\t.\tID=exon:470048;Parent=Transcript:470048\n-scaffold_1\tJGI_Filtered\tCDS\t3810\t4082\t.\t-\t0\tID=CDS:470048;Parent=Transcript:470048;Name=fgenesh2_kg.1__2__AT1G02190.2\n-scaffold_1\tJGI_Filtered\texon\t4164\t4364\t.\t-\t.\tID=exon:470048;Parent=Transcript:470048\n-scaffold_1\tJGI_Filtered\tCDS\t4164\t4364\t.\t-\t0\tID=CDS:470048;Parent=Transcript:470048;Name=fgenesh2_kg.1__2__AT1G02190.2\n-scaffold_1\tJGI_Filtered\texon\t4483\t4590\t.\t-\t.\tID=exon:470048;Parent=Transcript:470048\n-scaffold_1\tJGI_Filtered\tCDS\t4483\t4590\t.\t-\t0\tID=CDS:470048;Parent=Transcript:470048;Name=fgenesh2_kg.1__2__AT1G02190.2\n-scaffold_1\tJGI_Filtered\texon\t4730\t4736\t.\t-\t.\tID=exon:470048;Parent=Transcript:470048\n-scaffold_1\tJGI_Filtered\tCDS\t4730\t4736\t.\t-\t0\tID=CDS:470048;Parent=Transcript:470048;Name=fgenesh2_kg.1__2__AT1G02190.2\n-scaffold_1\tJGI_Filtered\texon\t4851\t5228\t.\t-\t.\tID=exon:470048;Parent=Transcript:470048\n-scaffold_1\tJGI_Filtered\tCDS\t4851\t5228\t.\t-\t1\tID=CDS:470048;Parent=Transcript:470048;Name=fgenesh2_kg.1__2__AT1G02190.2\n-scaffold_1\tJGI_Filtered\texon\t5319\t5538\t.\t-\t.\tID=exon:470048;Parent=Transcript:470048\n-scaffold_1\tJGI_Filtered\tCDS\t5319\t5538\t.\t-\t1\tID=CDS:470048;Parent=Transcript:470048;Name=fgenesh2_kg.1__2__AT1G02190.2\n-scaffold_1\tJGI_Filtered\texon\t5631\t6198\t.\t-\t.\tID=exon:470048;Parent=Transcript:470048\n-scaffold_1\tJGI_Filtered\tCDS\t5631\t6123\t.\t-\t2\tID=CDS:470048;Parent=Transcript:470048;Name=fgenesh2_kg.1__2__AT1G02190.2\n-scaffold_1\tJGI_Filtered\tfive_prime_UTR\t6124\t6198\t.\t-\t.\tID=five_prime_UTR:470048;Parent=Transcri'..b'me=fgenesh2_kg.1__253__AT1G03190.1\n-scaffold_1\tJGI_Filtered\texon\t933586\t933710\t.\t+\t.\tID=exon:470299;Parent=Transcript:470299\n-scaffold_1\tJGI_Filtered\tCDS\t933586\t933710\t.\t+\t0\tID=CDS:470299;Parent=Transcript:470299;Name=fgenesh2_kg.1__253__AT1G03190.1\n-scaffold_1\tJGI_Filtered\texon\t933879\t933968\t.\t+\t.\tID=exon:470299;Parent=Transcript:470299\n-scaffold_1\tJGI_Filtered\tCDS\t933879\t933968\t.\t+\t2\tID=CDS:470299;Parent=Transcript:470299;Name=fgenesh2_kg.1__253__AT1G03190.1\n-scaffold_1\tJGI_Filtered\texon\t934075\t934153\t.\t+\t.\tID=exon:470299;Parent=Transcript:470299\n-scaffold_1\tJGI_Filtered\tCDS\t934075\t934153\t.\t+\t2\tID=CDS:470299;Parent=Transcript:470299;Name=fgenesh2_kg.1__253__AT1G03190.1\n-scaffold_1\tJGI_Filtered\texon\t934248\t934324\t.\t+\t.\tID=exon:470299;Parent=Transcript:470299\n-scaffold_1\tJGI_Filtered\tCDS\t934248\t934324\t.\t+\t0\tID=CDS:470299;Parent=Transcript:470299;Name=fgenesh2_kg.1__253__AT1G03190.1\n-scaffold_1\tJGI_Filtered\texon\t934525\t934631\t.\t+\t.\tID=exon:470299;Parent=Transcript:470299\n-scaffold_1\tJGI_Filtered\tCDS\t934525\t934631\t.\t+\t2\tID=CDS:470299;Parent=Transcript:470299;Name=fgenesh2_kg.1__253__AT1G03190.1\n-scaffold_1\tJGI_Filtered\texon\t934730\t934886\t.\t+\t.\tID=exon:470299;Parent=Transcript:470299\n-scaffold_1\tJGI_Filtered\tCDS\t934730\t934851\t.\t+\t1\tID=CDS:470299;Parent=Transcript:470299;Name=fgenesh2_kg.1__253__AT1G03190.1\n-scaffold_1\tJGI_Filtered\texon\t934900\t935010\t.\t+\t.\tID=exon:470299;Parent=Transcript:470299\n-scaffold_1\tJGI_Filtered\tthree_prime_UTR\t934852\t934886\t.\t+\t.\tID=three_prime_UTR:470299;Parent=Transcript:470299\n-scaffold_1\tJGI_Filtered\tthree_prime_UTR\t934900\t935010\t.\t+\t.\tID=three_prime_UTR:470299;Parent=Transcript:470299\n-scaffold_1\tJGI_Filtered\tgene\t938460\t939704\t.\t+\t.\tID=fgenesh1_pm.C_scaffold_1000202;Name=fgenesh1_pm.C_scaffold_1000202\n-scaffold_1\tJGI_Filtered\tmRNA\t938460\t939704\t.\t+\t.\tID=Transcript:311422;Name=Transcript:311422;Parent=fgenesh1_pm.C_scaffold_1000202\n-scaffold_1\tJGI_Filtered\texon\t938460\t938693\t.\t+\t.\tID=exon:311422;Parent=Transcript:311422\n-scaffold_1\tJGI_Filtered\tCDS\t938460\t938693\t.\t+\t0\tID=CDS:311422;Parent=Transcript:311422;Name=fgenesh1_pm.C_scaffold_1000202\n-scaffold_1\tJGI_Filtered\texon\t938841\t939104\t.\t+\t.\tID=exon:311422;Parent=Transcript:311422\n-scaffold_1\tJGI_Filtered\tCDS\t938841\t939104\t.\t+\t0\tID=CDS:311422;Parent=Transcript:311422;Name=fgenesh1_pm.C_scaffold_1000202\n-scaffold_1\tJGI_Filtered\texon\t939200\t939361\t.\t+\t.\tID=exon:311422;Parent=Transcript:311422\n-scaffold_1\tJGI_Filtered\tCDS\t939200\t939361\t.\t+\t0\tID=CDS:311422;Parent=Transcript:311422;Name=fgenesh1_pm.C_scaffold_1000202\n-scaffold_1\tJGI_Filtered\texon\t939394\t939477\t.\t+\t.\tID=exon:311422;Parent=Transcript:311422\n-scaffold_1\tJGI_Filtered\tCDS\t939394\t939477\t.\t+\t0\tID=CDS:311422;Parent=Transcript:311422;Name=fgenesh1_pm.C_scaffold_1000202\n-scaffold_1\tJGI_Filtered\texon\t939597\t939704\t.\t+\t.\tID=exon:311422;Parent=Transcript:311422\n-scaffold_1\tJGI_Filtered\tCDS\t939597\t939704\t.\t+\t0\tID=CDS:311422;Parent=Transcript:311422;Name=fgenesh1_pm.C_scaffold_1000202\n-scaffold_1\tJGI_Filtered\tgene\t940106\t941321\t.\t-\t.\tID=scaffold_100268.1;Name=scaffold_100268.1\n-scaffold_1\tJGI_Filtered\tmRNA\t940106\t941321\t.\t-\t.\tID=Transcript:918987;Name=Transcript:918987;Parent=scaffold_100268.1\n-scaffold_1\tJGI_Filtered\tthree_prime_UTR\t940106\t940125\t.\t-\t.\tID=three_prime_UTR:918987;Parent=Transcript:918987\n-scaffold_1\tJGI_Filtered\texon\t940106\t940528\t.\t-\t.\tID=exon:918987;Parent=Transcript:918987\n-scaffold_1\tJGI_Filtered\tCDS\t940126\t940528\t.\t-\t0\tID=CDS:918987;Parent=Transcript:918987;Name=scaffold_100268.1\n-scaffold_1\tJGI_Filtered\texon\t940628\t940686\t.\t-\t.\tID=exon:918987;Parent=Transcript:918987\n-scaffold_1\tJGI_Filtered\tCDS\t940628\t940686\t.\t-\t1\tID=CDS:918987;Parent=Transcript:918987;Name=scaffold_100268.1\n-scaffold_1\tJGI_Filtered\texon\t941262\t941321\t.\t-\t.\tID=exon:918987;Parent=Transcript:918987\n-scaffold_1\tJGI_Filtered\tCDS\t941262\t941300\t.\t-\t0\tID=CDS:918987;Parent=Transcript:918987;Name=scaffold_100268.1\n-scaffold_1\tJGI_Filtered\tfive_prime_UTR\t941301\t941321\t.\t-\t.\tID=five_prime_UTR:918987;Parent=Transcript:918987\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/ENSEMBL_mm9.gff3
--- a/fml_gff_converter_programs/t/ENSEMBL_mm9.gff3 Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,4424 +0,0 @@\n-##gff-version 3\n-18\tlincRNA\tgene\t3336414\t3366861\t.\t+\t.\tID=ENSMUSG00000091488;Name=AC124336.2\n-18\tlincRNA\ttranscript\t3336414\t3366861\t.\t+\t.\tID=ENSMUST00000171726;Parent=ENSMUSG00000091488;Name=AC124336.2-201\n-18\tlincRNA\texon\t3336414\t3337176\t.\t+\t.\tParent=ENSMUST00000171726\n-18\tlincRNA\texon\t3365925\t3366861\t.\t+\t.\tParent=ENSMUST00000171726\n-18\tprotein_coding\tgene\t9314042\t9450148\t.\t-\t.\tID=ENSMUSG00000024286;Name=Ccny\n-18\tprotein_coding\tmRNA\t9314042\t9450148\t.\t-\t.\tID=ENSMUST00000053917;Parent=ENSMUSG00000024286;Name=Ccny-201\n-18\tprotein_coding\tfive_prime_UTR\t9449670\t9450148\t.\t-\t.\tParent=ENSMUST00000053917\n-18\tprotein_coding\tCDS\t9316554\t9316670\t.\t-\t0\tParent=ENSMUST00000053917\n-18\tprotein_coding\tCDS\t9319407\t9319569\t.\t-\t1\tParent=ENSMUST00000053917\n-18\tprotein_coding\tCDS\t9332782\t9332948\t.\t-\t0\tParent=ENSMUST00000053917\n-18\tprotein_coding\tCDS\t9345192\t9345311\t.\t-\t0\tParent=ENSMUST00000053917\n-18\tprotein_coding\tCDS\t9345412\t9345469\t.\t-\t1\tParent=ENSMUST00000053917\n-18\tprotein_coding\tCDS\t9349386\t9349421\t.\t-\t1\tParent=ENSMUST00000053917\n-18\tprotein_coding\tCDS\t9353405\t9353505\t.\t-\t0\tParent=ENSMUST00000053917\n-18\tprotein_coding\tCDS\t9377792\t9377826\t.\t-\t2\tParent=ENSMUST00000053917\n-18\tprotein_coding\tCDS\t9386733\t9386807\t.\t-\t2\tParent=ENSMUST00000053917\n-18\tprotein_coding\tCDS\t9449516\t9449669\t.\t-\t0\tParent=ENSMUST00000053917\n-18\tprotein_coding\tthree_prime_UTR\t9314042\t9316553\t.\t-\t.\tParent=ENSMUST00000053917\n-18\tprotein_coding\texon\t9314042\t9316670\t.\t-\t.\tParent=ENSMUST00000053917\n-18\tprotein_coding\texon\t9319407\t9319569\t.\t-\t.\tParent=ENSMUST00000053917\n-18\tprotein_coding\texon\t9332782\t9332948\t.\t-\t.\tParent=ENSMUST00000053917\n-18\tprotein_coding\texon\t9345192\t9345311\t.\t-\t.\tParent=ENSMUST00000053917\n-18\tprotein_coding\texon\t9345412\t9345469\t.\t-\t.\tParent=ENSMUST00000053917\n-18\tprotein_coding\texon\t9349386\t9349421\t.\t-\t.\tParent=ENSMUST00000053917\n-18\tprotein_coding\texon\t9353405\t9353505\t.\t-\t.\tParent=ENSMUST00000053917\n-18\tprotein_coding\texon\t9377792\t9377826\t.\t-\t.\tParent=ENSMUST00000053917\n-18\tprotein_coding\texon\t9386733\t9386807\t.\t-\t.\tParent=ENSMUST00000053917\n-18\tprotein_coding\texon\t9449516\t9450148\t.\t-\t.\tParent=ENSMUST00000053917\n-18\tprotein_coding\tmRNA\t9314042\t9450148\t.\t-\t.\tID=ENSMUST00000115867;Parent=ENSMUSG00000024286;Name=Ccny-202\n-18\tprotein_coding\tfive_prime_UTR\t9449670\t9450148\t.\t-\t.\tParent=ENSMUST00000115867\n-18\tprotein_coding\tCDS\t9316554\t9316670\t.\t-\t0\tParent=ENSMUST00000115867\n-18\tprotein_coding\tCDS\t9319407\t9319569\t.\t-\t1\tParent=ENSMUST00000115867\n-18\tprotein_coding\tCDS\t9332782\t9332948\t.\t-\t0\tParent=ENSMUST00000115867\n-18\tprotein_coding\tCDS\t9345192\t9345311\t.\t-\t0\tParent=ENSMUST00000115867\n-18\tprotein_coding\tCDS\t9345412\t9345469\t.\t-\t1\tParent=ENSMUST00000115867\n-18\tprotein_coding\tCDS\t9349386\t9349421\t.\t-\t1\tParent=ENSMUST00000115867\n-18\tprotein_coding\tCDS\t9353405\t9353505\t.\t-\t0\tParent=ENSMUST00000115867\n-18\tprotein_coding\tCDS\t9377792\t9377826\t.\t-\t2\tParent=ENSMUST00000115867\n-18\tprotein_coding\tCDS\t9449516\t9449669\t.\t-\t0\tParent=ENSMUST00000115867\n-18\tprotein_coding\tthree_prime_UTR\t9314042\t9316553\t.\t-\t.\tParent=ENSMUST00000115867\n-18\tprotein_coding\texon\t9314042\t9316670\t.\t-\t.\tParent=ENSMUST00000115867\n-18\tprotein_coding\texon\t9319407\t9319569\t.\t-\t.\tParent=ENSMUST00000115867\n-18\tprotein_coding\texon\t9332782\t9332948\t.\t-\t.\tParent=ENSMUST00000115867\n-18\tprotein_coding\texon\t9345192\t9345311\t.\t-\t.\tParent=ENSMUST00000115867\n-18\tprotein_coding\texon\t9345412\t9345469\t.\t-\t.\tParent=ENSMUST00000115867\n-18\tprotein_coding\texon\t9349386\t9349421\t.\t-\t.\tParent=ENSMUST00000115867\n-18\tprotein_coding\texon\t9353405\t9353505\t.\t-\t.\tParent=ENSMUST00000115867\n-18\tprotein_coding\texon\t9377792\t9377826\t.\t-\t.\tParent=ENSMUST00000115867\n-18\tprotein_coding\texon\t9449516\t9450148\t.\t-\t.\tParent=ENSMUST00000115867\n-18\tmiRNA\tgene\t10782897\t10782983\t.\t-\t.\tID=ENSMUSG00000065399;Name=Mir133a-1\n-18\tmiRNA\ttranscript\t10782897\t10782983\t.\t-\t.\tID=ENSMUST00000083465;Parent=ENSMUSG00000065399;Name=Mir133a-1-201\n-18\tmiRNA\texon\t10782897\t10782983\t.\t-\t.\tParent=ENSMUST00000083465\n-18\tprotein_coding\tgene\t9726195\t9726668\t.\t-\t.\tID='..b'694\n-NT_166402\tprotein_coding\texon\t36964\t37064\t.\t+\t.\tParent=ENSMUST00000096694\n-NT_166402\tprotein_coding\texon\t37217\t38054\t.\t+\t.\tParent=ENSMUST00000096694\n-NT_166433\tprotein_coding\tgene\t28587\t52512\t.\t+\t.\tID=ENSMUSG00000078423;Name=AC007307.2\n-NT_166433\tprotein_coding\tmRNA\t28587\t52512\t.\t+\t.\tID=ENSMUST00000105217;Parent=ENSMUSG00000078423;Name=AC007307.2-201\n-NT_166433\tprotein_coding\tfive_prime_UTR\t28587\t28657\t.\t+\t.\tParent=ENSMUST00000105217\n-NT_166433\tprotein_coding\tCDS\t28658\t28798\t.\t+\t0\tParent=ENSMUST00000105217\n-NT_166433\tprotein_coding\tCDS\t31129\t31299\t.\t+\t0\tParent=ENSMUST00000105217\n-NT_166433\tprotein_coding\tCDS\t32196\t32249\t.\t+\t0\tParent=ENSMUST00000105217\n-NT_166433\tprotein_coding\tthree_prime_UTR\t32250\t32270\t.\t+\t.\tParent=ENSMUST00000105217\n-NT_166433\tprotein_coding\tthree_prime_UTR\t51920\t52512\t.\t+\t.\tParent=ENSMUST00000105217\n-NT_166433\tprotein_coding\texon\t28587\t28798\t.\t+\t.\tParent=ENSMUST00000105217\n-NT_166433\tprotein_coding\texon\t31129\t31299\t.\t+\t.\tParent=ENSMUST00000105217\n-NT_166433\tprotein_coding\texon\t32196\t32270\t.\t+\t.\tParent=ENSMUST00000105217\n-NT_166433\tprotein_coding\texon\t51920\t52512\t.\t+\t.\tParent=ENSMUST00000105217\n-NT_166433\tprotein_coding\tgene\t47745\t52514\t.\t+\t.\tID=ENSMUSG00000078424;Name=AC007307.3\n-NT_166433\tprotein_coding\tmRNA\t47745\t52514\t.\t+\t.\tID=ENSMUST00000105218;Parent=ENSMUSG00000078424;Name=AC007307.3-201\n-NT_166433\tprotein_coding\tfive_prime_UTR\t47745\t47746\t.\t+\t.\tParent=ENSMUST00000105218\n-NT_166433\tprotein_coding\tCDS\t47747\t47845\t.\t+\t0\tParent=ENSMUST00000105218\n-NT_166433\tprotein_coding\tCDS\t50322\t50492\t.\t+\t0\tParent=ENSMUST00000105218\n-NT_166433\tprotein_coding\tCDS\t51351\t51404\t.\t+\t0\tParent=ENSMUST00000105218\n-NT_166433\tprotein_coding\tthree_prime_UTR\t51405\t51425\t.\t+\t.\tParent=ENSMUST00000105218\n-NT_166433\tprotein_coding\tthree_prime_UTR\t51920\t52514\t.\t+\t.\tParent=ENSMUST00000105218\n-NT_166433\tprotein_coding\texon\t47745\t47845\t.\t+\t.\tParent=ENSMUST00000105218\n-NT_166433\tprotein_coding\texon\t50322\t50492\t.\t+\t.\tParent=ENSMUST00000105218\n-NT_166433\tprotein_coding\texon\t51351\t51425\t.\t+\t.\tParent=ENSMUST00000105218\n-NT_166433\tprotein_coding\texon\t51920\t52514\t.\t+\t.\tParent=ENSMUST00000105218\n-NT_166433\tprotein_coding\tmRNA\t47928\t52514\t.\t+\t.\tID=ENSMUST00000105219;Parent=ENSMUSG00000078424;Name=AC007307.3-202\n-NT_166433\tprotein_coding\tfive_prime_UTR\t47928\t47985\t.\t+\t.\tParent=ENSMUST00000105219\n-NT_166433\tprotein_coding\tCDS\t47986\t48129\t.\t+\t0\tParent=ENSMUST00000105219\n-NT_166433\tprotein_coding\tCDS\t50322\t50492\t.\t+\t0\tParent=ENSMUST00000105219\n-NT_166433\tprotein_coding\tCDS\t51351\t51404\t.\t+\t0\tParent=ENSMUST00000105219\n-NT_166433\tprotein_coding\tthree_prime_UTR\t51405\t51425\t.\t+\t.\tParent=ENSMUST00000105219\n-NT_166433\tprotein_coding\tthree_prime_UTR\t51920\t52514\t.\t+\t.\tParent=ENSMUST00000105219\n-NT_166433\tprotein_coding\texon\t47928\t48129\t.\t+\t.\tParent=ENSMUST00000105219\n-NT_166433\tprotein_coding\texon\t50322\t50492\t.\t+\t.\tParent=ENSMUST00000105219\n-NT_166433\tprotein_coding\texon\t51351\t51425\t.\t+\t.\tParent=ENSMUST00000105219\n-NT_166433\tprotein_coding\texon\t51920\t52514\t.\t+\t.\tParent=ENSMUST00000105219\n-NT_166433\tprotein_coding\tgene\t11955\t18898\t.\t+\t.\tID=ENSMUSG00000000702;Name=AC007307.1\n-NT_166433\tprotein_coding\tmRNA\t11955\t18898\t.\t+\t.\tID=ENSMUST00000105216;Parent=ENSMUSG00000000702;Name=AC007307.1-201\n-NT_166433\tprotein_coding\tfive_prime_UTR\t11955\t12025\t.\t+\t.\tParent=ENSMUST00000105216\n-NT_166433\tprotein_coding\tCDS\t12026\t12166\t.\t+\t0\tParent=ENSMUST00000105216\n-NT_166433\tprotein_coding\tCDS\t16677\t16841\t.\t+\t0\tParent=ENSMUST00000105216\n-NT_166433\tprotein_coding\tCDS\t17745\t17789\t.\t+\t0\tParent=ENSMUST00000105216\n-NT_166433\tprotein_coding\tthree_prime_UTR\t17790\t17814\t.\t+\t.\tParent=ENSMUST00000105216\n-NT_166433\tprotein_coding\tthree_prime_UTR\t18309\t18898\t.\t+\t.\tParent=ENSMUST00000105216\n-NT_166433\tprotein_coding\texon\t11955\t12166\t.\t+\t.\tParent=ENSMUST00000105216\n-NT_166433\tprotein_coding\texon\t16677\t16841\t.\t+\t.\tParent=ENSMUST00000105216\n-NT_166433\tprotein_coding\texon\t17745\t17814\t.\t+\t.\tParent=ENSMUST00000105216\n-NT_166433\tprotein_coding\texon\t18309\t18898\t.\t+\t.\tParent=ENSMUST00000105216\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/ENSEMBL_mm9.gtf
--- a/fml_gff_converter_programs/t/ENSEMBL_mm9.gtf Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,3974 +0,0 @@\n-NT_166433\tprotein_coding\texon\t11955\t12166\t.\t+\t.\t gene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "1"; gene_name "AC007307.1"; transcript_name "AC007307.1-201";\n-NT_166433\tprotein_coding\tCDS\t12026\t12166\t.\t+\t0\t gene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "1"; gene_name "AC007307.1"; transcript_name "AC007307.1-201"; protein_id "ENSMUSP00000100851";\n-NT_166433\tprotein_coding\tstart_codon\t12026\t12028\t.\t+\t0\t gene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "1"; gene_name "AC007307.1"; transcript_name "AC007307.1-201";\n-NT_166433\tprotein_coding\texon\t16677\t16841\t.\t+\t.\t gene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "2"; gene_name "AC007307.1"; transcript_name "AC007307.1-201";\n-NT_166433\tprotein_coding\tCDS\t16677\t16841\t.\t+\t0\t gene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "2"; gene_name "AC007307.1"; transcript_name "AC007307.1-201"; protein_id "ENSMUSP00000100851";\n-NT_166433\tprotein_coding\texon\t17745\t17814\t.\t+\t.\t gene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "3"; gene_name "AC007307.1"; transcript_name "AC007307.1-201";\n-NT_166433\tprotein_coding\tCDS\t17745\t17786\t.\t+\t0\t gene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "3"; gene_name "AC007307.1"; transcript_name "AC007307.1-201"; protein_id "ENSMUSP00000100851";\n-NT_166433\tprotein_coding\tstop_codon\t17787\t17789\t.\t+\t0\t gene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "3"; gene_name "AC007307.1"; transcript_name "AC007307.1-201";\n-NT_166433\tprotein_coding\texon\t18309\t18898\t.\t+\t.\t gene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "4"; gene_name "AC007307.1"; transcript_name "AC007307.1-201";\n-NT_166433\tprotein_coding\texon\t28587\t28798\t.\t+\t.\t gene_id "ENSMUSG00000078423"; transcript_id "ENSMUST00000105217"; exon_number "1"; gene_name "AC007307.2"; transcript_name "AC007307.2-201";\n-NT_166433\tprotein_coding\tCDS\t28658\t28798\t.\t+\t0\t gene_id "ENSMUSG00000078423"; transcript_id "ENSMUST00000105217"; exon_number "1"; gene_name "AC007307.2"; transcript_name "AC007307.2-201"; protein_id "ENSMUSP00000100852";\n-NT_166433\tprotein_coding\tstart_codon\t28658\t28660\t.\t+\t0\t gene_id "ENSMUSG00000078423"; transcript_id "ENSMUST00000105217"; exon_number "1"; gene_name "AC007307.2"; transcript_name "AC007307.2-201";\n-NT_166433\tprotein_coding\texon\t31129\t31299\t.\t+\t.\t gene_id "ENSMUSG00000078423"; transcript_id "ENSMUST00000105217"; exon_number "2"; gene_name "AC007307.2"; transcript_name "AC007307.2-201";\n-NT_166433\tprotein_coding\tCDS\t31129\t31299\t.\t+\t0\t gene_id "ENSMUSG00000078423"; transcript_id "ENSMUST00000105217"; exon_number "2"; gene_name "AC007307.2"; transcript_name "AC007307.2-201"; protein_id "ENSMUSP00000100852";\n-NT_166433\tprotein_coding\texon\t32196\t32270\t.\t+\t.\t gene_id "ENSMUSG00000078423"; transcript_id "ENSMUST00000105217"; exon_number "3"; gene_name "AC007307.2"; transcript_name "AC007307.2-201";\n-NT_166433\tprotein_coding\tCDS\t32196\t32246\t.\t+\t0\t gene_id "ENSMUSG00000078423"; transcript_id "ENSMUST00000105217"; exon_number "3"; gene_name "AC007307.2"; transcript_name "AC007307.2-201"; protein_id "ENSMUSP00000100852";\n-NT_166433\tprotein_coding\tstop_codon\t32247\t32249\t.\t+\t0\t gene_id "ENSMUSG00000078423"; transcript_id "ENSMUST00000105217"; exon_number "3"; gene_name "AC007307.2"; transcript_name "AC007307.2-201";\n-NT_166433\tprotein_coding\texon\t51920\t52512\t.\t+\t.\t gene_id "ENSMUSG00000078423"; transcript_id "ENSMUST00000105217"; exon_number "4"; gene_name "AC007307.2"; transcript_name "AC007307.2-201";\n-NT_166433\tprotein_coding\texon\t47745\t47845\t.\t+\t.\t gene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105218"; exon_number "1"; gene_name "AC007307.3"; transcript_name "AC007307.3-201";\n-NT_166433\tprotein_coding\tCDS\t47747\t47845\t.\t+\t0\t gene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105218"; exon_number "1"; g'..b'32"; gene_name "Lama3"; transcript_name "Lama3-202";\n-18\tprotein_coding\tCDS\t12726175\t12726341\t.\t+\t0\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "32"; gene_name "Lama3"; transcript_name "Lama3-202"; protein_id "ENSMUSP00000111524";\n-18\tprotein_coding\texon\t12730409\t12730592\t.\t+\t.\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "33"; gene_name "Lama3"; transcript_name "Lama3-202";\n-18\tprotein_coding\tCDS\t12730409\t12730592\t.\t+\t1\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "33"; gene_name "Lama3"; transcript_name "Lama3-202"; protein_id "ENSMUSP00000111524";\n-18\tprotein_coding\texon\t12732723\t12732863\t.\t+\t.\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "34"; gene_name "Lama3"; transcript_name "Lama3-202";\n-18\tprotein_coding\tCDS\t12732723\t12732863\t.\t+\t0\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "34"; gene_name "Lama3"; transcript_name "Lama3-202"; protein_id "ENSMUSP00000111524";\n-18\tprotein_coding\texon\t12736261\t12736420\t.\t+\t.\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "35"; gene_name "Lama3"; transcript_name "Lama3-202";\n-18\tprotein_coding\tCDS\t12736261\t12736420\t.\t+\t0\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "35"; gene_name "Lama3"; transcript_name "Lama3-202"; protein_id "ENSMUSP00000111524";\n-18\tprotein_coding\texon\t12736523\t12736653\t.\t+\t.\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "36"; gene_name "Lama3"; transcript_name "Lama3-202";\n-18\tprotein_coding\tCDS\t12736523\t12736653\t.\t+\t2\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "36"; gene_name "Lama3"; transcript_name "Lama3-202"; protein_id "ENSMUSP00000111524";\n-18\tprotein_coding\texon\t12738795\t12738888\t.\t+\t.\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "37"; gene_name "Lama3"; transcript_name "Lama3-202";\n-18\tprotein_coding\tCDS\t12738795\t12738888\t.\t+\t0\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "37"; gene_name "Lama3"; transcript_name "Lama3-202"; protein_id "ENSMUSP00000111524";\n-18\tprotein_coding\texon\t12740202\t12740321\t.\t+\t.\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "38"; gene_name "Lama3"; transcript_name "Lama3-202";\n-18\tprotein_coding\tCDS\t12740202\t12740321\t.\t+\t2\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "38"; gene_name "Lama3"; transcript_name "Lama3-202"; protein_id "ENSMUSP00000111524";\n-18\tprotein_coding\texon\t12741027\t12741522\t.\t+\t.\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "39"; gene_name "Lama3"; transcript_name "Lama3-202";\n-18\tprotein_coding\tCDS\t12741027\t12741169\t.\t+\t2\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "39"; gene_name "Lama3"; transcript_name "Lama3-202"; protein_id "ENSMUSP00000111524";\n-18\tprotein_coding\tstop_codon\t12741170\t12741172\t.\t+\t0\t gene_id "ENSMUSG00000024421"; transcript_id "ENSMUST00000115858"; exon_number "39"; gene_name "Lama3"; transcript_name "Lama3-202";\n-18\tprotein_coding\texon\t12657194\t12657637\t.\t-\t.\t gene_id "ENSMUSG00000090309"; transcript_id "ENSMUST00000172267"; exon_number "1"; gene_name "AC102131.1"; transcript_name "AC102131.1-201";\n-18\tprotein_coding\tCDS\t12657197\t12657637\t.\t-\t0\t gene_id "ENSMUSG00000090309"; transcript_id "ENSMUST00000172267"; exon_number "1"; gene_name "AC102131.1"; transcript_name "AC102131.1-201"; protein_id "ENSMUSP00000129942";\n-18\tprotein_coding\tstop_codon\t12657194\t12657196\t.\t-\t0\t gene_id "ENSMUSG00000090309"; transcript_id "ENSMUST00000172267"; exon_number "1"; gene_name "AC102131.1"; transcript_name "AC102131.1-201";\n-18\trRNA\texon\t12736933\t12737046\t.\t-\t.\t gene_id "ENSMUSG00000088342"; transcript_id "ENSMUST00000157717"; exon_number "1"; gene_name "5S_rRNA.42"; transcript_name "5S_rRNA.42-201";\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/ENSEMBL_mm9_gff3_to_gtf.gtf
--- a/fml_gff_converter_programs/t/ENSEMBL_mm9_gff3_to_gtf.gtf Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,3709 +0,0 @@\n-##gff-version 2.5\n-18\ttranscript\texon\t3336414\t3337176\t.\t+\t0\tgene_id "ENSMUSG00000091488"; transcript_id "ENSMUST00000171726"; exon_number "1";\n-18\ttranscript\texon\t3365925\t3366861\t.\t+\t0\tgene_id "ENSMUSG00000091488"; transcript_id "ENSMUST00000171726"; exon_number "2";\n-18\tprotein_coding\tCDS\t9316554\t9316670\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "1"; protein_id "ENSMUST00000053917";\n-18\tprotein_coding\tCDS\t9319407\t9319569\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "2"; protein_id "ENSMUST00000053917";\n-18\tprotein_coding\tCDS\t9332782\t9332948\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "3"; protein_id "ENSMUST00000053917";\n-18\tprotein_coding\tCDS\t9345192\t9345311\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "4"; protein_id "ENSMUST00000053917";\n-18\tprotein_coding\tCDS\t9345412\t9345469\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "5"; protein_id "ENSMUST00000053917";\n-18\tprotein_coding\tCDS\t9349386\t9349421\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "6"; protein_id "ENSMUST00000053917";\n-18\tprotein_coding\tCDS\t9353405\t9353505\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "7"; protein_id "ENSMUST00000053917";\n-18\tprotein_coding\tCDS\t9377792\t9377826\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "8"; protein_id "ENSMUST00000053917";\n-18\tprotein_coding\tCDS\t9386733\t9386807\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "9"; protein_id "ENSMUST00000053917";\n-18\tprotein_coding\tCDS\t9449516\t9449669\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "10"; protein_id "ENSMUST00000053917";\n-18\tprotein_coding\texon\t9314042\t9316670\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "1";\n-18\tprotein_coding\texon\t9319407\t9319569\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "2";\n-18\tprotein_coding\texon\t9332782\t9332948\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "3";\n-18\tprotein_coding\texon\t9345192\t9345311\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "4";\n-18\tprotein_coding\texon\t9345412\t9345469\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "5";\n-18\tprotein_coding\texon\t9349386\t9349421\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "6";\n-18\tprotein_coding\texon\t9353405\t9353505\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "7";\n-18\tprotein_coding\texon\t9377792\t9377826\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "8";\n-18\tprotein_coding\texon\t9386733\t9386807\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "9";\n-18\tprotein_coding\texon\t9449516\t9450148\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000053917"; exon_number "10";\n-18\tprotein_coding\tCDS\t9316554\t9316670\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000115867"; exon_number "1"; protein_id "ENSMUST00000115867";\n-18\tprotein_coding\tCDS\t9319407\t9319569\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000115867"; exon_number "2"; protein_id "ENSMUST00000115867";\n-18\tprotein_coding\tCDS\t9332782\t9332948\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000115867"; exon_number "3"; protein_id "ENSMUST00000115867";\n-18\tprotein_coding\tCDS\t9345192\t9345311\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000115867"; exon_number "4"; protein_id "ENSMUST00000115867";\n-18\tprotein_coding\tCDS\t9345412\t9345469\t.\t-\t0\tgene_id "ENSMUSG00000024286"; transcript_id "ENSMUST00000115867"; exon_number "5"; protein_id "ENSMUST00000115867";\n-18\t'..b'nscript_id "ENSMUST00000105217"; exon_number "1"; protein_id "ENSMUST00000105217";\n-NT_166433\tprotein_coding\tCDS\t31129\t31299\t.\t+\t0\tgene_id "ENSMUSG00000078423"; transcript_id "ENSMUST00000105217"; exon_number "2"; protein_id "ENSMUST00000105217";\n-NT_166433\tprotein_coding\tCDS\t32196\t32249\t.\t+\t0\tgene_id "ENSMUSG00000078423"; transcript_id "ENSMUST00000105217"; exon_number "3"; protein_id "ENSMUST00000105217";\n-NT_166433\tprotein_coding\texon\t28587\t28798\t.\t+\t0\tgene_id "ENSMUSG00000078423"; transcript_id "ENSMUST00000105217"; exon_number "1";\n-NT_166433\tprotein_coding\texon\t31129\t31299\t.\t+\t0\tgene_id "ENSMUSG00000078423"; transcript_id "ENSMUST00000105217"; exon_number "2";\n-NT_166433\tprotein_coding\texon\t32196\t32270\t.\t+\t0\tgene_id "ENSMUSG00000078423"; transcript_id "ENSMUST00000105217"; exon_number "3";\n-NT_166433\tprotein_coding\texon\t51920\t52512\t.\t+\t0\tgene_id "ENSMUSG00000078423"; transcript_id "ENSMUST00000105217"; exon_number "4";\n-NT_166433\tprotein_coding\tCDS\t47747\t47845\t.\t+\t0\tgene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105218"; exon_number "1"; protein_id "ENSMUST00000105218";\n-NT_166433\tprotein_coding\tCDS\t50322\t50492\t.\t+\t0\tgene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105218"; exon_number "2"; protein_id "ENSMUST00000105218";\n-NT_166433\tprotein_coding\tCDS\t51351\t51404\t.\t+\t0\tgene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105218"; exon_number "3"; protein_id "ENSMUST00000105218";\n-NT_166433\tprotein_coding\texon\t47745\t47845\t.\t+\t0\tgene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105218"; exon_number "1";\n-NT_166433\tprotein_coding\texon\t50322\t50492\t.\t+\t0\tgene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105218"; exon_number "2";\n-NT_166433\tprotein_coding\texon\t51351\t51425\t.\t+\t0\tgene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105218"; exon_number "3";\n-NT_166433\tprotein_coding\texon\t51920\t52514\t.\t+\t0\tgene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105218"; exon_number "4";\n-NT_166433\tprotein_coding\tCDS\t47986\t48129\t.\t+\t0\tgene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105219"; exon_number "1"; protein_id "ENSMUST00000105219";\n-NT_166433\tprotein_coding\tCDS\t50322\t50492\t.\t+\t0\tgene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105219"; exon_number "2"; protein_id "ENSMUST00000105219";\n-NT_166433\tprotein_coding\tCDS\t51351\t51404\t.\t+\t0\tgene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105219"; exon_number "3"; protein_id "ENSMUST00000105219";\n-NT_166433\tprotein_coding\texon\t47928\t48129\t.\t+\t0\tgene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105219"; exon_number "1";\n-NT_166433\tprotein_coding\texon\t50322\t50492\t.\t+\t0\tgene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105219"; exon_number "2";\n-NT_166433\tprotein_coding\texon\t51351\t51425\t.\t+\t0\tgene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105219"; exon_number "3";\n-NT_166433\tprotein_coding\texon\t51920\t52514\t.\t+\t0\tgene_id "ENSMUSG00000078424"; transcript_id "ENSMUST00000105219"; exon_number "4";\n-NT_166433\tprotein_coding\tCDS\t12026\t12166\t.\t+\t0\tgene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "1"; protein_id "ENSMUST00000105216";\n-NT_166433\tprotein_coding\tCDS\t16677\t16841\t.\t+\t0\tgene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "2"; protein_id "ENSMUST00000105216";\n-NT_166433\tprotein_coding\tCDS\t17745\t17789\t.\t+\t0\tgene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "3"; protein_id "ENSMUST00000105216";\n-NT_166433\tprotein_coding\texon\t11955\t12166\t.\t+\t0\tgene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "1";\n-NT_166433\tprotein_coding\texon\t16677\t16841\t.\t+\t0\tgene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "2";\n-NT_166433\tprotein_coding\texon\t17745\t17814\t.\t+\t0\tgene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "3";\n-NT_166433\tprotein_coding\texon\t18309\t18898\t.\t+\t0\tgene_id "ENSMUSG00000000702"; transcript_id "ENSMUST00000105216"; exon_number "4";\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/JGI_genes.gff3
--- a/fml_gff_converter_programs/t/JGI_genes.gff3 Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,4626 +0,0 @@\n-##gff-version 3\n-scaffold_1\tJGI\tgene\t1237411\t1237623\t.\t+\t.\tID=e_gw2.1.2098.1;Name=e_gw2.1.2098.1\n-scaffold_1\tJGI\tmRNA\t1237411\t1237623\t.\t+\t.\tID=1027156;Parent=e_gw2.1.2098.1\n-scaffold_1\tJGI\tCDS\t1237411\t1237456\t.\t+\t0\tParent=1027156\n-scaffold_1\tJGI\tCDS\t1237502\t1237623\t.\t+\t2\tParent=1027156\n-scaffold_1\tJGI\texon\t1237411\t1237456\t.\t+\t.\tParent=1027156\n-scaffold_1\tJGI\texon\t1237502\t1237623\t.\t+\t.\tParent=1027156\n-scaffold_1\tJGI\tgene\t5902548\t5912971\t.\t+\t.\tID=estExt_Genewise2Plus.C_10864;Name=estExt_Genewise2Plus.C_10864\n-scaffold_1\tJGI\tmRNA\t5902548\t5912971\t.\t+\t.\tID=1045793;Parent=estExt_Genewise2Plus.C_10864\n-scaffold_1\tJGI\tCDS\t5902548\t5902888\t.\t+\t0\tParent=1045793\n-scaffold_1\tJGI\tCDS\t5902966\t5909048\t.\t+\t1\tParent=1045793\n-scaffold_1\tJGI\tCDS\t5909123\t5910952\t.\t+\t2\tParent=1045793\n-scaffold_1\tJGI\tCDS\t5911034\t5911827\t.\t+\t2\tParent=1045793\n-scaffold_1\tJGI\tCDS\t5911903\t5912205\t.\t+\t0\tParent=1045793\n-scaffold_1\tJGI\tCDS\t5912269\t5912760\t.\t+\t0\tParent=1045793\n-scaffold_1\tJGI\tthree_prime_UTR\t5912761\t5912971\t.\t+\t.\tParent=1045793\n-scaffold_1\tJGI\texon\t5902548\t5902888\t.\t+\t.\tParent=1045793\n-scaffold_1\tJGI\texon\t5902966\t5909048\t.\t+\t.\tParent=1045793\n-scaffold_1\tJGI\texon\t5909123\t5910952\t.\t+\t.\tParent=1045793\n-scaffold_1\tJGI\texon\t5911034\t5911827\t.\t+\t.\tParent=1045793\n-scaffold_1\tJGI\texon\t5911903\t5912205\t.\t+\t.\tParent=1045793\n-scaffold_1\tJGI\texon\t5912269\t5912971\t.\t+\t.\tParent=1045793\n-scaffold_1\tJGI\tgene\t5701930\t5702697\t.\t-\t.\tID=fgenesh2_pg.1_#_81;Name=fgenesh2_pg.1_#_81\n-scaffold_1\tJGI\tmRNA\t5701930\t5702697\t.\t-\t.\tID=1066497;Parent=fgenesh2_pg.1_#_81\n-scaffold_1\tJGI\tCDS\t5701930\t5702605\t.\t-\t1\tParent=1066497\n-scaffold_1\tJGI\tCDS\t5702660\t5702697\t.\t-\t0\tParent=1066497\n-scaffold_1\tJGI\texon\t5701930\t5702605\t.\t-\t.\tParent=1066497\n-scaffold_1\tJGI\texon\t5702660\t5702697\t.\t-\t.\tParent=1066497\n-scaffold_1\tJGI\tgene\t6192379\t6193551\t.\t-\t.\tID=estExt_Genewise2Plus.C_10944;Name=estExt_Genewise2Plus.C_10944\n-scaffold_1\tJGI\tmRNA\t6192379\t6193551\t.\t-\t.\tID=1045858;Parent=estExt_Genewise2Plus.C_10944\n-scaffold_1\tJGI\tfive_prime_UTR\t6193484\t6193551\t.\t-\t.\tParent=1045858\n-scaffold_1\tJGI\tCDS\t6192379\t6192999\t.\t-\t0\tParent=1045858\n-scaffold_1\tJGI\tCDS\t6193076\t6193367\t.\t-\t1\tParent=1045858\n-scaffold_1\tJGI\tCDS\t6193440\t6193483\t.\t-\t0\tParent=1045858\n-scaffold_1\tJGI\texon\t6192379\t6192999\t.\t-\t.\tParent=1045858\n-scaffold_1\tJGI\texon\t6193076\t6193367\t.\t-\t.\tParent=1045858\n-scaffold_1\tJGI\texon\t6193440\t6193551\t.\t-\t.\tParent=1045858\n-scaffold_1\tJGI\tgene\t5709177\t5710596\t.\t-\t.\tID=fgenesh2_pg.1_#_84;Name=fgenesh2_pg.1_#_84\n-scaffold_1\tJGI\tmRNA\t5709177\t5710596\t.\t-\t.\tID=1066500;Parent=fgenesh2_pg.1_#_84\n-scaffold_1\tJGI\tCDS\t5709177\t5709216\t.\t-\t1\tParent=1066500\n-scaffold_1\tJGI\tCDS\t5709320\t5710164\t.\t-\t0\tParent=1066500\n-scaffold_1\tJGI\tCDS\t5710228\t5710596\t.\t-\t0\tParent=1066500\n-scaffold_1\tJGI\texon\t5709177\t5709216\t.\t-\t.\tParent=1066500\n-scaffold_1\tJGI\texon\t5709320\t5710164\t.\t-\t.\tParent=1066500\n-scaffold_1\tJGI\texon\t5710228\t5710596\t.\t-\t.\tParent=1066500\n-scaffold_1\tJGI\tgene\t3582929\t3583102\t.\t+\t.\tID=e_gw2.1.2720.1;Name=e_gw2.1.2720.1\n-scaffold_1\tJGI\tmRNA\t3582929\t3583102\t.\t+\t.\tID=1026247;Parent=e_gw2.1.2720.1\n-scaffold_1\tJGI\tCDS\t3582929\t3583102\t.\t+\t0\tParent=1026247\n-scaffold_1\tJGI\texon\t3582929\t3583102\t.\t+\t.\tParent=1026247\n-scaffold_1\tJGI\tgene\t5061339\t5072066\t.\t-\t.\tID=gm1.335_g;Name=gm1.335_g\n-scaffold_1\tJGI\tmRNA\t5061339\t5072066\t.\t-\t.\tID=204986;Parent=gm1.335_g\n-scaffold_1\tJGI\tCDS\t5061339\t5061410\t.\t-\t0\tParent=204986\n-scaffold_1\tJGI\tCDS\t5070743\t5070801\t.\t-\t2\tParent=204986\n-scaffold_1\tJGI\tCDS\t5071914\t5071951\t.\t-\t1\tParent=204986\n-scaffold_1\tJGI\tCDS\t5072047\t5072066\t.\t-\t0\tParent=204986\n-scaffold_1\tJGI\texon\t5061339\t5061410\t.\t-\t.\tParent=204986\n-scaffold_1\tJGI\texon\t5070743\t5070801\t.\t-\t.\tParent=204986\n-scaffold_1\tJGI\texon\t5071914\t5071951\t.\t-\t.\tParent=204986\n-scaffold_1\tJGI\texon\t5072047\t5072066\t.\t-\t.\tParent=204986\n-scaffold_1\tJGI\tgene\t6220485\t6222489\t.\t+\t.\tID=gm1.515_g;Name=gm1.515_g\n-scaffold_1\tJGI\tmRNA\t6220485\t6222489\t.\t+\t.\tID=205166;Parent=gm1.515_g\n-scaffold_1\tJGI\tCDS\t6220485\t6220508\t.\t+\t0\tParent=205166\n-scaffold_1\tJGI\tCDS\t6220551\t622'..b'Genemark2.C_10474\n-scaffold_1\tJGI\tmRNA\t6068472\t6069874\t.\t-\t.\tID=1090684;Parent=estExt_Genemark2.C_10474\n-scaffold_1\tJGI\tCDS\t6068473\t6068941\t.\t-\t1\tParent=1090684\n-scaffold_1\tJGI\tCDS\t6068976\t6069014\t.\t-\t1\tParent=1090684\n-scaffold_1\tJGI\tCDS\t6069080\t6069105\t.\t-\t0\tParent=1090684\n-scaffold_1\tJGI\tCDS\t6069143\t6069420\t.\t-\t2\tParent=1090684\n-scaffold_1\tJGI\tCDS\t6069507\t6069611\t.\t-\t2\tParent=1090684\n-scaffold_1\tJGI\tCDS\t6069662\t6069669\t.\t-\t1\tParent=1090684\n-scaffold_1\tJGI\tCDS\t6069778\t6069797\t.\t-\t0\tParent=1090684\n-scaffold_1\tJGI\tCDS\t6069836\t6069874\t.\t-\t0\tParent=1090684\n-scaffold_1\tJGI\tthree_prime_UTR\t6068472\t6068472\t.\t-\t.\tParent=1090684\n-scaffold_1\tJGI\texon\t6068472\t6068941\t.\t-\t.\tParent=1090684\n-scaffold_1\tJGI\texon\t6068976\t6069014\t.\t-\t.\tParent=1090684\n-scaffold_1\tJGI\texon\t6069080\t6069105\t.\t-\t.\tParent=1090684\n-scaffold_1\tJGI\texon\t6069143\t6069420\t.\t-\t.\tParent=1090684\n-scaffold_1\tJGI\texon\t6069507\t6069611\t.\t-\t.\tParent=1090684\n-scaffold_1\tJGI\texon\t6069662\t6069669\t.\t-\t.\tParent=1090684\n-scaffold_1\tJGI\texon\t6069778\t6069797\t.\t-\t.\tParent=1090684\n-scaffold_1\tJGI\texon\t6069836\t6069874\t.\t-\t.\tParent=1090684\n-scaffold_1\tJGI\tgene\t6222830\t6223956\t.\t-\t.\tID=fgenesh2_kg.1_#_199_#_Contig10010;Name=fgenesh2_kg.1_#_199_#_Contig10010\n-scaffold_1\tJGI\tmRNA\t6222830\t6223956\t.\t-\t.\tID=1059305;Parent=fgenesh2_kg.1_#_199_#_Contig10010\n-scaffold_1\tJGI\tfive_prime_UTR\t6223850\t6223956\t.\t-\t.\tParent=1059305\n-scaffold_1\tJGI\tCDS\t6223123\t6223377\t.\t-\t0\tParent=1059305\n-scaffold_1\tJGI\tCDS\t6223466\t6223513\t.\t-\t0\tParent=1059305\n-scaffold_1\tJGI\tCDS\t6223599\t6223636\t.\t-\t2\tParent=1059305\n-scaffold_1\tJGI\tCDS\t6223717\t6223849\t.\t-\t0\tParent=1059305\n-scaffold_1\tJGI\tthree_prime_UTR\t6222830\t6223122\t.\t-\t.\tParent=1059305\n-scaffold_1\tJGI\texon\t6222830\t6223377\t.\t-\t.\tParent=1059305\n-scaffold_1\tJGI\texon\t6223466\t6223513\t.\t-\t.\tParent=1059305\n-scaffold_1\tJGI\texon\t6223599\t6223636\t.\t-\t.\tParent=1059305\n-scaffold_1\tJGI\texon\t6223717\t6223956\t.\t-\t.\tParent=1059305\n-scaffold_1\tJGI\tgene\t6194330\t6196070\t.\t+\t.\tID=fgenesh2_kg.1_#_188_#_Contig951;Name=fgenesh2_kg.1_#_188_#_Contig951\n-scaffold_1\tJGI\tmRNA\t6194330\t6196070\t.\t+\t.\tID=1059294;Parent=fgenesh2_kg.1_#_188_#_Contig951\n-scaffold_1\tJGI\tfive_prime_UTR\t6194330\t6194523\t.\t+\t.\tParent=1059294\n-scaffold_1\tJGI\tCDS\t6194524\t6194757\t.\t+\t0\tParent=1059294\n-scaffold_1\tJGI\tCDS\t6194844\t6195164\t.\t+\t0\tParent=1059294\n-scaffold_1\tJGI\tthree_prime_UTR\t6195165\t6195696\t.\t+\t.\tParent=1059294\n-scaffold_1\tJGI\tthree_prime_UTR\t6195774\t6196070\t.\t+\t.\tParent=1059294\n-scaffold_1\tJGI\texon\t6194330\t6194757\t.\t+\t.\tParent=1059294\n-scaffold_1\tJGI\texon\t6194844\t6195696\t.\t+\t.\tParent=1059294\n-scaffold_1\tJGI\texon\t6195774\t6196070\t.\t+\t.\tParent=1059294\n-scaffold_1\tJGI\tgene\t4208772\t4210540\t.\t-\t.\tID=gm1.273_g;Name=gm1.273_g\n-scaffold_1\tJGI\tmRNA\t4208772\t4210540\t.\t-\t.\tID=204924;Parent=gm1.273_g\n-scaffold_1\tJGI\tCDS\t4208772\t4208899\t.\t-\t2\tParent=204924\n-scaffold_1\tJGI\tCDS\t4208964\t4209686\t.\t-\t2\tParent=204924\n-scaffold_1\tJGI\tCDS\t4209714\t4209734\t.\t-\t2\tParent=204924\n-scaffold_1\tJGI\tCDS\t4209782\t4210451\t.\t-\t0\tParent=204924\n-scaffold_1\tJGI\tCDS\t4210511\t4210540\t.\t-\t0\tParent=204924\n-scaffold_1\tJGI\texon\t4208772\t4208899\t.\t-\t.\tParent=204924\n-scaffold_1\tJGI\texon\t4208964\t4209686\t.\t-\t.\tParent=204924\n-scaffold_1\tJGI\texon\t4209714\t4209734\t.\t-\t.\tParent=204924\n-scaffold_1\tJGI\texon\t4209782\t4210451\t.\t-\t.\tParent=204924\n-scaffold_1\tJGI\texon\t4210511\t4210540\t.\t-\t.\tParent=204924\n-scaffold_1\tJGI\tgene\t5585408\t5590273\t.\t+\t.\tID=estExt_fgenesh2_pm.C_10049;Name=estExt_fgenesh2_pm.C_10049\n-scaffold_1\tJGI\tmRNA\t5585408\t5590273\t.\t+\t.\tID=1073299;Parent=estExt_fgenesh2_pm.C_10049\n-scaffold_1\tJGI\tfive_prime_UTR\t5585408\t5585504\t.\t+\t.\tParent=1073299\n-scaffold_1\tJGI\tCDS\t5585505\t5585741\t.\t+\t0\tParent=1073299\n-scaffold_1\tJGI\tCDS\t5585815\t5586858\t.\t+\t0\tParent=1073299\n-scaffold_1\tJGI\tthree_prime_UTR\t5586859\t5587647\t.\t+\t.\tParent=1073299\n-scaffold_1\tJGI\tthree_prime_UTR\t5587746\t5590273\t.\t+\t.\tParent=1073299\n-scaffold_1\tJGI\texon\t5585408\t5585741\t.\t+\t.\tParent=1073299\n-scaffold_1\tJGI\texon\t5585815\t5587647\t.\t+\t.\tParent=1073299\n-scaffold_1\tJGI\texon\t5587746\t5590273\t.\t+\t.\tParent=1073299\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/JGI_genes.gtf
--- a/fml_gff_converter_programs/t/JGI_genes.gtf Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,3994 +0,0 @@\n-scaffold_1\tJGI\texon\t7\t12\t.\t+\t.\tname "fgenesh2_pg.1_#_1"; transcriptId 1066417\n-scaffold_1\tJGI\tCDS\t7\t12\t.\t+\t0\tname "fgenesh2_pg.1_#_1"; proteinId 1066417; exonNumber 1\n-scaffold_1\tJGI\tstart_codon\t7\t9\t.\t+\t0\tname "fgenesh2_pg.1_#_1"\n-scaffold_1\tJGI\texon\t363\t902\t.\t+\t.\tname "fgenesh2_pg.1_#_1"; transcriptId 1066417\n-scaffold_1\tJGI\tCDS\t363\t902\t.\t+\t0\tname "fgenesh2_pg.1_#_1"; proteinId 1066417; exonNumber 2\n-scaffold_1\tJGI\texon\t954\t1160\t.\t+\t.\tname "fgenesh2_pg.1_#_1"; transcriptId 1066417\n-scaffold_1\tJGI\tCDS\t954\t1160\t.\t+\t0\tname "fgenesh2_pg.1_#_1"; proteinId 1066417; exonNumber 3\n-scaffold_1\tJGI\tstop_codon\t1158\t1160\t.\t+\t0\tname "fgenesh2_pg.1_#_1"\n-scaffold_1\tJGI\texon\t17310\t18075\t.\t-\t.\tname "estExt_Genewise2Plus.C_10002"; transcriptId 1045566\n-scaffold_1\tJGI\tCDS\t17597\t18075\t.\t-\t2\tname "estExt_Genewise2Plus.C_10002"; proteinId 1045566; exonNumber 3\n-scaffold_1\tJGI\tstop_codon\t17597\t17599\t.\t-\t0\tname "estExt_Genewise2Plus.C_10002"\n-scaffold_1\tJGI\texon\t18135\t18268\t.\t-\t.\tname "estExt_Genewise2Plus.C_10002"; transcriptId 1045566\n-scaffold_1\tJGI\tCDS\t18135\t18268\t.\t-\t1\tname "estExt_Genewise2Plus.C_10002"; proteinId 1045566; exonNumber 2\n-scaffold_1\tJGI\texon\t18353\t19188\t.\t-\t.\tname "estExt_Genewise2Plus.C_10002"; transcriptId 1045566\n-scaffold_1\tJGI\tCDS\t18353\t19188\t.\t-\t0\tname "estExt_Genewise2Plus.C_10002"; proteinId 1045566; exonNumber 1\n-scaffold_1\tJGI\tstart_codon\t19186\t19188\t.\t-\t0\tname "estExt_Genewise2Plus.C_10002"\n-scaffold_1\tJGI\texon\t29168\t29186\t.\t-\t.\tname "estExt_Genemark2.C_10004"; transcriptId 1090574\n-scaffold_1\tJGI\texon\t31979\t32005\t.\t-\t.\tname "estExt_Genemark2.C_10004"; transcriptId 1090574\n-scaffold_1\tJGI\texon\t32085\t32211\t.\t-\t.\tname "estExt_Genemark2.C_10004"; transcriptId 1090574\n-scaffold_1\tJGI\tCDS\t32170\t32211\t.\t-\t0\tname "estExt_Genemark2.C_10004"; proteinId 1090574; exonNumber 3\n-scaffold_1\tJGI\tstop_codon\t32170\t32172\t.\t-\t0\tname "estExt_Genemark2.C_10004"\n-scaffold_1\tJGI\texon\t32249\t32298\t.\t-\t.\tname "estExt_Genemark2.C_10004"; transcriptId 1090574\n-scaffold_1\tJGI\tCDS\t32249\t32298\t.\t-\t2\tname "estExt_Genemark2.C_10004"; proteinId 1090574; exonNumber 2\n-scaffold_1\tJGI\texon\t39594\t39912\t.\t-\t.\tname "estExt_Genemark2.C_10004"; transcriptId 1090574\n-scaffold_1\tJGI\tCDS\t39594\t39912\t.\t-\t0\tname "estExt_Genemark2.C_10004"; proteinId 1090574; exonNumber 1\n-scaffold_1\tJGI\texon\t48024\t49977\t.\t-\t.\tname "fgenesh2_kg.1_#_3_#_Contig2893"; transcriptId 1059109\n-scaffold_1\tJGI\tCDS\t48459\t49977\t.\t-\t1\tname "fgenesh2_kg.1_#_3_#_Contig2893"; proteinId 1059109; exonNumber 4\n-scaffold_1\tJGI\tstop_codon\t48459\t48461\t.\t-\t0\tname "fgenesh2_kg.1_#_3_#_Contig2893"\n-scaffold_1\tJGI\texon\t50732\t50830\t.\t-\t.\tname "fgenesh2_kg.1_#_3_#_Contig2893"; transcriptId 1059109\n-scaffold_1\tJGI\tCDS\t50732\t50830\t.\t-\t1\tname "fgenesh2_kg.1_#_3_#_Contig2893"; proteinId 1059109; exonNumber 3\n-scaffold_1\tJGI\texon\t50924\t51099\t.\t-\t.\tname "fgenesh2_kg.1_#_3_#_Contig2893"; transcriptId 1059109\n-scaffold_1\tJGI\tCDS\t50924\t51099\t.\t-\t0\tname "fgenesh2_kg.1_#_3_#_Contig2893"; proteinId 1059109; exonNumber 2\n-scaffold_1\tJGI\texon\t51227\t51548\t.\t-\t.\tname "fgenesh2_kg.1_#_3_#_Contig2893"; transcriptId 1059109\n-scaffold_1\tJGI\tCDS\t51227\t51403\t.\t-\t0\tname "fgenesh2_kg.1_#_3_#_Contig2893"; proteinId 1059109; exonNumber 1\n-scaffold_1\tJGI\tstart_codon\t51401\t51403\t.\t-\t0\tname "fgenesh2_kg.1_#_3_#_Contig2893"\n-scaffold_1\tJGI\texon\t62584\t62802\t.\t+\t.\tname "e_gw2.1.1272.1"; transcriptId 1026707\n-scaffold_1\tJGI\tCDS\t62584\t62802\t.\t+\t0\tname "e_gw2.1.1272.1"; proteinId 1026707; exonNumber 1\n-scaffold_1\tJGI\tstop_codon\t62800\t62802\t.\t+\t0\tname "e_gw2.1.1272.1"\n-scaffold_1\tJGI\texon\t82115\t82306\t.\t-\t.\tname "e_gw2.1.1271.1"; transcriptId 1026712\n-scaffold_1\tJGI\tCDS\t82115\t82306\t.\t-\t0\tname "e_gw2.1.1271.1"; proteinId 1026712; exonNumber 1\n-scaffold_1\tJGI\tstop_codon\t82115\t82117\t.\t-\t0\tname "e_gw2.1.1271.1"\n-scaffold_1\tJGI\texon\t120058\t120311\t.\t-\t.\tname "fgenesh2_kg.1_#_6_#_Contig4211"; transcriptId 1059112\n-scaffold_1\tJGI\tCDS\t120203\t120311\t.\t-\t1\tname "fgenesh2_kg.1_#_6_#_Contig4211"; proteinId 1059112; exonNumber 5\n-scaffol'..b'n\t6530335\t6530597\t.\t-\t.\tname "estExt_fgenesh2_pg.C_10280"; transcriptId 1078583\n-scaffold_1\tJGI\tCDS\t6530335\t6530597\t.\t-\t1\tname "estExt_fgenesh2_pg.C_10280"; proteinId 1078583; exonNumber 2\n-scaffold_1\tJGI\texon\t6530713\t6530971\t.\t-\t.\tname "estExt_fgenesh2_pg.C_10280"; transcriptId 1078583\n-scaffold_1\tJGI\tCDS\t6530713\t6530909\t.\t-\t0\tname "estExt_fgenesh2_pg.C_10280"; proteinId 1078583; exonNumber 1\n-scaffold_1\tJGI\tstart_codon\t6530907\t6530909\t.\t-\t0\tname "estExt_fgenesh2_pg.C_10280"\n-scaffold_1\tJGI\texon\t6530961\t6531797\t.\t-\t.\tname "estExt_fgenesh2_pg.C_10281"; transcriptId 1078584\n-scaffold_1\tJGI\tCDS\t6531507\t6531797\t.\t-\t0\tname "estExt_fgenesh2_pg.C_10281"; proteinId 1078584; exonNumber 2\n-scaffold_1\tJGI\tstop_codon\t6531507\t6531509\t.\t-\t0\tname "estExt_fgenesh2_pg.C_10281"\n-scaffold_1\tJGI\texon\t6531869\t6532149\t.\t-\t.\tname "estExt_fgenesh2_pg.C_10281"; transcriptId 1078584\n-scaffold_1\tJGI\tCDS\t6531869\t6532027\t.\t-\t0\tname "estExt_fgenesh2_pg.C_10281"; proteinId 1078584; exonNumber 1\n-scaffold_1\tJGI\tstart_codon\t6532025\t6532027\t.\t-\t0\tname "estExt_fgenesh2_pg.C_10281"\n-scaffold_1\tJGI\texon\t6532672\t6535468\t.\t+\t.\tname "estExt_Genemark2.C_10601"; transcriptId 1090763\n-scaffold_1\tJGI\tCDS\t6532672\t6535468\t.\t+\t0\tname "estExt_Genemark2.C_10601"; proteinId 1090763; exonNumber 1\n-scaffold_1\tJGI\tstart_codon\t6532672\t6532674\t.\t+\t0\tname "estExt_Genemark2.C_10601"\n-scaffold_1\tJGI\texon\t6535501\t6536357\t.\t+\t.\tname "estExt_Genemark2.C_10601"; transcriptId 1090763\n-scaffold_1\tJGI\tCDS\t6535501\t6536357\t.\t+\t2\tname "estExt_Genemark2.C_10601"; proteinId 1090763; exonNumber 2\n-scaffold_1\tJGI\texon\t6536382\t6536530\t.\t+\t.\tname "estExt_Genemark2.C_10601"; transcriptId 1090763\n-scaffold_1\tJGI\tCDS\t6536382\t6536530\t.\t+\t0\tname "estExt_Genemark2.C_10601"; proteinId 1090763; exonNumber 3\n-scaffold_1\tJGI\texon\t6536577\t6536589\t.\t+\t.\tname "estExt_Genemark2.C_10601"; transcriptId 1090763\n-scaffold_1\tJGI\tCDS\t6536577\t6536589\t.\t+\t1\tname "estExt_Genemark2.C_10601"; proteinId 1090763; exonNumber 4\n-scaffold_1\tJGI\tstop_codon\t6536587\t6536589\t.\t+\t0\tname "estExt_Genemark2.C_10601"\n-scaffold_1\tJGI\texon\t6536649\t6537413\t.\t-\t.\tname "fgenesh2_kg.1_#_289_#_Contig6235"; transcriptId 1059395\n-scaffold_1\tJGI\tCDS\t6536793\t6537413\t.\t-\t0\tname "fgenesh2_kg.1_#_289_#_Contig6235"; proteinId 1059395; exonNumber 2\n-scaffold_1\tJGI\tstop_codon\t6536793\t6536795\t.\t-\t0\tname "fgenesh2_kg.1_#_289_#_Contig6235"\n-scaffold_1\tJGI\texon\t6537494\t6537976\t.\t-\t.\tname "fgenesh2_kg.1_#_289_#_Contig6235"; transcriptId 1059395\n-scaffold_1\tJGI\tCDS\t6537494\t6537910\t.\t-\t0\tname "fgenesh2_kg.1_#_289_#_Contig6235"; proteinId 1059395; exonNumber 1\n-scaffold_1\tJGI\tstart_codon\t6537908\t6537910\t.\t-\t0\tname "fgenesh2_kg.1_#_289_#_Contig6235"\n-scaffold_1\tJGI\texon\t6537938\t6537972\t.\t+\t.\tname "gm1.603_g"; transcriptId 205254\n-scaffold_1\tJGI\tCDS\t6537938\t6537972\t.\t+\t0\tname "gm1.603_g"; proteinId 205254; exonNumber 1\n-scaffold_1\tJGI\tstart_codon\t6537938\t6537940\t.\t+\t0\tname "gm1.603_g"\n-scaffold_1\tJGI\texon\t6538014\t6538046\t.\t+\t.\tname "gm1.603_g"; transcriptId 205254\n-scaffold_1\tJGI\tCDS\t6538014\t6538046\t.\t+\t1\tname "gm1.603_g"; proteinId 205254; exonNumber 2\n-scaffold_1\tJGI\texon\t6538107\t6538156\t.\t+\t.\tname "gm1.603_g"; transcriptId 205254\n-scaffold_1\tJGI\tCDS\t6538107\t6538156\t.\t+\t1\tname "gm1.603_g"; proteinId 205254; exonNumber 3\n-scaffold_1\tJGI\texon\t6538201\t6538314\t.\t+\t.\tname "gm1.603_g"; transcriptId 205254\n-scaffold_1\tJGI\tCDS\t6538201\t6538314\t.\t+\t2\tname "gm1.603_g"; proteinId 205254; exonNumber 4\n-scaffold_1\tJGI\texon\t6538367\t6538787\t.\t+\t.\tname "gm1.603_g"; transcriptId 205254\n-scaffold_1\tJGI\tCDS\t6538367\t6538787\t.\t+\t2\tname "gm1.603_g"; proteinId 205254; exonNumber 5\n-scaffold_1\tJGI\texon\t6538843\t6539173\t.\t+\t.\tname "gm1.603_g"; transcriptId 205254\n-scaffold_1\tJGI\tCDS\t6538843\t6539173\t.\t+\t1\tname "gm1.603_g"; proteinId 205254; exonNumber 6\n-scaffold_1\tJGI\texon\t6539217\t6539489\t.\t+\t.\tname "gm1.603_g"; transcriptId 205254\n-scaffold_1\tJGI\tCDS\t6539217\t6539489\t.\t+\t0\tname "gm1.603_g"; proteinId 205254; exonNumber 7\n-scaffold_1\tJGI\tstop_codon\t6539487\t6539489\t.\t+\t0\tname "gm1.603_g"\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/MB7_3R.bed
--- a/fml_gff_converter_programs/t/MB7_3R.bed Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,241 +0,0 @@\n-3R\t60868\t66780\tCG1106-RD\t2\t+\t60868\t66780\t0\t10\t179,61,43,252,193,75,196,1706,126,181,\t0,942,1093,1407,2479,3157,3424,3673,5443,5732,\n-3R\t60868\t66780\tCG1106-RF\t3\t+\t60868\t66780\t0\t8\t179,223,193,75,196,1706,126,181,\t0,1436,2479,3157,3424,3673,5443,5732,\n-3R\t485305\t530979\tCG31531-RC\t2\t+\t485305\t530979\t0\t9\t311,161,54,59,352,207,143,157,4100,\t0,656,11214,21196,37608,38466,40134,40588,41575,\n-3R\t62515\t66780\tCG1106-RA\t1\t+\t62515\t66780\t0\t7\t147,193,75,196,1706,126,181,\t0,832,1510,1777,2026,3796,4085,\n-3R\t60868\t66780\tCG1106-RB\t1\t+\t60868\t66780\t0\t8\t179,252,193,75,196,1706,126,181,\t0,1407,2479,3157,3424,3673,5443,5732,\n-3R\t44184\t45852\tCG31516.a\t31\t-\t44184\t45852\t0\t1\t1669,\t0,\n-3R\t60868\t66780\tCG1106-RH\t1\t+\t60868\t66780\t0\t9\t179,61,252,193,75,196,1706,126,181,\t0,942,1407,2479,3157,3424,3673,5443,5732,\n-3R\t612767\t627445\tCG42574.b\t3\t-\t612767\t627445\t0\t9\t1243,202,189,1836,989,1105,1847,987,511,\t0,1329,1636,1888,3831,4913,6173,12647,14168,\n-3R\t612767\t627445\tCG42574.a\t3\t-\t612767\t627445\t0\t11\t1243,202,189,1836,989,1105,1847,87,579,987,511,\t0,1329,1636,1888,3831,4913,6173,8692,9444,12647,14168,\n-3R\t538609\t539918\tCG9769-RA.3d\t31\t-\t538609\t539918\t0\t1\t1310,\t0,\n-3R\t94943\t103515\tCG9766-RB\t1\t-\t94943\t103515\t0\t4\t568,205,246,165,\t0,627,888,8408,\n-3R\t17136\t21871\tDMG5-MB6.chr3R.1.002.a.a\t2\t+\t17136\t21871\t0\t6\t116,95,486,540,168,281,\t0,2817,2978,3535,4231,4455,\n-3R\t44179\t45852\tCG31516-RA\t34\t-\t44179\t45852\t0\t1\t1674,\t0,\n-3R\t228771\t232914\tCG14648-RA\t1\t+\t228771\t232914\t0\t6\t232,171,719,69,211,1326,\t0,1420,1652,2431,2552,2818,\n-3R\t1053010\t1057940\tCG10229-RA\t1\t-\t1053010\t1057940\t0\t6\t903,560,122,604,222,162,\t0,989,2419,2734,3705,4769,\n-3R\t310763\t313185\tCG1078-RA\t1\t+\t310763\t313185\t0\t4\t1491,93,71,516,\t0,1555,1780,1907,\n-3R\t15388\t16170\tCG18090-RA\t34\t-\t15388\t16170\t0\t1\t783,\t0,\n-3R\t74439\t76518\tCG14643-RA\t1\t-\t74439\t76518\t0\t4\t301,655,608,62,\t0,474,1190,2018,\n-3R\t117996\t120558\tCG9780-RA\t1\t-\t117996\t120558\t0\t2\t1094,1084,\t0,1479,\n-3R\t1090664\t1094197\tCG31543-RA\t1\t+\t1090664\t1094197\t0\t5\t696,280,81,140,1014,\t0,1811,2179,2321,2520,\n-3R\t1082763\t1094197\tCG31543-RC\t1\t+\t1082763\t1094197\t0\t5\t1081,280,81,140,1014,\t0,9712,10080,10222,10421,\n-3R\t1063223\t1077468\tCG12163-RA\t1\t-\t1063223\t1077468\t0\t6\t656,182,744,128,417,323,\t0,712,953,1991,6337,13923,\n-3R\t470349\t471316\tCG9771-RA\t1\t-\t470349\t471316\t0\t2\t450,414,\t0,554,\n-3R\t92676\t94166\tCG1092.a\t1\t+\t92676\t94166\t0\t2\t252,1184,\t0,307,\n-3R\t306534\t309943\tCG14651-RB\t34\t+\t306534\t309943\t0\t1\t3410,\t0,\n-3R\t160820\t161237\tCG14645-RA\t34\t+\t160820\t161237\t0\t1\t418,\t0,\n-3R\t161164\t163374\tCG9772-RD\t1\t-\t161164\t163374\t0\t4\t288,160,400,1184,\t0,344,567,1027,\n-3R\t255639\t259652\tCG9805-RA\t1\t-\t255639\t259652\t0\t4\t252,2239,1085,232,\t0,311,2600,3782,\n-3R\t161143\t163408\tCG9772-RA\t2\t-\t161143\t163408\t0\t5\t309,160,400,874,139,\t0,365,588,1048,2127,\n-3R\t23030\t30295\tCG12582-RB\t1\t+\t23030\t30295\t0\t8\t564,514,97,289,480,422,381,361,\t0,951,1522,1697,4535,5068,6474,6905,\n-3R\t22997\t30295\tCG12582-RA\t1\t+\t22997\t30295\t0\t9\t288,135,514,97,289,480,422,381,361,\t0,462,984,1555,1730,4568,5101,6507,6938,\n-3R\t161143\t165287\tCG9772-RB\t2\t-\t161143\t165287\t0\t5\t309,160,400,874,199,\t0,365,588,1048,3946,\n-3R\t15414\t15982\tCG18090.a\t31\t-\t15414\t15982\t0\t1\t569,\t0,\n-3R\t135364\t136669\tCG9779-RA\t1\t-\t135364\t136669\t0\t3\t717,241,161,\t0,772,1145,\n-3R\t438597\t459031\tCG1056-RA\t2\t+\t438597\t459031\t0\t7\t476,1116,225,199,444,393,1502,\t0,9634,14098,14384,15031,17254,18933,\n-3R\t467696\t470358\tCG18271-RB\t1\t+\t467696\t470358\t0\t2\t35,1093,\t0,1570,\n-3R\t1045390\t1047270\tCG1116.a\t3\t+\t1045390\t1047270\t0\t6\t188,218,272,150,577,157,\t0,248,531,868,1081,1724,\n-3R\t145412\t151817\tCG9795-RB\t3\t+\t145412\t151817\t0\t10\t167,203,392,203,212,174,85,82,589,399,\t0,2155,3541,3997,4263,4647,4898,5040,5183,6007,\n-3R\t145412\t151817\tCG9795-RC\t3\t+\t145412\t151817\t0\t10\t201,203,392,203,212,174,85,82,589,399,\t0,2155,3541,3997,4263,4647,4898,5040,5183,6007,\n-3R\t248610\t255054\tCG9809-RD\t1\t-\t248610\t255054\t0\t7\t777,146,1832,795,106,119,358,\t0,840,1045,2943,3835,3994,6087,\n-3R\t72744\t74040\tCG14639-RA\t1\t+\t72744\t74040\t0\t2\t63,1175,\t0,122,\n-3R\t248220\t255054\tCG9809-RB\t1\t-\t248220\t255054\t0\t7\t1167,146,1832,'..b',1752,\t0,2490,4042,7888,14945,15384,17304,19606,\n-3R\t107427\t127263\tCG32490.d\t3\t+\t107427\t127263\t0\t8\t167,161,181,12,158,133,1098,1752,\t0,969,2521,6367,13424,13863,15783,18085,\n-3R\t107427\t127263\tCG32490.c\t3\t+\t107427\t127263\t0\t8\t62,161,181,12,158,133,1098,1752,\t0,969,2521,6367,13424,13863,15783,18085,\n-3R\t107627\t127263\tCG32490.b\t3\t+\t107627\t127263\t0\t8\t99,161,181,12,158,133,1098,1752,\t0,769,2321,6167,13224,13663,15583,17885,\n-3R\t107886\t127263\tCG32490.a\t3\t+\t107886\t127263\t0\t8\t151,161,181,12,158,133,1098,1752,\t0,510,2062,5908,12965,13404,15324,17626,\n-3R\t340352\t383789\tCG34357-RB\t1\t-\t340352\t383789\t0\t6\t490,100,377,885,311,126,\t0,8358,19519,23175,42066,43312,\n-3R\t160820\t161223\tCG14645.a\t31\t+\t160820\t161223\t0\t1\t404,\t0,\n-3R\t1079070\t1081125\tCG1113-RA\t3\t+\t1079070\t1081125\t0\t3\t322,1045,430,\t0,527,1626,\n-3R\t23013\t30295\tCG12582.a\t3\t+\t23013\t30295\t0\t9\t272,135,552,97,289,480,422,381,361,\t0,446,930,1539,1714,4552,5085,6491,6922,\n-3R\t403661\t404368\tCG32945.a\t34\t-\t403661\t404368\t0\t1\t708,\t0,\n-3R\t107427\t128309\tCG32490.i\t2\t+\t107427\t128309\t0\t6\t167,161,181,167,133,62,\t0,969,2521,13415,13863,20821,\n-3R\t107427\t127263\tCG32490.h\t3\t+\t107427\t127263\t0\t7\t167,161,181,167,133,1098,1752,\t0,969,2521,13415,13863,15783,18085,\n-3R\t1045390\t1047270\tCG1116-RA\t1\t+\t1045390\t1047270\t0\t6\t188,215,272,150,577,157,\t0,251,531,868,1081,1724,\n-3R\t1045390\t1047270\tCG1116-RB\t3\t+\t1045390\t1047270\t0\t5\t466,272,150,577,157,\t0,531,868,1081,1724,\n-3R\t60868\t66781\tCG1106.d\t3\t+\t60868\t66781\t0\t7\t179,223,193,75,196,1896,182,\t0,1436,2479,3157,3424,3673,5732,\n-3R\t60868\t66781\tCG1106.e\t3\t+\t60868\t66781\t0\t6\t179,193,75,196,1896,182,\t0,2479,3157,3424,3673,5732,\n-3R\t64259\t66780\tCG1106.f\t3\t+\t64259\t66780\t0\t4\t229,1706,126,181,\t0,282,2052,2341,\n-3R\t64259\t66781\tCG1106.g\t3\t+\t64259\t66781\t0\t3\t229,1896,182,\t0,282,2341,\n-3R\t60942\t66780\tCG1106.a\t2\t+\t60942\t66780\t0\t8\t105,387,193,75,196,1706,126,181,\t0,1333,2405,3083,3350,3599,5369,5658,\n-3R\t60868\t66781\tCG1106.b\t3\t+\t60868\t66781\t0\t8\t179,61,252,193,75,196,1896,182,\t0,942,1407,2479,3157,3424,3673,5732,\n-3R\t60868\t66780\tCG1106.c\t3\t+\t60868\t66780\t0\t7\t179,193,75,196,1706,126,181,\t0,2479,3157,3424,3673,5443,5732,\n-3R\t560136\t574777\tCG9765-RA\t1\t-\t560136\t574777\t0\t10\t933,150,162,105,72,62,308,2331,202,265,\t0,994,1228,2609,2767,2900,3059,7438,10050,14377,\n-3R\t560132\t572136\tCG9765-RC\t1\t-\t560132\t572136\t0\t10\t937,150,162,105,72,65,266,2331,202,618,\t0,998,1232,2613,2771,2904,3063,7442,10054,11387,\n-3R\t560132\t572136\tCG9765-RD\t3\t-\t560132\t572136\t0\t10\t937,150,162,105,72,65,266,2331,202,573,\t0,998,1232,2613,2771,2904,3063,7442,10054,11432,\n-3R\t233159\t241835\tCG32944-RE\t2\t-\t233159\t241835\t0\t8\t567,182,126,127,71,166,56,431,\t0,622,5444,5804,5999,7096,7323,8246,\n-3R\t560132\t565269\tCG9765-RF\t2\t-\t560132\t565269\t0\t8\t937,150,162,105,72,65,266,37,\t0,998,1232,2613,2771,2904,3063,5101,\n-3R\t216097\t217839\tCG14646-RA\t1\t+\t216097\t217839\t0\t2\t684,1007,\t0,736,\n-3R\t263102\t267050\tCG14650-RA\t1\t+\t263102\t267050\t0\t6\t2380,154,125,103,293,582,\t0,2437,2647,2841,3013,3367,\n-3R\t538609\t540025\tCG9769.a\t34\t-\t538609\t540025\t0\t1\t1417,\t0,\n-3R\t655633\t656232\tCG14658-RA\t34\t-\t655633\t656232\t0\t1\t600,\t0,\n-3R\t145412\t151817\tCG9795-RA\t1\t+\t145412\t151817\t0\t11\t201,203,148,392,203,212,174,85,82,589,399,\t0,2155,2658,3541,3997,4263,4647,4898,5040,5183,6007,\n-3R\t485305\t530979\tCG31531.a\t2\t+\t485305\t530979\t0\t9\t94,164,54,59,352,207,143,157,4100,\t0,653,11214,21196,37608,38466,40134,40588,41575,\n-3R\t226212\t227739\tCG14647.a\t1\t+\t226212\t227739\t0\t3\t649,132,417,\t0,864,1111,\n-3R\t204401\t206932\tCG11739-RA\t2\t+\t204401\t206932\t0\t7\t331,169,150,118,162,128,508,\t0,771,996,1249,1430,1825,2024,\n-3R\t205029\t206932\tCG11739-RB\t1\t+\t205029\t206932\t0\t6\t312,150,118,162,128,508,\t0,368,621,802,1197,1396,\n-3R\t204386\t206932\tCG11739-RC\t1\t+\t204386\t206932\t0\t7\t66,169,150,118,162,128,508,\t0,786,1011,1264,1445,1840,2039,\n-3R\t204644\t206932\tCG11739-RD\t2\t+\t204644\t206932\t0\t7\t88,169,150,118,162,128,508,\t0,528,753,1006,1187,1582,1781,\n-3R\t639058\t641751\tCG1129-RB\t1\t+\t639058\t641751\t0\t4\t273,306,535,743,\t0,335,903,1951,\n-3R\t639076\t641751\tCG1129-RA\t1\t+\t639076\t641751\t0\t5\t26,90,306,535,743,\t0,165,317,885,1933,\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/MB7_3R.gff3
--- a/fml_gff_converter_programs/t/MB7_3R.gff3 Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,3971 +0,0 @@\n-##gff-version 3\n-3R\tMB7\tgene\t361\t10200\t0\t+\t.\tID=CG12581;Name=CG12581\n-3R\tMB7\tmRNA\t361\t10200\t3\t+\t.\tID=CG12581-RB;Parent=CG12581;Name=CG12581-RB\n-3R\tMB7\texon\t361\t509\t0\t+\t.\tParent=CG12581-RB\n-3R\tMB7\texon\t578\t1913\t0\t+\t.\tParent=CG12581-RB\n-3R\tMB7\texon\t7784\t8649\t0\t+\t.\tParent=CG12581-RB\n-3R\tMB7\texon\t9439\t10200\t0\t+\t.\tParent=CG12581-RB\n-3R\tMB7\tfive_prime_UTR\t361\t509\t0\t+\t.\tParent=CG12581-RB\n-3R\tMB7\tfive_prime_UTR\t578\t1114\t0\t+\t.\tParent=CG12581-RB\n-3R\tMB7\tstart_codon\t1115\t1117\t0\t+\t0\tParent=CG12581-RB\n-3R\tMB7\tCDS\t1115\t1913\t0\t+\t0\tParent=CG12581-RB\n-3R\tMB7\tCDS\t7784\t8649\t0\t+\t2\tParent=CG12581-RB\n-3R\tMB7\tCDS\t9439\t9771\t0\t+\t0\tParent=CG12581-RB\n-3R\tMB7\tstop_codon\t9769\t9771\t0\t+\t0\tParent=CG12581-RB\n-3R\tMB7\tthree_prime_UTR\t9772\t10200\t0\t+\t.\tParent=CG12581-RB\n-3R\tMB7\tmRNA\t380\t10200\t1\t+\t.\tID=CG12581-RA;Parent=CG12581;Name=CG12581-RA\n-3R\tMB7\texon\t380\t1913\t0\t+\t.\tParent=CG12581-RA\n-3R\tMB7\texon\t7784\t8649\t0\t+\t.\tParent=CG12581-RA\n-3R\tMB7\texon\t9439\t10200\t0\t+\t.\tParent=CG12581-RA\n-3R\tMB7\tfive_prime_UTR\t380\t1114\t0\t+\t.\tParent=CG12581-RA\n-3R\tMB7\tstart_codon\t1115\t1117\t0\t+\t0\tParent=CG12581-RA\n-3R\tMB7\tCDS\t1115\t1913\t0\t+\t0\tParent=CG12581-RA\n-3R\tMB7\tCDS\t7784\t8649\t0\t+\t2\tParent=CG12581-RA\n-3R\tMB7\tCDS\t9439\t9771\t0\t+\t0\tParent=CG12581-RA\n-3R\tMB7\tstop_codon\t9769\t9771\t0\t+\t0\tParent=CG12581-RA\n-3R\tMB7\tthree_prime_UTR\t9772\t10200\t0\t+\t.\tParent=CG12581-RA\n-3R\tMB7\tgene\t15388\t16170\t0\t-\t.\tID=CG18090;Name=CG18090\n-3R\tMB7\tmRNA\t15414\t15982\t31\t-\t.\tID=CG18090.a;Parent=CG18090;Name=CG18090.a\n-3R\tMB7\texon\t15414\t15982\t0\t-\t.\tParent=CG18090.a\n-3R\tMB7\tthree_prime_UTR\t15414\t15529\t0\t-\t.\tParent=CG18090.a\n-3R\tMB7\tstop_codon\t15530\t15532\t0\t-\t0\tParent=CG18090.a\n-3R\tMB7\tCDS\t15530\t15955\t0\t-\t0\tParent=CG18090.a\n-3R\tMB7\tstart_codon\t15953\t15955\t0\t-\t0\tParent=CG18090.a\n-3R\tMB7\tfive_prime_UTR\t15956\t15982\t0\t-\t.\tParent=CG18090.a\n-3R\tMB7\tmRNA\t15388\t16170\t34\t-\t.\tID=CG18090-RA;Parent=CG18090;Name=CG18090-RA\n-3R\tMB7\texon\t15388\t16170\t0\t-\t.\tParent=CG18090-RA\n-3R\tMB7\tthree_prime_UTR\t15388\t15529\t0\t-\t.\tParent=CG18090-RA\n-3R\tMB7\tstop_codon\t15530\t15532\t0\t-\t0\tParent=CG18090-RA\n-3R\tMB7\tCDS\t15530\t15955\t0\t-\t0\tParent=CG18090-RA\n-3R\tMB7\tstart_codon\t15953\t15955\t0\t-\t0\tParent=CG18090-RA\n-3R\tMB7\tfive_prime_UTR\t15956\t16170\t0\t-\t.\tParent=CG18090-RA\n-3R\tMB7\tgene\t17136\t21871\t0\t+\t.\tID=DMG5-MB6.chr3R.1.002.a;Name=DMG5-MB6.chr3R.1.002.a\n-3R\tMB7\tmRNA\t17136\t21871\t2\t+\t.\tID=DMG5-MB6.chr3R.1.002.a.a;Parent=DMG5-MB6.chr3R.1.002.a;Name=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\texon\t17136\t17251\t0\t+\t.\tParent=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\texon\t19953\t20047\t0\t+\t.\tParent=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\texon\t20114\t20599\t0\t+\t.\tParent=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\texon\t20671\t21210\t0\t+\t.\tParent=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\texon\t21367\t21534\t0\t+\t.\tParent=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\texon\t21591\t21871\t0\t+\t.\tParent=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\tstart_codon\t17136\t17138\t0\t+\t0\tParent=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\tCDS\t17136\t17251\t0\t+\t0\tParent=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\tCDS\t19953\t20047\t0\t+\t1\tParent=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\tCDS\t20114\t20599\t0\t+\t2\tParent=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\tCDS\t20671\t20759\t0\t+\t2\tParent=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\tstop_codon\t20757\t20759\t0\t+\t0\tParent=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\tthree_prime_UTR\t20760\t21210\t0\t+\t.\tParent=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\tthree_prime_UTR\t21367\t21534\t0\t+\t.\tParent=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\tthree_prime_UTR\t21591\t21871\t0\t+\t.\tParent=DMG5-MB6.chr3R.1.002.a.a\n-3R\tMB7\tgene\t22931\t30295\t0\t+\t.\tID=CG12582;Name=CG12582\n-3R\tMB7\tmRNA\t23013\t30295\t3\t+\t.\tID=CG12582.a;Parent=CG12582;Name=CG12582.a\n-3R\tMB7\texon\t23013\t23284\t0\t+\t.\tParent=CG12582.a\n-3R\tMB7\texon\t23459\t23593\t0\t+\t.\tParent=CG12582.a\n-3R\tMB7\texon\t23943\t24494\t0\t+\t.\tParent=CG12582.a\n-3R\tMB7\texon\t24552\t24648\t0\t+\t.\tParent=CG12582.a\n-3R\tMB7\texon\t24727\t25015\t0\t+\t.\tParent=CG12582.a\n-3R\tMB7\texon\t27565\t28044\t0\t+\t.\tParent=CG12582.a\n-3R\tMB7\texon\t28098\t28519\t0\t+\t.\tParent=CG12582.a\n-3R\tMB7\texon\t29504\t29884\t0\t+\t.\tParent=CG12582.a\n-3R\tMB7\texon\t29935\t30295\t0\t+\t.\tParent=CG12582.a\n-3R\tMB7\tfive_prime_UTR\t23013\t23284\t0\t+\t.\tParent=CG125'..b'517\t0\t-\t.\tParent=CG32464-RB\n-3R\tMB7\texon\t1121579\t1121685\t0\t-\t.\tParent=CG32464-RB\n-3R\tMB7\texon\t1121869\t1122357\t0\t-\t.\tParent=CG32464-RB\n-3R\tMB7\texon\t1123924\t1124211\t0\t-\t.\tParent=CG32464-RB\n-3R\tMB7\texon\t1125192\t1125295\t0\t-\t.\tParent=CG32464-RB\n-3R\tMB7\texon\t1129833\t1129904\t0\t-\t.\tParent=CG32464-RB\n-3R\tMB7\texon\t1138711\t1139219\t0\t-\t.\tParent=CG32464-RB\n-3R\tMB7\texon\t1139660\t1140027\t0\t-\t.\tParent=CG32464-RB\n-3R\tMB7\texon\t1148710\t1148847\t0\t-\t.\tParent=CG32464-RB\n-3R\tMB7\texon\t1149387\t1149566\t0\t-\t.\tParent=CG32464-RB\n-3R\tMB7\tthree_prime_UTR\t1098665\t1099668\t0\t-\t.\tParent=CG32464-RB\n-3R\tMB7\tstop_codon\t1099669\t1099671\t0\t-\t0\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1099669\t1099804\t0\t-\t1\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1099871\t1100040\t0\t-\t0\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1100457\t1100616\t0\t-\t1\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1100688\t1100809\t0\t-\t0\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1118362\t1118563\t0\t-\t1\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1118720\t1118882\t0\t-\t2\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1118941\t1119092\t0\t-\t1\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1119784\t1119956\t0\t-\t0\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1120028\t1120577\t0\t-\t1\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1121363\t1121517\t0\t-\t0\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1121579\t1121685\t0\t-\t2\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1121869\t1122357\t0\t-\t2\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1123924\t1124211\t0\t-\t2\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1125192\t1125295\t0\t-\t1\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1129833\t1129904\t0\t-\t1\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1138711\t1139219\t0\t-\t0\tParent=CG32464-RB\n-3R\tMB7\tCDS\t1139660\t1139920\t0\t-\t0\tParent=CG32464-RB\n-3R\tMB7\tstart_codon\t1139918\t1139920\t0\t-\t0\tParent=CG32464-RB\n-3R\tMB7\tfive_prime_UTR\t1139921\t1140027\t0\t-\t.\tParent=CG32464-RB\n-3R\tMB7\tfive_prime_UTR\t1148710\t1148847\t0\t-\t.\tParent=CG32464-RB\n-3R\tMB7\tfive_prime_UTR\t1149387\t1149566\t0\t-\t.\tParent=CG32464-RB\n-3R\tMB7\tmRNA\t1098665\t1149566\t3\t-\t.\tID=CG32464-RU;Parent=CG32464;Name=CG32464-RU\n-3R\tMB7\texon\t1098665\t1099804\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1099871\t1100040\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1100457\t1100616\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1100688\t1100809\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1118362\t1118563\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1118720\t1118882\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1118941\t1119092\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1119784\t1119956\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1120028\t1120577\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1121363\t1121517\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1121579\t1121685\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1121869\t1122357\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1123924\t1124211\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1125192\t1125295\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1138711\t1139219\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1139660\t1140027\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1148710\t1148847\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\texon\t1149387\t1149566\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\tthree_prime_UTR\t1098665\t1099668\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\tstop_codon\t1099669\t1099671\t0\t-\t0\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1099669\t1099804\t0\t-\t1\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1099871\t1100040\t0\t-\t0\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1100457\t1100616\t0\t-\t1\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1100688\t1100809\t0\t-\t0\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1118362\t1118563\t0\t-\t1\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1118720\t1118882\t0\t-\t2\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1118941\t1119092\t0\t-\t1\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1119784\t1119956\t0\t-\t0\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1120028\t1120577\t0\t-\t1\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1121363\t1121517\t0\t-\t0\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1121579\t1121685\t0\t-\t2\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1121869\t1122357\t0\t-\t2\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1123924\t1124211\t0\t-\t2\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1125192\t1125295\t0\t-\t1\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1138711\t1139219\t0\t-\t0\tParent=CG32464-RU\n-3R\tMB7\tCDS\t1139660\t1139920\t0\t-\t0\tParent=CG32464-RU\n-3R\tMB7\tstart_codon\t1139918\t1139920\t0\t-\t0\tParent=CG32464-RU\n-3R\tMB7\tfive_prime_UTR\t1139921\t1140027\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\tfive_prime_UTR\t1148710\t1148847\t0\t-\t.\tParent=CG32464-RU\n-3R\tMB7\tfive_prime_UTR\t1149387\t1149566\t0\t-\t.\tParent=CG32464-RU\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/UCSC_transcripts.gff3
--- a/fml_gff_converter_programs/t/UCSC_transcripts.gff3 Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,129 +0,0 @@\n-##gff-version 3\n-chr1\thg19_ccdsGene\tgene\t896074\t900571\t.\t+\t.\tID=Gene:CCDS30550.1;Name=Gene:CCDS30550.1\n-chr1\thg19_ccdsGene\tmRNA\t896074\t900571\t0.000000\t+\t.\tID=Transcript:CCDS30550.1;Parent=Gene:CCDS30550.1\n-chr1\thg19_ccdsGene\tCDS\t896074\t896180\t.\t+\t0\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\tCDS\t896673\t896932\t.\t+\t1\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\tCDS\t897009\t897130\t.\t+\t2\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\tCDS\t897206\t897427\t.\t+\t0\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\tCDS\t897735\t897851\t.\t+\t0\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\tCDS\t898084\t898297\t.\t+\t0\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\tCDS\t898489\t898633\t.\t+\t2\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\tCDS\t898717\t898884\t.\t+\t1\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\tCDS\t899300\t899388\t.\t+\t1\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\tCDS\t899487\t899560\t.\t+\t2\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\tCDS\t899729\t899910\t.\t+\t0\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\tCDS\t900343\t900571\t.\t+\t1\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\texon\t896074\t896180\t.\t+\t.\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\texon\t896673\t896932\t.\t+\t.\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\texon\t897009\t897130\t.\t+\t.\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\texon\t897206\t897427\t.\t+\t.\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\texon\t897735\t897851\t.\t+\t.\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\texon\t898084\t898297\t.\t+\t.\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\texon\t898489\t898633\t.\t+\t.\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\texon\t898717\t898884\t.\t+\t.\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\texon\t899300\t899388\t.\t+\t.\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\texon\t899487\t899560\t.\t+\t.\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\texon\t899729\t899910\t.\t+\t.\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\texon\t900343\t900571\t.\t+\t.\tParent=Transcript:CCDS30550.1\n-chr1\thg19_ccdsGene\tgene\t880074\t894620\t.\t-\t.\tID=Gene:CCDS3.1;Name=Gene:CCDS3.1\n-chr1\thg19_ccdsGene\tmRNA\t880074\t894620\t0.000000\t-\t.\tID=Transcript:CCDS3.1;Parent=Gene:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t880074\t880180\t.\t-\t2\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t880437\t880526\t.\t-\t2\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t880898\t881033\t.\t-\t0\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t881553\t881666\t.\t-\t0\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t881782\t881925\t.\t-\t0\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t883511\t883612\t.\t-\t0\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t883870\t883983\t.\t-\t0\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t886507\t886618\t.\t-\t1\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t887380\t887519\t.\t-\t0\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t887792\t887980\t.\t-\t0\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t888555\t888668\t.\t-\t0\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t889162\t889272\t.\t-\t0\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t889384\t889462\t.\t-\t1\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t891303\t891393\t.\t-\t2\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t891475\t891595\t.\t-\t0\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t892274\t892405\t.\t-\t0\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t892479\t892653\t.\t-\t1\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t894309\t894461\t.\t-\t1\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\tCDS\t894595\t894620\t.\t-\t0\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\texon\t880074\t880180\t.\t-\t.\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\texon\t880437\t880526\t.\t-\t.\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\texon\t880898\t881033\t.\t-\t.\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\texon\t881553\t881666\t.\t-\t.\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\texon\t881782\t881925\t.\t-\t.\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\texon\t883511\t883612\t.\t-\t.\tParent=Transcript:CCDS3.1\n-chr1\thg19_ccdsGene\texon\t883870\t883983\t.\t-\t.\tParent=Transc'..b'anscript:CCDS2.2\n-chr1\thg19_ccdsGene\tCDS\t874420\t874509\t.\t+\t2\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\tCDS\t874655\t874840\t.\t+\t2\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\tCDS\t876524\t876686\t.\t+\t2\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\tCDS\t877516\t877631\t.\t+\t1\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\tCDS\t877790\t877868\t.\t+\t2\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\tCDS\t877939\t878438\t.\t+\t1\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\tCDS\t878633\t878757\t.\t+\t2\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\tCDS\t879078\t879188\t.\t+\t0\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\tCDS\t879288\t879533\t.\t+\t0\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\texon\t861322\t861393\t.\t+\t.\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\texon\t865535\t865716\t.\t+\t.\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\texon\t866419\t866469\t.\t+\t.\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\texon\t871152\t871276\t.\t+\t.\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\texon\t874420\t874509\t.\t+\t.\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\texon\t874655\t874840\t.\t+\t.\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\texon\t876524\t876686\t.\t+\t.\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\texon\t877516\t877631\t.\t+\t.\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\texon\t877790\t877868\t.\t+\t.\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\texon\t877939\t878438\t.\t+\t.\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\texon\t878633\t878757\t.\t+\t.\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\texon\t879078\t879188\t.\t+\t.\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\texon\t879288\t879533\t.\t+\t.\tParent=Transcript:CCDS2.2\n-chr1\thg19_ccdsGene\tgene\t901912\t909955\t.\t+\t.\tID=Gene:CCDS4.1;Name=Gene:CCDS4.1\n-chr1\thg19_ccdsGene\tmRNA\t901912\t909955\t0.000000\t+\t.\tID=Transcript:CCDS4.1;Parent=Gene:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t901912\t901994\t.\t+\t0\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t902084\t902183\t.\t+\t1\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t905657\t905803\t.\t+\t0\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t905901\t905981\t.\t+\t0\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t906066\t906138\t.\t+\t0\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t906259\t906386\t.\t+\t2\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t906493\t906588\t.\t+\t0\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t906704\t906784\t.\t+\t0\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t907455\t907530\t.\t+\t0\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t907668\t907804\t.\t+\t2\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t908241\t908390\t.\t+\t0\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t908566\t908706\t.\t+\t0\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t908880\t909020\t.\t+\t0\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t909213\t909431\t.\t+\t0\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t909696\t909744\t.\t+\t0\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\tCDS\t909822\t909955\t.\t+\t2\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t901912\t901994\t.\t+\t.\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t902084\t902183\t.\t+\t.\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t905657\t905803\t.\t+\t.\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t905901\t905981\t.\t+\t.\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t906066\t906138\t.\t+\t.\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t906259\t906386\t.\t+\t.\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t906493\t906588\t.\t+\t.\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t906704\t906784\t.\t+\t.\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t907455\t907530\t.\t+\t.\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t907668\t907804\t.\t+\t.\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t908241\t908390\t.\t+\t.\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t908566\t908706\t.\t+\t.\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t908880\t909020\t.\t+\t.\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t909213\t909431\t.\t+\t.\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t909696\t909744\t.\t+\t.\tParent=Transcript:CCDS4.1\n-chr1\thg19_ccdsGene\texon\t909822\t909955\t.\t+\t.\tParent=Transcript:CCDS4.1\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/UCSC_transcripts.gtf
--- a/fml_gff_converter_programs/t/UCSC_transcripts.gtf Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,129 +0,0 @@\n-chr1\thg19_ccdsGene\tstart_codon\t861322\t861324\t0.000000\t+\t.\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\tCDS\t861322\t861393\t0.000000\t+\t0\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\texon\t861322\t861393\t0.000000\t+\t.\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\tCDS\t865535\t865716\t0.000000\t+\t0\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\texon\t865535\t865716\t0.000000\t+\t.\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\tCDS\t866419\t866469\t0.000000\t+\t1\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\texon\t866419\t866469\t0.000000\t+\t.\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\tCDS\t871152\t871276\t0.000000\t+\t1\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\texon\t871152\t871276\t0.000000\t+\t.\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\tCDS\t874420\t874509\t0.000000\t+\t2\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\texon\t874420\t874509\t0.000000\t+\t.\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\tCDS\t874655\t874840\t0.000000\t+\t2\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\texon\t874655\t874840\t0.000000\t+\t.\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\tCDS\t876524\t876686\t0.000000\t+\t2\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\texon\t876524\t876686\t0.000000\t+\t.\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\tCDS\t877516\t877631\t0.000000\t+\t1\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\texon\t877516\t877631\t0.000000\t+\t.\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\tCDS\t877790\t877868\t0.000000\t+\t2\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\texon\t877790\t877868\t0.000000\t+\t.\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\tCDS\t877939\t878438\t0.000000\t+\t1\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\texon\t877939\t878438\t0.000000\t+\t.\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\tCDS\t878633\t878757\t0.000000\t+\t2\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\texon\t878633\t878757\t0.000000\t+\t.\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\tCDS\t879078\t879188\t0.000000\t+\t0\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\texon\t879078\t879188\t0.000000\t+\t.\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\tCDS\t879288\t879530\t0.000000\t+\t0\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\tstop_codon\t879531\t879533\t0.000000\t+\t.\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\texon\t879288\t879533\t0.000000\t+\t.\tgene_id "CCDS2.2"; transcript_id "CCDS2.2"; \n-chr1\thg19_ccdsGene\tstop_codon\t880074\t880076\t0.000000\t-\t.\tgene_id "CCDS3.1"; transcript_id "CCDS3.1"; \n-chr1\thg19_ccdsGene\tCDS\t880077\t880180\t0.000000\t-\t2\tgene_id "CCDS3.1"; transcript_id "CCDS3.1"; \n-chr1\thg19_ccdsGene\texon\t880074\t880180\t0.000000\t-\t.\tgene_id "CCDS3.1"; transcript_id "CCDS3.1"; \n-chr1\thg19_ccdsGene\tCDS\t880437\t880526\t0.000000\t-\t2\tgene_id "CCDS3.1"; transcript_id "CCDS3.1"; \n-chr1\thg19_ccdsGene\texon\t880437\t880526\t0.000000\t-\t.\tgene_id "CCDS3.1"; transcript_id "CCDS3.1"; \n-chr1\thg19_ccdsGene\tCDS\t880898\t881033\t0.000000\t-\t0\tgene_id "CCDS3.1"; transcript_id "CCDS3.1"; \n-chr1\thg19_ccdsGene\texon\t880898\t881033\t0.000000\t-\t.\tgene_id "CCDS3.1"; transcript_id "CCDS3.1"; \n-chr1\thg19_ccdsGene\tCDS\t881553\t881666\t0.000000\t-\t0\tgene_id "CCDS3.1"; transcript_id "CCDS3.1"; \n-chr1\thg19_ccdsGene\texon\t881553\t881666\t0.000000\t-\t.\tgene_id "CCDS3.1"; transcript_id "CCDS3.1"; \n-chr1\thg19_ccdsGene\tCDS\t881782\t881925\t0.000000\t-\t0\tgene_id "CCDS3.1"; transcript_id "CCDS3.1"; \n-chr1\thg19_ccdsGene\texon\t881782\t881925\t0.000000\t-\t.\tgene_id "CCDS3.1"; transcript_id "CCDS3.1"; \n-chr1\thg19_ccdsGene\tCDS\t883511\t883612\t0.000000\t-\t0\tgene_id "CCDS3.1"; transcript_id "CCDS3.1"; \n-chr1\thg19_ccdsGene\texon\t883511\t883612\t0.000000\t-\t.\tgene_id "CCDS3.1"; transcript_id "CCDS3.1"; \n-chr'..b'0\t0.000000\t+\t2\tgene_id "CCDS30550.1"; transcript_id "CCDS30550.1"; \n-chr1\thg19_ccdsGene\texon\t899487\t899560\t0.000000\t+\t.\tgene_id "CCDS30550.1"; transcript_id "CCDS30550.1"; \n-chr1\thg19_ccdsGene\tCDS\t899729\t899910\t0.000000\t+\t0\tgene_id "CCDS30550.1"; transcript_id "CCDS30550.1"; \n-chr1\thg19_ccdsGene\texon\t899729\t899910\t0.000000\t+\t.\tgene_id "CCDS30550.1"; transcript_id "CCDS30550.1"; \n-chr1\thg19_ccdsGene\tCDS\t900343\t900568\t0.000000\t+\t1\tgene_id "CCDS30550.1"; transcript_id "CCDS30550.1"; \n-chr1\thg19_ccdsGene\tstop_codon\t900569\t900571\t0.000000\t+\t.\tgene_id "CCDS30550.1"; transcript_id "CCDS30550.1"; \n-chr1\thg19_ccdsGene\texon\t900343\t900571\t0.000000\t+\t.\tgene_id "CCDS30550.1"; transcript_id "CCDS30550.1"; \n-chr1\thg19_ccdsGene\tstart_codon\t901912\t901914\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t901912\t901994\t0.000000\t+\t0\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t901912\t901994\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t902084\t902183\t0.000000\t+\t1\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t902084\t902183\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t905657\t905803\t0.000000\t+\t0\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t905657\t905803\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t905901\t905981\t0.000000\t+\t0\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t905901\t905981\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t906066\t906138\t0.000000\t+\t0\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t906066\t906138\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t906259\t906386\t0.000000\t+\t2\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t906259\t906386\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t906493\t906588\t0.000000\t+\t0\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t906493\t906588\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t906704\t906784\t0.000000\t+\t0\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t906704\t906784\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t907455\t907530\t0.000000\t+\t0\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t907455\t907530\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t907668\t907804\t0.000000\t+\t2\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t907668\t907804\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t908241\t908390\t0.000000\t+\t0\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t908241\t908390\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t908566\t908706\t0.000000\t+\t0\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t908566\t908706\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t908880\t909020\t0.000000\t+\t0\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t908880\t909020\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t909213\t909431\t0.000000\t+\t0\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t909213\t909431\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t909696\t909744\t0.000000\t+\t0\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t909696\t909744\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tCDS\t909822\t909952\t0.000000\t+\t2\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\tstop_codon\t909953\t909955\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-chr1\thg19_ccdsGene\texon\t909822\t909955\t0.000000\t+\t.\tgene_id "CCDS4.1"; transcript_id "CCDS4.1"; \n-\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/ccds_genes.bed
--- a/fml_gff_converter_programs/t/ccds_genes.bed Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
@@ -1,20 +0,0 @@
-chr1 92149295 92327088 CCDS30770.1 0 - 92149295 92327088 0 16 119,108,42,121,300,159,141,153,338,190,148,169,184,138,185,61, 0,11933,14350,24924,28504,32497,32829,35573,36154,38216,43920,46066,51037,74874,113548,177732,
-chr1 67000041 67208778 CCDS30744.1 0 + 67000041 67208778 0 25 10,64,25,72,57,55,176,12,12,25,52,86,93,75,501,128,127,60,112,156,133,203,65,165,23, 0,91488,98711,101585,105418,108451,109185,126154,133171,136636,137585,138922,142645,145319,147510,154789,155831,161075,184935,194905,199389,204976,206299,206913,208714,
-chr1 8384389 8404073 CCDS30577.1 0 + 8384389 8404073 0 8 397,93,225,728,154,177,206,267, 0,968,1488,5879,11107,13486,15163,19417,
-chr1 16767256 16785385 CCDS44067.1 0 + 16767256 16785385 0 8 14,101,105,82,109,178,76,49, 0,2870,7108,7298,8331,11076,15056,18080,
-chr1 16767256 16785491 CCDS44066.1 0 + 16767256 16785491 0 7 92,101,105,82,109,178,155, 0,2870,7108,7298,8331,11076,18080,
-chr1 16767256 16785385 CCDS173.1 0 + 16767256 16785385 0 8 92,101,105,82,109,178,76,49, 0,2870,7108,7298,8331,11076,15056,18080,
-chr1 25072044 25167428 CCDS256.1 0 + 25072044 25167428 0 6 72,110,126,107,182,165, 0,52188,68540,81456,94306,95219,
-chr1 33547850 33585783 CCDS375.1 0 + 33547850 33585783 0 9 105,174,173,135,166,163,113,215,139, 0,1704,9800,11032,12298,14457,15817,35652,37794,
-chr1 48999844 50489468 CCDS44137.1 0 - 48999844 50489468 0 14 121,27,97,163,153,112,115,90,40,217,95,125,123,34, 0,717,5469,52831,56660,100320,119164,128979,333018,511411,711597,1163140,1317223,1489590,
-chr1 100661810 100715376 CCDS767.1 0 - 100661810 100715376 0 11 168,72,192,78,167,217,122,182,76,124,51, 0,9975,10190,14439,18562,19728,22371,34478,39181,44506,53515,
-chr1 150981108 151006710 CCDS977.1 0 + 150981108 151006710 0 8 39,93,203,185,159,95,159,429, 0,9179,9834,15978,16882,18600,20153,25173,
-chr1 175914288 176176114 CCDS44279.1 0 - 175914288 176176114 0 19 18,45,161,125,118,117,82,109,144,136,115,58,77,69,120,65,98,60,407, 0,2042,41790,43135,44209,82419,98033,98557,101028,135999,140623,171471,189857,203853,217716,218674,230757,239480,261419,
-chr1 175914288 176176114 CCDS30944.1 0 - 175914288 176176114 0 20 18,45,161,125,118,117,82,109,144,136,115,58,77,60,69,120,77,98,60,407, 0,2042,41790,43135,44209,82419,98033,98557,101028,135999,140623,171471,189857,191335,203853,217716,218662,230757,239480,261419,
-chr1 184446643 184588690 CCDS1362.1 0 + 184446643 184588690 0 5 94,95,77,61,39, 0,30078,113229,120891,142008,
-chr1 226420201 226496888 CCDS1553.1 0 - 226420201 226496888 0 15 106,98,180,126,81,102,120,134,158,126,134,105,95,33,79, 0,595,843,6470,18338,33032,33712,35456,45274,53832,55163,63341,65218,68672,76608,
-chr1 1982069 2116448 CCDS37.1 0 + 1982069 2116448 0 18 71,122,90,51,86,132,82,53,189,98,87,136,88,120,80,90,116,88, 0,4810,5853,8910,84631,93579,95396,98241,100159,105364,118887,121424,121670,123266,124123,124593,133952,134291,
-chr1 2075777 2116448 CCDS41229.1 0 + 2075777 2116448 0 13 3,82,53,189,98,87,136,88,120,80,90,116,88, 0,1688,4533,6451,11656,25179,27716,27962,29558,30415,30885,40244,40583,
-chr1 2985823 3350375 CCDS44048.1 0 + 2985823 3350375 0 17 37,350,51,135,103,208,148,154,1417,85,170,78,170,175,237,175,78, 0,116865,174827,315892,327231,333531,335479,336235,342124,345303,348568,349407,356321,356791,361612,362706,364474,
-chr1 2985823 3350375 CCDS41236.1 0 + 2985823 3350375 0 17 37,350,51,135,103,208,148,154,1417,85,170,78,170,175,237,175,135, 0,116865,174827,315892,327231,333531,335479,336235,342124,345303,348568,349407,356321,356791,361612,362706,364417,
-chr1 6285139 6295971 CCDS61.1 0 - 6285139 6295971 0 5 183,218,170,89,195, 0,6822,8394,9806,10637,
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/ccds_genes.gff3
--- a/fml_gff_converter_programs/t/ccds_genes.gff3 Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,290 +0,0 @@\n-##gff-version 3\n-chr1\tbed2gff\tgene\t92149296\t92327088\t0\t-\t.\tID=Gene:CCDS30770.1;Name=Gene:CCDS30770.1\n-chr1\tbed2gff\ttranscript\t92149296\t92327088\t0\t-\t.\tID=CCDS30770.1;Name=CCDS30770.1;Parent=Gene:CCDS30770.1\n-chr1\tbed2gff\texon\t92149296\t92149414\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\texon\t92161229\t92161336\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\texon\t92163646\t92163687\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\texon\t92174220\t92174340\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\texon\t92177800\t92178099\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\texon\t92181793\t92181951\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\texon\t92182125\t92182265\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\texon\t92184869\t92185021\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\texon\t92185450\t92185787\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\texon\t92187512\t92187701\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\texon\t92193216\t92193363\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\texon\t92195362\t92195530\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\texon\t92200333\t92200516\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\texon\t92224170\t92224307\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\texon\t92262844\t92263028\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\texon\t92327028\t92327088\t0\t-\t.\tParent=CCDS30770.1\n-chr1\tbed2gff\tgene\t67000042\t67208778\t0\t+\t.\tID=Gene:CCDS30744.1;Name=Gene:CCDS30744.1\n-chr1\tbed2gff\ttranscript\t67000042\t67208778\t0\t+\t.\tID=CCDS30744.1;Name=CCDS30744.1;Parent=Gene:CCDS30744.1\n-chr1\tbed2gff\texon\t67000042\t67000051\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67091530\t67091593\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67098753\t67098777\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67101627\t67101698\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67105460\t67105516\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67108493\t67108547\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67109227\t67109402\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67126196\t67126207\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67133213\t67133224\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67136678\t67136702\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67137627\t67137678\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67138964\t67139049\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67142687\t67142779\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67145361\t67145435\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67147552\t67148052\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67154831\t67154958\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67155873\t67155999\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67161117\t67161176\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67184977\t67185088\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67194947\t67195102\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67199431\t67199563\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67205018\t67205220\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67206341\t67206405\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67206955\t67207119\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\texon\t67208756\t67208778\t0\t+\t.\tParent=CCDS30744.1\n-chr1\tbed2gff\tgene\t8384390\t8404073\t0\t+\t.\tID=Gene:CCDS30577.1;Name=Gene:CCDS30577.1\n-chr1\tbed2gff\ttranscript\t8384390\t8404073\t0\t+\t.\tID=CCDS30577.1;Name=CCDS30577.1;Parent=Gene:CCDS30577.1\n-chr1\tbed2gff\texon\t8384390\t8384786\t0\t+\t.\tParent=CCDS30577.1\n-chr1\tbed2gff\texon\t8385358\t8385450\t0\t+\t.\tParent=CCDS30577.1\n-chr1\tbed2gff\texon\t8385878\t8386102\t0\t+\t.\tParent=CCDS30577.1\n-chr1\tbed2gff\texon\t8390269\t8390996\t0\t+\t.\tParent=CCDS30577.1\n-chr1\tbed2gff\texon\t8395497\t8395650\t0\t+\t.\tParent=CCDS30577.1\n-chr1\tbed2gff\texon\t8397876\t8398052\t0\t+\t.\tParent=CCDS30577.1\n-chr1\tbed2gff\texon\t8399553\t8399758\t0\t+\t.\tParent=CCDS30577.1\n-chr1\tbed2gff\texon\t8403807\t8404073\t0\t+\t.\tParent=CCDS30577.1\n-chr1\tbed2gff\tgene\t16767257\t16785385\t0\t+\t.\tID=Gene:CCDS44067.1;Name=Gene:CCDS44067.1\n-chr1\tbed2gff\ttranscript\t16767257\t16785385\t0\t+\t.\tID=CCDS44067.1;Name=CCDS44067.1;Parent=Gene:CCDS44067.1\n-chr1\tbed2gff\texon\t16767257\t16767270\t0\t+\t.\tParent=CCDS44067.1\n-chr1\tbed2gff\texon\t16770127\t16770227\t0\t+\t.\tParent=CCDS44067.1\n-chr1\tbed2gff\texon\t16774365\t16774469\t0\t+\t.\tParent=CCDS44067.1\n'..b'bed2gff\texon\t2106663\t2106752\t0\t+\t.\tParent=CCDS37.1\n-chr1\tbed2gff\texon\t2116022\t2116137\t0\t+\t.\tParent=CCDS37.1\n-chr1\tbed2gff\texon\t2116361\t2116448\t0\t+\t.\tParent=CCDS37.1\n-chr1\tbed2gff\tgene\t2075778\t2116448\t0\t+\t.\tID=Gene:CCDS41229.1;Name=Gene:CCDS41229.1\n-chr1\tbed2gff\ttranscript\t2075778\t2116448\t0\t+\t.\tID=CCDS41229.1;Name=CCDS41229.1;Parent=Gene:CCDS41229.1\n-chr1\tbed2gff\texon\t2075778\t2075780\t0\t+\t.\tParent=CCDS41229.1\n-chr1\tbed2gff\texon\t2077466\t2077547\t0\t+\t.\tParent=CCDS41229.1\n-chr1\tbed2gff\texon\t2080311\t2080363\t0\t+\t.\tParent=CCDS41229.1\n-chr1\tbed2gff\texon\t2082229\t2082417\t0\t+\t.\tParent=CCDS41229.1\n-chr1\tbed2gff\texon\t2087434\t2087531\t0\t+\t.\tParent=CCDS41229.1\n-chr1\tbed2gff\texon\t2100957\t2101043\t0\t+\t.\tParent=CCDS41229.1\n-chr1\tbed2gff\texon\t2103494\t2103629\t0\t+\t.\tParent=CCDS41229.1\n-chr1\tbed2gff\texon\t2103740\t2103827\t0\t+\t.\tParent=CCDS41229.1\n-chr1\tbed2gff\texon\t2105336\t2105455\t0\t+\t.\tParent=CCDS41229.1\n-chr1\tbed2gff\texon\t2106193\t2106272\t0\t+\t.\tParent=CCDS41229.1\n-chr1\tbed2gff\texon\t2106663\t2106752\t0\t+\t.\tParent=CCDS41229.1\n-chr1\tbed2gff\texon\t2116022\t2116137\t0\t+\t.\tParent=CCDS41229.1\n-chr1\tbed2gff\texon\t2116361\t2116448\t0\t+\t.\tParent=CCDS41229.1\n-chr1\tbed2gff\tgene\t2985824\t3350375\t0\t+\t.\tID=Gene:CCDS44048.1;Name=Gene:CCDS44048.1\n-chr1\tbed2gff\ttranscript\t2985824\t3350375\t0\t+\t.\tID=CCDS44048.1;Name=CCDS44048.1;Parent=Gene:CCDS44048.1\n-chr1\tbed2gff\texon\t2985824\t2985860\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3102689\t3103038\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3160651\t3160701\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3301716\t3301850\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3313055\t3313157\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3319355\t3319562\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3321303\t3321450\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3322059\t3322212\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3327948\t3329364\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3331127\t3331211\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3334392\t3334561\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3335231\t3335308\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3342145\t3342314\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3342615\t3342789\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3347436\t3347672\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3348530\t3348704\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\texon\t3350298\t3350375\t0\t+\t.\tParent=CCDS44048.1\n-chr1\tbed2gff\tgene\t2985824\t3350375\t0\t+\t.\tID=Gene:CCDS41236.1;Name=Gene:CCDS41236.1\n-chr1\tbed2gff\ttranscript\t2985824\t3350375\t0\t+\t.\tID=CCDS41236.1;Name=CCDS41236.1;Parent=Gene:CCDS41236.1\n-chr1\tbed2gff\texon\t2985824\t2985860\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3102689\t3103038\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3160651\t3160701\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3301716\t3301850\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3313055\t3313157\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3319355\t3319562\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3321303\t3321450\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3322059\t3322212\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3327948\t3329364\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3331127\t3331211\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3334392\t3334561\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3335231\t3335308\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3342145\t3342314\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3342615\t3342789\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3347436\t3347672\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3348530\t3348704\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\texon\t3350241\t3350375\t0\t+\t.\tParent=CCDS41236.1\n-chr1\tbed2gff\tgene\t6285140\t6295971\t0\t-\t.\tID=Gene:CCDS61.1;Name=Gene:CCDS61.1\n-chr1\tbed2gff\ttranscript\t6285140\t6295971\t0\t-\t.\tID=CCDS61.1;Name=CCDS61.1;Parent=Gene:CCDS61.1\n-chr1\tbed2gff\texon\t6285140\t6285322\t0\t-\t.\tParent=CCDS61.1\n-chr1\tbed2gff\texon\t6291962\t6292179\t0\t-\t.\tParent=CCDS61.1\n-chr1\tbed2gff\texon\t6293534\t6293703\t0\t-\t.\tParent=CCDS61.1\n-chr1\tbed2gff\texon\t6294946\t6295034\t0\t-\t.\tParent=CCDS61.1\n-chr1\tbed2gff\texon\t6295777\t6295971\t0\t-\t.\tParent=CCDS61.1\n'
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/hs_2009.bed
--- a/fml_gff_converter_programs/t/hs_2009.bed Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
@@ -1,58 +0,0 @@
-chr1 11873 14409 uc001aaa.3 0 + 11873 11873 0 3 354,109,1189, 0,739,1347,
-chr1 11873 14409 uc010nxq.1 0 + 12189 13639 0 3 354,127,1007, 0,721,1529,
-chr1 11873 14409 uc010nxr.1 0 + 11873 11873 0 3 354,52,1189, 0,772,1347,
-chr1 14362 16765 uc009vis.2 0 - 14362 14362 0 4 467,69,147,159, 0,607,1433,2244,
-chr1 16857 17751 uc009vjc.1 0 - 16857 16857 0 2 198,519, 0,375,
-chr1 15795 18061 uc009vjd.2 0 - 15795 15795 0 5 152,159,198,136,456, 0,811,1062,1437,1810,
-chr1 14362 19759 uc009vit.2 0 - 14362 14362 0 9 467,69,152,159,198,510,147,99,847, 0,607,1433,2244,2495,2870,3552,3905,4550,
-chr1 14362 19759 uc001aae.3 0 - 14362 14362 0 10 467,69,152,159,198,136,137,147,99,847, 0,607,1433,2244,2495,2870,3243,3552,3905,4550,
-chr1 14362 19759 uc009viu.2 0 - 14362 14362 0 10 467,69,152,159,198,510,147,102,54,847, 0,607,1433,2244,2495,2870,3552,3905,4138,4550,
-chr1 16857 19759 uc001aai.1 0 - 16857 16857 0 6 198,136,137,147,112,847, 0,375,748,1057,1410,2055,
-chr1 14362 24901 uc001aab.3 0 - 14362 14362 0 10 467,69,152,159,202,136,137,147,112,164, 0,607,1433,2244,2491,2870,3243,3552,3905,10375,
-chr1 14362 29370 uc001aah.3 0 - 14362 14362 0 11 467,69,152,159,198,136,137,147,99,154,50, 0,607,1433,2244,2495,2870,3243,3552,3905,10375,14958,
-chr1 14362 29370 uc009vir.2 0 - 14362 14362 0 10 467,69,152,159,198,510,147,99,154,50, 0,607,1433,2244,2495,2870,3552,3905,10375,14958,
-chr1 14362 29370 uc009viq.2 0 - 14362 14362 0 7 467,152,159,198,456,154,50, 0,1433,2244,2495,3243,10375,14958,
-chr1 14362 29370 uc001aac.3 0 - 14362 14362 0 11 467,69,152,159,198,110,137,147,102,154,50, 0,607,1433,2244,2495,2896,3243,3552,3905,10375,14958,
-chr1 14406 29370 uc009viv.2 0 - 14406 14406 0 7 2359,198,136,137,147,154,50, 0,2451,2826,3199,3508,10331,14914,
-chr1 14406 29370 uc009viw.2 0 - 14406 14406 0 7 2359,198,510,147,99,154,50, 0,2451,2826,3508,3861,10331,14914,
-chr1 15602 29370 uc009vix.2 0 - 15602 15602 0 7 345,159,198,136,147,154,50, 0,1004,1255,1630,2312,9135,13718,
-chr1 16606 29370 uc009viz.2 0 - 16606 16606 0 8 159,202,136,137,147,112,154,50, 0,247,626,999,1308,1661,8131,12714,
-chr1 16606 29370 uc009viy.2 0 - 16606 16606 0 9 159,198,136,137,147,95,58,154,50, 0,251,626,999,1308,1661,1890,8131,12714,
-chr1 16857 29370 uc010nxs.1 0 - 16857 16857 0 8 198,136,137,147,99,227,154,50, 0,375,748,1057,1410,2055,7880,12463,
-chr1 17232 29370 uc009vje.2 0 - 17232 17232 0 4 510,147,99,50, 0,682,1035,12088,
-chr1 17605 29370 uc009vjf.2 0 - 17605 17605 0 7 137,147,95,58,227,154,50, 0,309,662,891,1307,7132,11715,
-chr1 16857 29961 uc009vjb.1 0 - 16857 16857 0 7 198,136,137,147,112,154,138, 0,375,748,1057,1410,7880,12966,
-chr1 34611 36081 uc001aak.2 0 - 34611 34611 0 3 563,205,361, 0,665,1109,
-chr1 69090 70008 uc001aal.1 0 + 69090 70008 0 1 918, 0,
-chr1 137838 139228 uc001aam.3 0 - 137838 137838 0 1 1390, 0,
-chr1 89294 237877 uc010nxt.1 0 - 89294 89294 0 2 1110,1263, 0,147320,
-chr1 321083 321114 uc001aaq.1 0 + 321083 321083 0 1 31, 0,
-chr1 321145 321223 uc001aar.1 0 + 321145 321145 0 1 78, 0,
-chr1 322036 326938 uc009vjk.2 0 + 324342 325605 0 3 192,58,2500, 0,2251,2402,
-chr1 323891 328580 uc001aav.3 0 + 323891 323891 0 4 169,58,2500,1545, 0,396,547,3144,
-chr1 323891 328580 uc001aau.2 0 + 324342 325605 0 3 169,58,4142, 0,396,547,
-chr1 367658 368595 uc010nxu.1 0 + 367658 368594 0 1 937, 0,
-chr1 420205 421839 uc001aax.1 0 + 420205 420205 0 3 91,267,444, 0,786,1190,
-chr1 566461 568045 uc001aaz.2 0 + 566461 566461 0 1 1584, 0,
-chr1 568148 568842 uc001aba.1 0 + 568148 568148 0 1 694, 0,
-chr1 568843 568912 uc001abb.2 0 + 568843 568843 0 1 69, 0,
-chr1 569326 570349 uc001abc.2 0 + 569326 569326 0 1 1023, 0,
-chr1 621097 622034 uc010nxv.1 0 - 621098 622034 0 1 937, 0,
-chr1 661139 665731 uc001abe.3 0 - 664484 665108 0 3 4045,58,169, 0,4138,4423,
-chr1 668401 668479 uc001abi.1 0 - 668401 668401 0 1 78, 0,
-chr1 668510 668541 uc001abj.2 0 - 668510 668510 0 1 31, 0,
-chr1 661139 670994 uc009vjm.2 0 - 664484 665108 0 3 4045,58,192, 0,7547,9663,
-chr1 671807 671885 uc010nxw.1 0 - 671807 671807 0 1 78, 0,
-chr1 671916 671947 uc001abl.2 0 - 671916 671916 0 1 31, 0,
-chr1 661139 679736 uc002khh.2 0 - 661139 661139 0 7 4045,58,191,233,58,65,162, 0,4138,6257,14043,14369,17526,18435,
-chr1 674239 679736 uc001abm.2 0 - 674239 674239 0 5 165,233,58,65,162, 0,943,1269,4426,5335,
-chr1 700236 714006 uc001abo.2 0 - 700236 700236 0 7 391,59,66,216,132,110,343, 0,1472,3691,4640,8119,9314,13427,
-chr1 761586 762902 uc010nxx.1 0 - 762079 762571 0 1 1316, 0,
-chr1 763063 788902 uc009vjn.1 0 + 763063 763063 0 4 92,102,184,132, 0,1319,24243,25707,
-chr1 763063 788997 uc001abp.1 0 + 763063 763063 0 6 92,102,153,184,96,227, 0,1319,19970,24243,24987,25707,
-chr1 763063 788997 uc001abq.1 0 + 763063 763063 0 5 92,102,184,96,227, 0,1319,24243,24987,25707,
-chr1 763063 788997 uc009vjo.1 0 + 763063 763063 0 4 92,102,96,227, 0,1319,24987,25707,
-chr1 763063 789740 uc001abr.1 0 + 763063 763063 0 7 92,102,153,184,96,132,784, 0,1319,19970,24243,24987,25707,25893,
-chr1 791897 794579 uc001abs.2 0 + 791897 791897 0 1 2682, 0,
-chr1 803452 812182 uc001abt.3 0 - 803452 803452 0 3 603,1044,57, 0,6039,8673,
-chr1 846814 850328 uc001abu.1 0 + 846814 846814 0 2 39,3004, 0,510,
b
diff -r c6cf723c2ed4 -r db3c67b03d55 fml_gff_converter_programs/t/hs_2009.gff3
--- a/fml_gff_converter_programs/t/hs_2009.gff3 Tue Jun 10 19:57:56 2014 -0400
+++ /dev/null Thu Jan 01 00:00:00 1970 +0000
b
b'@@ -1,375 +0,0 @@\n-##gff-version 3\n-chr1\tbed2gff\tgene\t11874\t14409\t0\t+\t.\tID=Gene:uc001aaa.3;Name=Gene:uc001aaa.3\n-chr1\tbed2gff\ttranscript\t11874\t14409\t0\t+\t.\tID=uc001aaa.3;Name=uc001aaa.3;Parent=Gene:uc001aaa.3\n-chr1\tbed2gff\texon\t11874\t12227\t0\t+\t.\tParent=uc001aaa.3\n-chr1\tbed2gff\texon\t12613\t12721\t0\t+\t.\tParent=uc001aaa.3\n-chr1\tbed2gff\texon\t13221\t14409\t0\t+\t.\tParent=uc001aaa.3\n-chr1\tbed2gff\tgene\t11874\t14409\t0\t+\t.\tID=Gene:uc010nxq.1;Name=Gene:uc010nxq.1\n-chr1\tbed2gff\ttranscript\t11874\t14409\t0\t+\t.\tID=uc010nxq.1;Name=uc010nxq.1;Parent=Gene:uc010nxq.1\n-chr1\tbed2gff\texon\t11874\t12227\t0\t+\t.\tParent=uc010nxq.1\n-chr1\tbed2gff\texon\t12595\t12721\t0\t+\t.\tParent=uc010nxq.1\n-chr1\tbed2gff\texon\t13403\t14409\t0\t+\t.\tParent=uc010nxq.1\n-chr1\tbed2gff\tgene\t11874\t14409\t0\t+\t.\tID=Gene:uc010nxr.1;Name=Gene:uc010nxr.1\n-chr1\tbed2gff\ttranscript\t11874\t14409\t0\t+\t.\tID=uc010nxr.1;Name=uc010nxr.1;Parent=Gene:uc010nxr.1\n-chr1\tbed2gff\texon\t11874\t12227\t0\t+\t.\tParent=uc010nxr.1\n-chr1\tbed2gff\texon\t12646\t12697\t0\t+\t.\tParent=uc010nxr.1\n-chr1\tbed2gff\texon\t13221\t14409\t0\t+\t.\tParent=uc010nxr.1\n-chr1\tbed2gff\tgene\t14363\t16765\t0\t-\t.\tID=Gene:uc009vis.2;Name=Gene:uc009vis.2\n-chr1\tbed2gff\ttranscript\t14363\t16765\t0\t-\t.\tID=uc009vis.2;Name=uc009vis.2;Parent=Gene:uc009vis.2\n-chr1\tbed2gff\texon\t14363\t14829\t0\t-\t.\tParent=uc009vis.2\n-chr1\tbed2gff\texon\t14970\t15038\t0\t-\t.\tParent=uc009vis.2\n-chr1\tbed2gff\texon\t15796\t15942\t0\t-\t.\tParent=uc009vis.2\n-chr1\tbed2gff\texon\t16607\t16765\t0\t-\t.\tParent=uc009vis.2\n-chr1\tbed2gff\tgene\t16858\t17751\t0\t-\t.\tID=Gene:uc009vjc.1;Name=Gene:uc009vjc.1\n-chr1\tbed2gff\ttranscript\t16858\t17751\t0\t-\t.\tID=uc009vjc.1;Name=uc009vjc.1;Parent=Gene:uc009vjc.1\n-chr1\tbed2gff\texon\t16858\t17055\t0\t-\t.\tParent=uc009vjc.1\n-chr1\tbed2gff\texon\t17233\t17751\t0\t-\t.\tParent=uc009vjc.1\n-chr1\tbed2gff\tgene\t15796\t18061\t0\t-\t.\tID=Gene:uc009vjd.2;Name=Gene:uc009vjd.2\n-chr1\tbed2gff\ttranscript\t15796\t18061\t0\t-\t.\tID=uc009vjd.2;Name=uc009vjd.2;Parent=Gene:uc009vjd.2\n-chr1\tbed2gff\texon\t15796\t15947\t0\t-\t.\tParent=uc009vjd.2\n-chr1\tbed2gff\texon\t16607\t16765\t0\t-\t.\tParent=uc009vjd.2\n-chr1\tbed2gff\texon\t16858\t17055\t0\t-\t.\tParent=uc009vjd.2\n-chr1\tbed2gff\texon\t17233\t17368\t0\t-\t.\tParent=uc009vjd.2\n-chr1\tbed2gff\texon\t17606\t18061\t0\t-\t.\tParent=uc009vjd.2\n-chr1\tbed2gff\tgene\t14363\t19759\t0\t-\t.\tID=Gene:uc009vit.2;Name=Gene:uc009vit.2\n-chr1\tbed2gff\ttranscript\t14363\t19759\t0\t-\t.\tID=uc009vit.2;Name=uc009vit.2;Parent=Gene:uc009vit.2\n-chr1\tbed2gff\texon\t14363\t14829\t0\t-\t.\tParent=uc009vit.2\n-chr1\tbed2gff\texon\t14970\t15038\t0\t-\t.\tParent=uc009vit.2\n-chr1\tbed2gff\texon\t15796\t15947\t0\t-\t.\tParent=uc009vit.2\n-chr1\tbed2gff\texon\t16607\t16765\t0\t-\t.\tParent=uc009vit.2\n-chr1\tbed2gff\texon\t16858\t17055\t0\t-\t.\tParent=uc009vit.2\n-chr1\tbed2gff\texon\t17233\t17742\t0\t-\t.\tParent=uc009vit.2\n-chr1\tbed2gff\texon\t17915\t18061\t0\t-\t.\tParent=uc009vit.2\n-chr1\tbed2gff\texon\t18268\t18366\t0\t-\t.\tParent=uc009vit.2\n-chr1\tbed2gff\texon\t18913\t19759\t0\t-\t.\tParent=uc009vit.2\n-chr1\tbed2gff\tgene\t14363\t19759\t0\t-\t.\tID=Gene:uc001aae.3;Name=Gene:uc001aae.3\n-chr1\tbed2gff\ttranscript\t14363\t19759\t0\t-\t.\tID=uc001aae.3;Name=uc001aae.3;Parent=Gene:uc001aae.3\n-chr1\tbed2gff\texon\t14363\t14829\t0\t-\t.\tParent=uc001aae.3\n-chr1\tbed2gff\texon\t14970\t15038\t0\t-\t.\tParent=uc001aae.3\n-chr1\tbed2gff\texon\t15796\t15947\t0\t-\t.\tParent=uc001aae.3\n-chr1\tbed2gff\texon\t16607\t16765\t0\t-\t.\tParent=uc001aae.3\n-chr1\tbed2gff\texon\t16858\t17055\t0\t-\t.\tParent=uc001aae.3\n-chr1\tbed2gff\texon\t17233\t17368\t0\t-\t.\tParent=uc001aae.3\n-chr1\tbed2gff\texon\t17606\t17742\t0\t-\t.\tParent=uc001aae.3\n-chr1\tbed2gff\texon\t17915\t18061\t0\t-\t.\tParent=uc001aae.3\n-chr1\tbed2gff\texon\t18268\t18366\t0\t-\t.\tParent=uc001aae.3\n-chr1\tbed2gff\texon\t18913\t19759\t0\t-\t.\tParent=uc001aae.3\n-chr1\tbed2gff\tgene\t14363\t19759\t0\t-\t.\tID=Gene:uc009viu.2;Name=Gene:uc009viu.2\n-chr1\tbed2gff\ttranscript\t14363\t19759\t0\t-\t.\tID=uc009viu.2;Name=uc009viu.2;Parent=Gene:uc009viu.2\n-chr1\tbed2gff\texon\t14363\t14829\t0\t-\t.\tParent=uc009viu.2\n-chr1\tbed2gff\texon\t14970\t15038\t0\t-\t.\tParent=uc009viu.2\n-chr1\tbed2gff\texon\t15796\t15947\t0\t-\t.\tParent=uc009viu.2\n-chr1\tbed2gff\texon\t16607\t16765\t0\t-\t.\tParent=uc009viu.2\n-chr1\tbed2gff\texon\t16858\t17055\t0\t-\t.\tParen'..b'1abo.2;Name=Gene:uc001abo.2\n-chr1\tbed2gff\ttranscript\t700237\t714006\t0\t-\t.\tID=uc001abo.2;Name=uc001abo.2;Parent=Gene:uc001abo.2\n-chr1\tbed2gff\texon\t700237\t700627\t0\t-\t.\tParent=uc001abo.2\n-chr1\tbed2gff\texon\t701709\t701767\t0\t-\t.\tParent=uc001abo.2\n-chr1\tbed2gff\texon\t703928\t703993\t0\t-\t.\tParent=uc001abo.2\n-chr1\tbed2gff\texon\t704877\t705092\t0\t-\t.\tParent=uc001abo.2\n-chr1\tbed2gff\texon\t708356\t708487\t0\t-\t.\tParent=uc001abo.2\n-chr1\tbed2gff\texon\t709551\t709660\t0\t-\t.\tParent=uc001abo.2\n-chr1\tbed2gff\texon\t713664\t714006\t0\t-\t.\tParent=uc001abo.2\n-chr1\tbed2gff\tgene\t761587\t762902\t0\t-\t.\tID=Gene:uc010nxx.1;Name=Gene:uc010nxx.1\n-chr1\tbed2gff\ttranscript\t761587\t762902\t0\t-\t.\tID=uc010nxx.1;Name=uc010nxx.1;Parent=Gene:uc010nxx.1\n-chr1\tbed2gff\texon\t761587\t762902\t0\t-\t.\tParent=uc010nxx.1\n-chr1\tbed2gff\tgene\t763064\t788902\t0\t+\t.\tID=Gene:uc009vjn.1;Name=Gene:uc009vjn.1\n-chr1\tbed2gff\ttranscript\t763064\t788902\t0\t+\t.\tID=uc009vjn.1;Name=uc009vjn.1;Parent=Gene:uc009vjn.1\n-chr1\tbed2gff\texon\t763064\t763155\t0\t+\t.\tParent=uc009vjn.1\n-chr1\tbed2gff\texon\t764383\t764484\t0\t+\t.\tParent=uc009vjn.1\n-chr1\tbed2gff\texon\t787307\t787490\t0\t+\t.\tParent=uc009vjn.1\n-chr1\tbed2gff\texon\t788771\t788902\t0\t+\t.\tParent=uc009vjn.1\n-chr1\tbed2gff\tgene\t763064\t788997\t0\t+\t.\tID=Gene:uc001abp.1;Name=Gene:uc001abp.1\n-chr1\tbed2gff\ttranscript\t763064\t788997\t0\t+\t.\tID=uc001abp.1;Name=uc001abp.1;Parent=Gene:uc001abp.1\n-chr1\tbed2gff\texon\t763064\t763155\t0\t+\t.\tParent=uc001abp.1\n-chr1\tbed2gff\texon\t764383\t764484\t0\t+\t.\tParent=uc001abp.1\n-chr1\tbed2gff\texon\t783034\t783186\t0\t+\t.\tParent=uc001abp.1\n-chr1\tbed2gff\texon\t787307\t787490\t0\t+\t.\tParent=uc001abp.1\n-chr1\tbed2gff\texon\t788051\t788146\t0\t+\t.\tParent=uc001abp.1\n-chr1\tbed2gff\texon\t788771\t788997\t0\t+\t.\tParent=uc001abp.1\n-chr1\tbed2gff\tgene\t763064\t788997\t0\t+\t.\tID=Gene:uc001abq.1;Name=Gene:uc001abq.1\n-chr1\tbed2gff\ttranscript\t763064\t788997\t0\t+\t.\tID=uc001abq.1;Name=uc001abq.1;Parent=Gene:uc001abq.1\n-chr1\tbed2gff\texon\t763064\t763155\t0\t+\t.\tParent=uc001abq.1\n-chr1\tbed2gff\texon\t764383\t764484\t0\t+\t.\tParent=uc001abq.1\n-chr1\tbed2gff\texon\t787307\t787490\t0\t+\t.\tParent=uc001abq.1\n-chr1\tbed2gff\texon\t788051\t788146\t0\t+\t.\tParent=uc001abq.1\n-chr1\tbed2gff\texon\t788771\t788997\t0\t+\t.\tParent=uc001abq.1\n-chr1\tbed2gff\tgene\t763064\t788997\t0\t+\t.\tID=Gene:uc009vjo.1;Name=Gene:uc009vjo.1\n-chr1\tbed2gff\ttranscript\t763064\t788997\t0\t+\t.\tID=uc009vjo.1;Name=uc009vjo.1;Parent=Gene:uc009vjo.1\n-chr1\tbed2gff\texon\t763064\t763155\t0\t+\t.\tParent=uc009vjo.1\n-chr1\tbed2gff\texon\t764383\t764484\t0\t+\t.\tParent=uc009vjo.1\n-chr1\tbed2gff\texon\t788051\t788146\t0\t+\t.\tParent=uc009vjo.1\n-chr1\tbed2gff\texon\t788771\t788997\t0\t+\t.\tParent=uc009vjo.1\n-chr1\tbed2gff\tgene\t763064\t789740\t0\t+\t.\tID=Gene:uc001abr.1;Name=Gene:uc001abr.1\n-chr1\tbed2gff\ttranscript\t763064\t789740\t0\t+\t.\tID=uc001abr.1;Name=uc001abr.1;Parent=Gene:uc001abr.1\n-chr1\tbed2gff\texon\t763064\t763155\t0\t+\t.\tParent=uc001abr.1\n-chr1\tbed2gff\texon\t764383\t764484\t0\t+\t.\tParent=uc001abr.1\n-chr1\tbed2gff\texon\t783034\t783186\t0\t+\t.\tParent=uc001abr.1\n-chr1\tbed2gff\texon\t787307\t787490\t0\t+\t.\tParent=uc001abr.1\n-chr1\tbed2gff\texon\t788051\t788146\t0\t+\t.\tParent=uc001abr.1\n-chr1\tbed2gff\texon\t788771\t788902\t0\t+\t.\tParent=uc001abr.1\n-chr1\tbed2gff\texon\t788957\t789740\t0\t+\t.\tParent=uc001abr.1\n-chr1\tbed2gff\tgene\t791898\t794579\t0\t+\t.\tID=Gene:uc001abs.2;Name=Gene:uc001abs.2\n-chr1\tbed2gff\ttranscript\t791898\t794579\t0\t+\t.\tID=uc001abs.2;Name=uc001abs.2;Parent=Gene:uc001abs.2\n-chr1\tbed2gff\texon\t791898\t794579\t0\t+\t.\tParent=uc001abs.2\n-chr1\tbed2gff\tgene\t803453\t812182\t0\t-\t.\tID=Gene:uc001abt.3;Name=Gene:uc001abt.3\n-chr1\tbed2gff\ttranscript\t803453\t812182\t0\t-\t.\tID=uc001abt.3;Name=uc001abt.3;Parent=Gene:uc001abt.3\n-chr1\tbed2gff\texon\t803453\t804055\t0\t-\t.\tParent=uc001abt.3\n-chr1\tbed2gff\texon\t809492\t810535\t0\t-\t.\tParent=uc001abt.3\n-chr1\tbed2gff\texon\t812126\t812182\t0\t-\t.\tParent=uc001abt.3\n-chr1\tbed2gff\tgene\t846815\t850328\t0\t+\t.\tID=Gene:uc001abu.1;Name=Gene:uc001abu.1\n-chr1\tbed2gff\ttranscript\t846815\t850328\t0\t+\t.\tID=uc001abu.1;Name=uc001abu.1;Parent=Gene:uc001abu.1\n-chr1\tbed2gff\texon\t846815\t846853\t0\t+\t.\tParent=uc001abu.1\n-chr1\tbed2gff\texon\t847325\t850328\t0\t+\t.\tParent=uc001abu.1\n'