Galaxy | Tool Preview

CTAT Genome Resource Libraries Data Manager (version 2.0.0)
Select a CTAT Genome Resource Library to Download.

Retrieve, and/or specify the location of, a CTAT Genome Resource Library.

There are three options: 1) Download from CTAT and build if needed; 2) Build library from local source data; 3) Specify location of built library.

  1. Download from CTAT and build if needed: This option uses the files at https://data.broadinstitute.org/Trinity/CTAT_RESOURCE_LIB/ as selectors for the user to choose among. Specify the Full Path of the location where the CTAT Genome Resource Library should be placed. The other options should be self-explanatory.
  2. Build library from local source data: This assumes that you have already downloaded the source data for a build and you want to build it.
  3. Specify location of built library: This assumes you already have a build library and need tell galaxy where it is.

All options allow the user to do a gmap_build on the library and also to integrate ctat-mutation resources into the library.

You will need approximately 62GB of space for a human genome resource library, once it is built, but if downloading and building, to be safe provide at least 75GB.

The installation of this tool takes some time, due to building a conda environment for the dependencies. The "source_data" files download faster, but then must be built. Building the library from the "source_data" files can take many hours, depending on the resources of your machine. The "plug-n-play" can take considerable time to download, depending on your internet connection. Even with high speed, it is about 25GB that is transfered, so plan accordingly. If you have a good speed internet connection, downloading the plug-n-play will usually be faster than building.

If a download or a build is interrupted, re-running the job should pick up where it left off.

Neither the "source_data" nor the "plug-n-play" versions have had their gmap index built. If you are not going to be using gmap_fusion, then you can uncheck the gmap_build check box and save the space and time building the index consumes.

Neither the "source_data" nor the "plug-n-play" versions have Mutation Resources included. Those must be downloaded separately and integrated into the Library. If you are going to be using the ctat_mutations tool, check the Download Mutation Library check box. Whether or not you check this box, the ctat_cravat_tissues table needed by the ctat_mutations tool will be created.

In order to integrate the Mutation Resources into a CTAT Genome Resource Library, you must have previously downloaded COSMIC resources (See Step 2 from https://github.com/NCIP/ctat-mutations/tree/master/mutation_lib_prep ) You can place them directly into the Genome Resource Library location, or if the Library is not built yet, or you do not know the full path to it, specify the directory where the COSMIC files are, so they can be integrated into the Library.

The Mouse genome is not currently supported by ctat_mutations.

If the Reference Genome Display Name is left empty a name will be created, but any text that will best guide the user can be entered here. It will be the text that is used for selecting the library in pull down lists requiring a Genome Reference Library resource (These are stored in the ctat_genome_resource_libs table).

For more information on CTAT Genome Resource Libraries, see https://github.com/FusionFilter/FusionFilter/wiki