comparison iedb_api.xml @ 2:883cdf0ffae5 draft

"planemo upload for repository https://github.com/jj-umn/galaxytools/tree/master/iedb_api commit 8e90bc55d0f086038b1665521fd992bed7aaa8ca"
author jjohnson
date Wed, 26 Feb 2020 15:02:37 -0500
parents 4a89ba6cfc63
children 153d5fa7af53
comparison
equal deleted inserted replaced
1:4a89ba6cfc63 2:883cdf0ffae5
399 399
400 **OUTPUTS** 400 **OUTPUTS**
401 401
402 A tabular file containing the results returned from the IEDB web service 402 A tabular file containing the results returned from the IEDB web service
403 403
404 **Typical Workflow for Human MHC I Binding Prediction**
405
406 The RNAseq data for the subject would be used for:
407
408 - HLA prediction by seq2HLA_
409 - Novel Antigen Prediction by a variety of workflows to generate a Antigen peptide fasta
410
411 .. _seq2HLA: https://toolshed.g2.bx.psu.edu/view/iuc/seq2hla/52bba1cd3823
412
413 .. image:: $PATH_TO_IMAGES/IEDB_Workflow_QueryTabular.png
414 :width: 584
415 :height: 430
416
417 .. note:: The seq2HLA ClassI.HLAgenotype4digits output needs to be converted for IEDB alleles.
418
419 The seq2HLA ClassI.HLAgenotype4digits output:
420
421 .. image:: $PATH_TO_IMAGES/seq2HLA_ClassI.HLAgenotype4digits.png
422 :width: 285
423 :height: 77
424
425 Needs to be converted into IEDB formatted alleles:
426
427 .. image:: $PATH_TO_IMAGES/IEDB_formatted_alleles.png
428 :width: 74
429 :height: 81
430
431 In the workflow above QueryTabular_ tool converts the alleles:
432
433 - Filter Dataset Input
434
435 * skip leading lines - *skip lines:* 1
436 * select columns - *columns:* 2,4
437 * regex replace value in column - *column:* 1 *regex pattern:* ^(\\w+[*]\\d\\d:\\d\\d\\d?).*$ *replacement expression:* HLA-\\1
438 * regex replace value in column - *column:* 2 *regex pattern:* ^(\\w+[*]\\d\\d:\\d\\d\\d?).*$ *replacement expression:* HLA-\\1
439
440 - SQL Query to generate tabular output
441
442 * SELECT c1 FROM t1 UNION SELECT c2 FROM t1
443
444 .. _QueryTabular: https://toolshed.g2.bx.psu.edu/view/iuc/query_tabular/0c95a3f1654f
445
446
447 The IEDB formatting can also be performed by TextProcessing_ tools:
448
449 .. _TextProcessing: https://toolshed.g2.bx.psu.edu/view/bgruening/text_processing/0a8c6b61f0f4
450
451 .. image:: $PATH_TO_IMAGES/TextProcessingConversion.png
452 :width: 608
453 :height: 87
454
455 The TextProcessing steps to convert the alleles:
456
457 - Remove beginning - removes the header line
458 - Replace Text - picks Allele 1 and Allele 2 from each line and reformats each on a separate line
459
460 * *Find pattern:* ^.*\\t([a-zA-Z]+[*][0-9]{2}:[0-9]{2,3}).*\\t.*\\t([a-zA-Z]+[*][0-9]{2}:[0-9]{2,3}).*\\t.*$
461 * *Replace with:* HLA-\\1\\nHLA-\\2
462
463 - Unique - remove duplicates
464
465
404 ]]></help> 466 ]]></help>
405 <citations> 467 <citations>
406 <citation type="doi">10.1093/nar/gku938</citation> 468 <citation type="doi">10.1093/nar/gku938</citation>
407 </citations> 469 </citations>
408 </tool> 470 </tool>