Mercurial > repos > bimib > cobraxy
comparison COBRAxy/docs/getting-started.md @ 492:4ed95023af20 draft
Uploaded
| author | francesco_lapi |
|---|---|
| date | Tue, 30 Sep 2025 14:02:17 +0000 |
| parents | |
| children | fcdbc81feb45 |
comparison
equal
deleted
inserted
replaced
| 491:7a413a5ec566 | 492:4ed95023af20 |
|---|---|
| 1 # Getting Started | |
| 2 | |
| 3 Welcome to COBRAxy! This guide will help you get up and running with metabolic flux analysis. | |
| 4 | |
| 5 ## What is COBRAxy? | |
| 6 | |
| 7 COBRAxy is a comprehensive toolkit for metabolic flux analysis that bridges the gap between omics data and biological insights. It provides: | |
| 8 | |
| 9 - **Data Integration**: Combine gene expression and metabolite data | |
| 10 - **Metabolic Modeling**: Use constraint-based models for flux analysis | |
| 11 - **Visualization**: Generate interactive pathway maps | |
| 12 - **Statistical Analysis**: Perform enrichment and sensitivity analysis | |
| 13 | |
| 14 ## Core Concepts | |
| 15 | |
| 16 ### Reaction Activity Scores (RAS) | |
| 17 RAS quantify how active metabolic reactions are based on gene expression data. COBRAxy computes RAS by: | |
| 18 1. Mapping genes to reactions via GPR (Gene-Protein-Reaction) rules | |
| 19 2. Applying logical operations (AND/OR) based on enzyme complexes | |
| 20 3. Producing activity scores for each reaction in each sample | |
| 21 | |
| 22 ### Reaction Propensity Scores (RPS) | |
| 23 RPS indicate metabolic preferences based on metabolite abundance: | |
| 24 1. Map metabolites to reactions as substrates/products | |
| 25 2. Weight by stoichiometry and frequency | |
| 26 3. Compute propensity scores using log-normalized formulas | |
| 27 | |
| 28 ### Flux Sampling | |
| 29 Sample feasible flux distributions using: | |
| 30 - **CBS (Coordinate Hit-and-Run with Rounding)**: Fast, uniform sampling | |
| 31 - **OptGP (Optimal Growth Parallel)**: Growth-optimized sampling | |
| 32 | |
| 33 ## Analysis Workflows | |
| 34 | |
| 35 COBRAxy supports two main analysis paths: | |
| 36 | |
| 37 ### 1. Enrichment Analysis Workflow | |
| 38 ```bash | |
| 39 # Generate activity scores | |
| 40 ras_generator → RAS values | |
| 41 rps_generator → RPS values | |
| 42 | |
| 43 # Statistical enrichment analysis | |
| 44 marea → Enriched pathway maps | |
| 45 ``` | |
| 46 | |
| 47 **Use when**: You want to identify significantly altered pathways and create publication-ready maps. | |
| 48 | |
| 49 ### 2. Flux Simulation Workflow | |
| 50 ```bash | |
| 51 # Apply constraints to model | |
| 52 ras_generator → RAS values | |
| 53 ras_to_bounds → Constrained model | |
| 54 | |
| 55 # Sample flux distributions | |
| 56 flux_simulation → Flux samples | |
| 57 flux_to_map → Final visualizations | |
| 58 ``` | |
| 59 | |
| 60 **Use when**: You want to predict metabolic flux distributions and study network-wide changes. | |
| 61 | |
| 62 ## Your First Analysis | |
| 63 | |
| 64 Let's run a basic analysis with sample data: | |
| 65 | |
| 66 ### Step 1: Prepare Your Data | |
| 67 | |
| 68 You'll need: | |
| 69 - **Gene expression data**: TSV file with genes (rows) × samples (columns) | |
| 70 - **Metabolic model**: SBML file or use built-in models (ENGRO2, Recon) | |
| 71 - **Metabolite data** (optional): TSV file with metabolites (rows) × samples (columns) | |
| 72 | |
| 73 ### Step 2: Generate Activity Scores | |
| 74 | |
| 75 ```bash | |
| 76 # Generate RAS from expression data | |
| 77 ras_generator -td $(pwd) \ | |
| 78 -in expression_data.tsv \ | |
| 79 -ra ras_output.tsv \ | |
| 80 -rs ENGRO2 | |
| 81 ``` | |
| 82 | |
| 83 ### Step 3: Create Pathway Maps | |
| 84 | |
| 85 ```bash | |
| 86 # Generate enriched pathway maps | |
| 87 marea -td $(pwd) \ | |
| 88 -using_RAS true \ | |
| 89 -input_data ras_output.tsv \ | |
| 90 -choice_map ENGRO2 \ | |
| 91 -gs true \ | |
| 92 -idop pathway_maps | |
| 93 ``` | |
| 94 | |
| 95 ### Step 4: View Results | |
| 96 | |
| 97 Your analysis will generate: | |
| 98 - **RAS values**: `ras_output.tsv` - Activity scores for each reaction | |
| 99 - **Statistical maps**: `pathway_maps/` - SVG files with enrichment visualization | |
| 100 - **Log files**: Detailed execution logs for troubleshooting | |
| 101 | |
| 102 ## Built-in Models | |
| 103 | |
| 104 COBRAxy includes ready-to-use metabolic models: | |
| 105 | |
| 106 | Model | Organism | Reactions | Genes | Description | | |
| 107 |-------|----------|-----------|-------|-------------| | |
| 108 | **ENGRO2** | Human | ~2,000 | ~500 | Focused human metabolism model | | |
| 109 | **Recon** | Human | ~10,000 | ~2,000 | Comprehensive human metabolism | | |
| 110 | |
| 111 Models are stored in the `local/` directory and include: | |
| 112 - SBML files | |
| 113 - GPR rules | |
| 114 - Gene mapping tables | |
| 115 - Pathway templates | |
| 116 | |
| 117 ## Data Formats | |
| 118 | |
| 119 ### Gene Expression Format | |
| 120 ```tsv | |
| 121 Gene_ID Sample_1 Sample_2 Sample_3 | |
| 122 HGNC:5 12.5 8.3 15.7 | |
| 123 HGNC:10 3.2 4.1 2.8 | |
| 124 HGNC:15 7.9 11.2 6.4 | |
| 125 ``` | |
| 126 | |
| 127 ### Metabolite Format | |
| 128 ```tsv | |
| 129 Metabolite_ID Sample_1 Sample_2 Sample_3 | |
| 130 glucose 100.5 85.3 120.7 | |
| 131 pyruvate 45.2 38.1 52.8 | |
| 132 lactate 23.9 41.2 19.4 | |
| 133 ``` | |
| 134 | |
| 135 ## Command Line vs Python API | |
| 136 | |
| 137 COBRAxy offers two usage modes: | |
| 138 | |
| 139 ### Command Line (Quick Analysis) | |
| 140 ```bash | |
| 141 # Simple command-line execution | |
| 142 ras_generator -td $(pwd) -in data.tsv -ra output.tsv -rs ENGRO2 | |
| 143 ``` | |
| 144 | |
| 145 ### Python API (Programming) | |
| 146 ```python | |
| 147 import ras_generator | |
| 148 # Call main function with arguments | |
| 149 ras_generator.main(['-td', '/path', '-in', 'data.tsv', '-ra', 'output.tsv', '-rs', 'ENGRO2']) | |
| 150 ``` | |
| 151 | |
| 152 ## Next Steps | |
| 153 | |
| 154 Now that you understand the basics: | |
| 155 | |
| 156 1. **[Quick Start Guide](quickstart.md)** - Complete walkthrough with example data | |
| 157 2. **[Python API Tutorial](tutorials/python-api.md)** - Learn programmatic usage | |
| 158 3. **[Tools Reference](tools/)** - Detailed documentation for each tool | |
| 159 4. **[Examples](examples/)** - Real-world analysis examples | |
| 160 | |
| 161 ## Need Help? | |
| 162 | |
| 163 - **[Troubleshooting](troubleshooting.md)** - Common issues and solutions | |
| 164 - **[GitHub Issues](https://github.com/CompBtBs/COBRAxy/issues)** - Report bugs or ask questions | |
| 165 - **[Contributing](contributing.md)** - Help improve COBRAxy |
