Mercurial > repos > ecology > harmonize_insitu_to_netcdf
comparison bgc_harmonizer.xml @ 0:15260949227d draft default tip
planemo upload for repository https://github.com/galaxyecology/tools-ecology/tree/master/tools/bgc_ocean commit 124fc5205762aaf4f7b39b09a85b5b3fb918370d
author | ecology |
---|---|
date | Thu, 05 Dec 2024 14:16:02 +0000 |
parents | |
children |
comparison
equal
deleted
inserted
replaced
-1:000000000000 | 0:15260949227d |
---|---|
1 <tool id="harmonize_insitu_to_netcdf" name="QCV harmonizer" version="@TOOL_VERSION@+galaxy@VERSION_SUFFIX@" profile="22.01" license="MIT"> | |
2 <description>and aggregator of insitu marine physical and biogeochemical data</description> | |
3 <macros> | |
4 <token name="@TOOL_VERSION@">1.1</token> | |
5 <token name="@VERSION_SUFFIX@">0</token> | |
6 </macros> | |
7 <requirements> | |
8 <container type="docker">pokapok/qcv_ingester:@TOOL_VERSION@</container> | |
9 </requirements> | |
10 <command detect_errors="exit_code"><![CDATA[ | |
11 export HOME=\$PWD && | |
12 | |
13 #for $i, $infile in enumerate($infiles): | |
14 cp '$infile' '/runtime/data/original_data/work/ga_la_xy/${infile.element_identifier}' && | |
15 #end for | |
16 | |
17 /app/launchers/start-app.sh GALAXY && | |
18 | |
19 cp /runtime/data/harmonized_data/work/ga_la_xy_harm_agg.nc '$output_net' | |
20 ]]></command> | |
21 <inputs> | |
22 <param name="infiles" type="data" format="netcdf" multiple="true" label="Input the netcdf data files" help="This files can netcdf raw Argo or Gliders datafiles following CMEMS convention."/> | |
23 </inputs> | |
24 <outputs> | |
25 <data name="output_net" format="netcdf" from_work_dir="/runtime/data/harmonized_data/work/*.nc" label="${tool.name} netcdf data" /> | |
26 </outputs> | |
27 <tests> | |
28 <test expect_num_outputs="1"> | |
29 <param name="infiles" value="D6901758_001.nc,D6901758_002.nc,D6901758_003.nc,D6901758_004.nc,D6901758_005.nc"/> | |
30 <output name="output_net"> | |
31 <assert_contents> | |
32 <has_size value="427535" delta="0"/> | |
33 </assert_contents> | |
34 </output> | |
35 </test> | |
36 </tests> | |
37 <help><![CDATA[ | |
38 | |
39 .. class:: infomark | |
40 | |
41 **What it does** | |
42 General presentation | |
43 | |
44 The cerb_harmonizer tool aggregates and harmonizes marine in-situ data following the needs of the UseCase 2.1-BCG - Fair-Ease. It converts files of individual or already aggregated data profiles into concatened single file with harmonized vocabullary needed for the project. | |
45 Profiles are concatenated along the time dimension in the order given by the lising : if BGC data preceed CORE profiles, all BGC data will preceed CORE data. Use cycle_number variable to associate them again. | |
46 Vocabulary translations are below. On the left, the writing of each variable in the output file, on the right, all possible translations found over data exploration. | |
47 | |
48 Coordinates and variables : | |
49 | |
50 "lon" : ["longitude", "LONGITUDE", "LON", "lon"], | |
51 "lat" : ["latitude", "LATITUDE", "LAT", "lat"], | |
52 "time" : ["date", "time", "TIME", "JULD"], | |
53 "time_qc" : ["date_qc", "time_qc", "TIME_QC", "JULD_QC"], | |
54 "depth" : ["DEPTH", "depth"], | |
55 "cycle_number" : ["CYCLE_NUMBER"], | |
56 "ref_time" : ["REFERENCE_DATE_TIME"], | |
57 "pos_qc" : ["POSITION_QC",] | |
58 | |
59 -- | |
60 | |
61 "temperature" : ["TEMP", "TEMPERATURE"], | |
62 "salinity" : ["PSAL", "PRACTICAL_SALINITY"], | |
63 "oxygen" : ["DOXY"], | |
64 "pressure" : ["PRES"], | |
65 "chlorophylle" : ["CHLA"], | |
66 "nitrate" : ["NO3", "NITRATE", "n_an"], | |
67 "bbp700" : ["BBP700"], | |
68 | |
69 All variables are tagged with a suffix to indicate the state of the data (_raw, _dmadjusted, _rtadjusted) | |
70 | |
71 -- | |
72 | |
73 Variables extracted from meta files are : | |
74 | |
75 LAUNCH_DATE | |
76 PLATFORM_TYPE | |
77 PI_NAME meta variables are stored as metadata | |
78 | |
79 -- | |
80 | |
81 Units : | |
82 | |
83 "degree_east" : ["degree_east"], | |
84 "degree_north" : ["degree_north"], | |
85 "degree_celsius" : ["degree_celsius", "degree_Celsius"], # temperature | |
86 "psu" : ["psu", "practical_salinity_unit", "PSU"], # salinity | |
87 "micromol/l" : ["micromole.l-1", "micromole_per_liter", "μmol.l-1", "μmol/l", "μmol/L", "μmol.L-1"], # concentration liters | |
88 "mg/m3" : ["mg/m3",], | |
89 "micromole/kg" : ["micromole/kg",], | |
90 "m-1" : ["m-1"], | |
91 "decibar" : ["decibar", "dbar"], # pressure | |
92 | |
93 Arbitrarily, | |
94 | |
95 dates are written following : "seconds since 1950-01-01T00:00:00 in julian calendar" | |
96 longitude is set between : -180° and 180° | |
97 latitude is set between : -90° and 90° | |
98 | |
99 WARNING : This application works only platform by platform. For example, it is possible to aggregate and harmonize a whole argo trajectory but one at a time. If two argo trajectories are needed to process, this tool needs to be run 2 times. | |
100 | |
101 **Input** | |
102 | |
103 a list of files in a txt file named cerb_listing.txt copntaining only filenames without paths. The tool will find the files automatically. A listing example here : | |
104 | |
105 BD6901580_003.nc BD6901580_004.nc BD6901580_005.nc BD6901580_006.nc D6901580_003.nc D6901580_004.nc D6901580_005.nc D6901580_006.nc 6901580_meta.nc | |
106 | |
107 paths of where are the data (volumes) containing configurations, listings and data. Paths are : | |
108 | |
109 config path : where your textfile containing the list of files names is : it contains the listing cerb_listing.txt | |
110 data_path : highest folder including all the files to harmonize written in the textfile listing | |
111 | |
112 **Output** | |
113 | |
114 A concatenated and harmonized netcdf file | |
115 | |
116 ]]></help> | |
117 <citations> | |
118 <citation type="bibtex"> | |
119 @Manual{, | |
120 title = {QCV Ingester}, | |
121 author = {Pokapok}, | |
122 year = {2024}, | |
123 note = {https://gitlab.com/pokapok-projects/PKP8-OGI-BGC/qcv_ingester} | |
124 </citation> | |
125 </citations> | |
126 </tool> |