comparison bgc_harmonizer.xml @ 0:15260949227d draft default tip

planemo upload for repository https://github.com/galaxyecology/tools-ecology/tree/master/tools/bgc_ocean commit 124fc5205762aaf4f7b39b09a85b5b3fb918370d
author ecology
date Thu, 05 Dec 2024 14:16:02 +0000
parents
children
comparison
equal deleted inserted replaced
-1:000000000000 0:15260949227d
1 <tool id="harmonize_insitu_to_netcdf" name="QCV harmonizer" version="@TOOL_VERSION@+galaxy@VERSION_SUFFIX@" profile="22.01" license="MIT">
2 <description>and aggregator of insitu marine physical and biogeochemical data</description>
3 <macros>
4 <token name="@TOOL_VERSION@">1.1</token>
5 <token name="@VERSION_SUFFIX@">0</token>
6 </macros>
7 <requirements>
8 <container type="docker">pokapok/qcv_ingester:@TOOL_VERSION@</container>
9 </requirements>
10 <command detect_errors="exit_code"><![CDATA[
11 export HOME=\$PWD &&
12
13 #for $i, $infile in enumerate($infiles):
14 cp '$infile' '/runtime/data/original_data/work/ga_la_xy/${infile.element_identifier}' &&
15 #end for
16
17 /app/launchers/start-app.sh GALAXY &&
18
19 cp /runtime/data/harmonized_data/work/ga_la_xy_harm_agg.nc '$output_net'
20 ]]></command>
21 <inputs>
22 <param name="infiles" type="data" format="netcdf" multiple="true" label="Input the netcdf data files" help="This files can netcdf raw Argo or Gliders datafiles following CMEMS convention."/>
23 </inputs>
24 <outputs>
25 <data name="output_net" format="netcdf" from_work_dir="/runtime/data/harmonized_data/work/*.nc" label="${tool.name} netcdf data" />
26 </outputs>
27 <tests>
28 <test expect_num_outputs="1">
29 <param name="infiles" value="D6901758_001.nc,D6901758_002.nc,D6901758_003.nc,D6901758_004.nc,D6901758_005.nc"/>
30 <output name="output_net">
31 <assert_contents>
32 <has_size value="427535" delta="0"/>
33 </assert_contents>
34 </output>
35 </test>
36 </tests>
37 <help><![CDATA[
38
39 .. class:: infomark
40
41 **What it does**
42 General presentation
43
44 The cerb_harmonizer tool aggregates and harmonizes marine in-situ data following the needs of the UseCase 2.1-BCG - Fair-Ease. It converts files of individual or already aggregated data profiles into concatened single file with harmonized vocabullary needed for the project.
45 Profiles are concatenated along the time dimension in the order given by the lising : if BGC data preceed CORE profiles, all BGC data will preceed CORE data. Use cycle_number variable to associate them again.
46 Vocabulary translations are below. On the left, the writing of each variable in the output file, on the right, all possible translations found over data exploration.
47
48 Coordinates and variables :
49
50 "lon" : ["longitude", "LONGITUDE", "LON", "lon"],
51 "lat" : ["latitude", "LATITUDE", "LAT", "lat"],
52 "time" : ["date", "time", "TIME", "JULD"],
53 "time_qc" : ["date_qc", "time_qc", "TIME_QC", "JULD_QC"],
54 "depth" : ["DEPTH", "depth"],
55 "cycle_number" : ["CYCLE_NUMBER"],
56 "ref_time" : ["REFERENCE_DATE_TIME"],
57 "pos_qc" : ["POSITION_QC",]
58
59 --
60
61 "temperature" : ["TEMP", "TEMPERATURE"],
62 "salinity" : ["PSAL", "PRACTICAL_SALINITY"],
63 "oxygen" : ["DOXY"],
64 "pressure" : ["PRES"],
65 "chlorophylle" : ["CHLA"],
66 "nitrate" : ["NO3", "NITRATE", "n_an"],
67 "bbp700" : ["BBP700"],
68
69 All variables are tagged with a suffix to indicate the state of the data (_raw, _dmadjusted, _rtadjusted)
70
71 --
72
73 Variables extracted from meta files are :
74
75 LAUNCH_DATE
76 PLATFORM_TYPE
77 PI_NAME meta variables are stored as metadata
78
79 --
80
81 Units :
82
83 "degree_east" : ["degree_east"],
84 "degree_north" : ["degree_north"],
85 "degree_celsius" : ["degree_celsius", "degree_Celsius"], # temperature
86 "psu" : ["psu", "practical_salinity_unit", "PSU"], # salinity
87 "micromol/l" : ["micromole.l-1", "micromole_per_liter", "μmol.l-1", "μmol/l", "μmol/L", "μmol.L-1"], # concentration liters
88 "mg/m3" : ["mg/m3",],
89 "micromole/kg" : ["micromole/kg",],
90 "m-1" : ["m-1"],
91 "decibar" : ["decibar", "dbar"], # pressure
92
93 Arbitrarily,
94
95 dates are written following : "seconds since 1950-01-01T00:00:00 in julian calendar"
96 longitude is set between : -180° and 180°
97 latitude is set between : -90° and 90°
98
99 WARNING : This application works only platform by platform. For example, it is possible to aggregate and harmonize a whole argo trajectory but one at a time. If two argo trajectories are needed to process, this tool needs to be run 2 times.
100
101 **Input**
102
103 a list of files in a txt file named cerb_listing.txt copntaining only filenames without paths. The tool will find the files automatically. A listing example here :
104
105 BD6901580_003.nc BD6901580_004.nc BD6901580_005.nc BD6901580_006.nc D6901580_003.nc D6901580_004.nc D6901580_005.nc D6901580_006.nc 6901580_meta.nc
106
107 paths of where are the data (volumes) containing configurations, listings and data. Paths are :
108
109 config path : where your textfile containing the list of files names is : it contains the listing cerb_listing.txt
110 data_path : highest folder including all the files to harmonize written in the textfile listing
111
112 **Output**
113
114 A concatenated and harmonized netcdf file
115
116 ]]></help>
117 <citations>
118 <citation type="bibtex">
119 @Manual{,
120 title = {QCV Ingester},
121 author = {Pokapok},
122 year = {2024},
123 note = {https://gitlab.com/pokapok-projects/PKP8-OGI-BGC/qcv_ingester}
124 </citation>
125 </citations>
126 </tool>