Galaxy |

DocLayout-YOLO (version 0.0.4.1+galaxy0)

Yolo model:

Please upload a Yolo model.

Input image:

Please provide an input image for the analysis.

Confidence:

Set confidence threshold between 0.0 and 1.0 for drawing bounding boxes. Higher values indicate higher probablity of segmentation.

Image size:

Set input image size for image resize by Doclayout Yolo model. Larger values may provide better accuracy in segmentation but could be slower. Lower values might be faster with lower accuracy.

What it does

The tool takes a Yolo model trained for annotating bounding boxes around text. It takes a pretrained Yolo model and predicts bounding boxes in the input image where any text is found. It is based on document layout analysis: https://github.com/opendatalab/DocLayout-YOLO. The Yolo model can be downloaded from: https://huggingface.co/juliozhao/DocLayout-YOLO-DocLayNet-Docsynth300K_pretrained/tree/main

Input files

Yolo model (as .pt file)
Input image containing text
Confidence score to be used for drawing bounding boxes
Image size to be resized to by Yolo model

Output files

Segmented image
Coordinates of bounding boxes as Geojson file