Persian Text Detection using Wikipedia persian

GitHub page !

In the initial phase of this project, I aim to create an article scraper for Wikipedia in Persian. It will start from a random URL, extract all links on that page, then randomly open another page and repeat the process. Next, I aim to overlay text on different random images. Depending on the length of the text, I will break it into smaller chunks, and for each chunk, I will create images using a random font and draw the text at a random position on the image. I also create augmented images for better performance. Using a YOLO v5 model, The model will then take an image as input, with up to four possible locations of text and their bounding boxes. The model's output will be the number of detected bounding boxes and their coordinates. It was trained for maximum 4 text areas inside an image.
Some labels and prediction