FenixEdu™

Dissertação

{en_GB=AI-based Visual Navigation Solution for Autonomous Underwater Vehicles} {} EVALUATED

Detalhes: {pt=Este trabalho foca-se em testar o impacto de introduzir uma abordagem com fundamentos de Inteligência Artificial em navegação visual para veículos totalmente autónomos. Para atingir estes objetivos, há vários obstáculos a ultrapassar. Primeiro, é essencial ter um dataset com pares de imagens e mascaras de segmentação, para treinar uma rede neuronal adequada a segmentação. Segundo, escolher uma rede neuronal adequada a tarefa de segmentar imagens, treiná-la e testá-la com diferentes parâmetros para determinar a configuração que nos trará melhores resultados. Por último, avaliar se estes métodos melhoram a performance de algoritmos de navegação visual. Para criar a base de dados, precisamos de imagens e de uma ferramenta que auxilie o processo de criação das máscaras ground truth. Esta ferramenta deve fazer uma boa estimativa de uma máscara de segmentação, que será posteriormente corrigida à mão para obter resultados mais precisos. O modelo de deep learning escolhido para a tarefa de segmentação foi uma Fully Convolutional Network (FCN), que e um robusto modelo de ponta. Com esta escolha foi possível atingir valores de pixel accuracy que rondam os 93% e Intersections over Union a rondar 85%. Esta solução foi testada em algoritmos de SLAM online e offline, ao sobrepor as imagens originais com as mascaras de segmentação. Verificou-se que a segmentação contribui para melhores resultados quando aplicada aos algoritmos de SLAM., en=This research aims at testing the impact of introducing an AI-based approach for visual navigation. To achieve this several challenges have to be overtaken. First, an annotated dataset with pairs of input images and segmentation grounds truths is essential for training an artificial neural network suited for semantic segmentation. Second, choosing a state of the art neural network for segmentation, then training and testing it with different parameters to determine what configuration provides the best results. Finally, evaluate if this methodology improves the accuracy of visual navigation and scene reconstruction algorithms, such as online and offline SLAM. To create the dataset, there is the need for a tool that outputs a good initial prediction of a segmentation mask, which should subsequently hand corrected for increased accuracy. The deep learning model used for semantic segmentation was a Fully Convolutional Netwok (FCN), which is a robust state of the art model, that takes a pre-trained convolutional neural network and augments the pixel-wise predictions to a segmentation map. This model achieved an accuracy close to 93%. This methodology was then tested on offline and online SLAM algorithms, by coupling the original frames with the respective segmentation masks yielded by the FCN. Using segmentation was useful in improving the results of SLAM algorithms.}
Keywords: {pt=Veículos Subaquáticos Autónomos, Deep Learning, Segmentação, SLAM, en=Autonomous Underwater Vehicles, Deep Learning, Segmentation, SLAM}

Discussão: outubro 1, 2021, 12:30