Comparative analysis of YOLOV5 and YOLOV8 for automated fish detection and classification in underwater environments
- Authors: Kuhlane, Luxolo
- Date: 2024-10-11
- Subjects: Artificial intelligence , Deep learning (Machine learning) , Machine learning , Neural networks (Computer science) , You Only Look Once , YOLOv5 , YOLOv8
- Language: English
- Type: Academic theses , Master's theses , text
- Identifier: http://hdl.handle.net/10962/464333 , vital:76502
- Description: The application of traditional manual techniques for fish detection and classification faces significant challenges, primarily stemming from their labour-intensive nature and limited scalability. Automating these kinds of processes through computer vision practices and machine learning techniques has emerged as a potential solution in recent years. With the development of and increase in ease of access to new technology in recent years, the use of a deep learning object detector known as YOLO (You Only Look Once) in the detection and classification of fish has steadily become notably popular. This thesis thus explores suitable YOLO architectures for detecting and classifying fish. The YOLOv5 and YOLOv8 models were evaluated explicitly for detecting and classifying fish in underwater environments. The selection of these models was based on a literature review highlighting their success in similar applications but remains largely understudied in underwater environments. Therefore, the effectiveness of these models was evaluated through comprehensive experimentation on collected and publicly available underwater fish datasets. In collaboration with the South African Institute of Biodiversity (SAIAB), five datasets were collected and manually annotated for labels for supervised machine learning. Moreover, two publicly available datasets were sourced for comparison to the literature. Furthermore, after determining that the smallest YOLO architectures are better suited to these imbalanced datasets, hyperparameter tuning tailored the models to the characteristics of the various underwater environments used in the research. The popular DeepFish dataset was evaluated to establish a baseline and feasibility of these models in the understudied domain. The results demonstrated high detection accuracy for both YOLOv5 and YOLOv8. However, YOLOv8 outperformed YOLOv5, achieving 97.43% accuracy compared to 94.53%. After experiments on seven datasets, trends revealed YOLOv8’s enhanced generalisation accuracy due to architectural improvements, particularly in detecting smaller fish. Overall, YOLOv8 demonstrated that it is the better fish detection and classification model on diverse data. , Thesis (MSc) -- Faculty of Science, Computer Science, 2024
- Full Text:
- Date Issued: 2024-10-11
- Authors: Kuhlane, Luxolo
- Date: 2024-10-11
- Subjects: Artificial intelligence , Deep learning (Machine learning) , Machine learning , Neural networks (Computer science) , You Only Look Once , YOLOv5 , YOLOv8
- Language: English
- Type: Academic theses , Master's theses , text
- Identifier: http://hdl.handle.net/10962/464333 , vital:76502
- Description: The application of traditional manual techniques for fish detection and classification faces significant challenges, primarily stemming from their labour-intensive nature and limited scalability. Automating these kinds of processes through computer vision practices and machine learning techniques has emerged as a potential solution in recent years. With the development of and increase in ease of access to new technology in recent years, the use of a deep learning object detector known as YOLO (You Only Look Once) in the detection and classification of fish has steadily become notably popular. This thesis thus explores suitable YOLO architectures for detecting and classifying fish. The YOLOv5 and YOLOv8 models were evaluated explicitly for detecting and classifying fish in underwater environments. The selection of these models was based on a literature review highlighting their success in similar applications but remains largely understudied in underwater environments. Therefore, the effectiveness of these models was evaluated through comprehensive experimentation on collected and publicly available underwater fish datasets. In collaboration with the South African Institute of Biodiversity (SAIAB), five datasets were collected and manually annotated for labels for supervised machine learning. Moreover, two publicly available datasets were sourced for comparison to the literature. Furthermore, after determining that the smallest YOLO architectures are better suited to these imbalanced datasets, hyperparameter tuning tailored the models to the characteristics of the various underwater environments used in the research. The popular DeepFish dataset was evaluated to establish a baseline and feasibility of these models in the understudied domain. The results demonstrated high detection accuracy for both YOLOv5 and YOLOv8. However, YOLOv8 outperformed YOLOv5, achieving 97.43% accuracy compared to 94.53%. After experiments on seven datasets, trends revealed YOLOv8’s enhanced generalisation accuracy due to architectural improvements, particularly in detecting smaller fish. Overall, YOLOv8 demonstrated that it is the better fish detection and classification model on diverse data. , Thesis (MSc) -- Faculty of Science, Computer Science, 2024
- Full Text:
- Date Issued: 2024-10-11
Towards an artificial intelligence-based agent for characterising the organisation of primes
- Authors: Oyetunji, Nicole Armlade
- Date: 2024-04-04
- Subjects: Numbers, Prime , Odd number , Machine learning , Deep learning (Machine learning) , Mathematical forecasting , Neural networks (Computer science) , Artificial intelligence
- Language: English
- Type: Academic theses , Master's theses , text
- Identifier: http://hdl.handle.net/10962/435389 , vital:73153
- Description: Machine learning has experienced significant growth in recent decades, driven by advancements in computational power and data storage. One of the applications of machine learning is in the field of number theory. Prime numbers hold significant importance in mathematics and its applications, for example in cryptography, owing to their distinct properties. Therefore, it is crucial to efficiently obtain the complete list of primes below a given threshold, with low relatively computational cost. This study extensively explores a deterministic scheme, proposed by Hawing and Okouma (2016), that is centered around Consecutive Composite Odd Numbers, showing the link between these numbers and prime numbers by examining their internal structure. The main objective of this dissertation is to develop two main artificial intelligence agents capable of learning and recognizing patterns within a list of consecutive composite odd numbers. To achieve this, the mathematical foundations of the deterministic scheme are used to generate a dataset of consecutive composite odd numbers. This dataset is further transformed into a dataset of differences to simplify the prediction problem. A literature review is conducted which encompasses research from the domains of machine learning and deep learning. Two main machine learning algorithms are implemented along with their variations, Long Short-Term Memory Networks and Error Correction Neural Networks. These models are trained independently on two separate but related datasets, the dataset of consecutive composite odd numbers and the dataset of differences between those numbers. The evaluation of these models includes relevant metrics, for example, Root Mean Square Error, Mean Absolute Percentage Error, Theil U coefficient, and Directional Accuracy. Through a comparative analysis, the study identifies the top-performing 3 models, with a particular emphasis on accuracy and computational efficiency. The results indicate that the LSTM model, when trained on difference data and coupled with exponential smoothing, displays superior performance as the most accurate model overall. It achieves a RMSE of 0.08, which significantly outperforms the dataset’s standard deviation of 0.42. This model exceeds the performance of basic estimator models, implying that a data-driven approach utilizing machine learning techniques can provide valuable insights in the field of number theory. The second best model, the ECNN trained on difference data combined with exponential smoothing, achieves an RMSE of 0.28. However, it is worth mentioning that this model is the most computationally efficient, being 32 times faster than the LSTM model. , Thesis (MSc) -- Faculty of Science, Mathematics, 2024
- Full Text:
- Date Issued: 2024-04-04
- Authors: Oyetunji, Nicole Armlade
- Date: 2024-04-04
- Subjects: Numbers, Prime , Odd number , Machine learning , Deep learning (Machine learning) , Mathematical forecasting , Neural networks (Computer science) , Artificial intelligence
- Language: English
- Type: Academic theses , Master's theses , text
- Identifier: http://hdl.handle.net/10962/435389 , vital:73153
- Description: Machine learning has experienced significant growth in recent decades, driven by advancements in computational power and data storage. One of the applications of machine learning is in the field of number theory. Prime numbers hold significant importance in mathematics and its applications, for example in cryptography, owing to their distinct properties. Therefore, it is crucial to efficiently obtain the complete list of primes below a given threshold, with low relatively computational cost. This study extensively explores a deterministic scheme, proposed by Hawing and Okouma (2016), that is centered around Consecutive Composite Odd Numbers, showing the link between these numbers and prime numbers by examining their internal structure. The main objective of this dissertation is to develop two main artificial intelligence agents capable of learning and recognizing patterns within a list of consecutive composite odd numbers. To achieve this, the mathematical foundations of the deterministic scheme are used to generate a dataset of consecutive composite odd numbers. This dataset is further transformed into a dataset of differences to simplify the prediction problem. A literature review is conducted which encompasses research from the domains of machine learning and deep learning. Two main machine learning algorithms are implemented along with their variations, Long Short-Term Memory Networks and Error Correction Neural Networks. These models are trained independently on two separate but related datasets, the dataset of consecutive composite odd numbers and the dataset of differences between those numbers. The evaluation of these models includes relevant metrics, for example, Root Mean Square Error, Mean Absolute Percentage Error, Theil U coefficient, and Directional Accuracy. Through a comparative analysis, the study identifies the top-performing 3 models, with a particular emphasis on accuracy and computational efficiency. The results indicate that the LSTM model, when trained on difference data and coupled with exponential smoothing, displays superior performance as the most accurate model overall. It achieves a RMSE of 0.08, which significantly outperforms the dataset’s standard deviation of 0.42. This model exceeds the performance of basic estimator models, implying that a data-driven approach utilizing machine learning techniques can provide valuable insights in the field of number theory. The second best model, the ECNN trained on difference data combined with exponential smoothing, achieves an RMSE of 0.28. However, it is worth mentioning that this model is the most computationally efficient, being 32 times faster than the LSTM model. , Thesis (MSc) -- Faculty of Science, Mathematics, 2024
- Full Text:
- Date Issued: 2024-04-04
- «
- ‹
- 1
- ›
- »